期刊文献+
共找到998篇文章
< 1 2 50 >
每页显示 20 50 100
Text categorization based on fuzzy classification rules tree 被引量:2
1
作者 郭玉琴 袁方 刘海博 《Journal of Southeast University(English Edition)》 EI CAS 2008年第3期339-342,共4页
To deal with the problem that arises when the conventional fuzzy class-association method applies repetitive scans of the classifier to classify new texts,which has low efficiency, a new approach based on the FCR-tree... To deal with the problem that arises when the conventional fuzzy class-association method applies repetitive scans of the classifier to classify new texts,which has low efficiency, a new approach based on the FCR-tree(fuzzy classification rules tree)for text categorization is proposed.The compactness of the FCR-tree saves significant space in storing a large set of rules when there are many repeated words in the rules.In comparison with classification rules,the fuzzy classification rules contain not only words,but also the fuzzy sets corresponding to the frequencies of words appearing in texts.Therefore,the construction of an FCR-tree and its structure are different from a CR-tree.To debase the difficulty of FCR-tree construction and rules retrieval,more k-FCR-trees are built.When classifying a new text,it is not necessary to search the paths of the sub-trees led by those words not appearing in this text,thus reducing the number of traveling rules.Experimental results show that the proposed approach obviously outperforms the conventional method in efficiency. 展开更多
关键词 text categorization fuzzy classification association rule classification rules tree fuzzy classification rules tree
在线阅读 下载PDF
A New Approach of Feature Selection for Text Categorization 被引量:6
2
作者 CUI Zifeng XU Baowen +1 位作者 ZHANG Weifeng XU Junling 《Wuhan University Journal of Natural Sciences》 CAS 2006年第5期1335-1339,共5页
This paper proposes a new approach of feature selection based on the independent measure between features for text categorization. A fundamental hypothesis that occurrence of the terms in documents is independent of e... This paper proposes a new approach of feature selection based on the independent measure between features for text categorization. A fundamental hypothesis that occurrence of the terms in documents is independent of each other, widely used in the probabilistic models for text categorization (TC), is discussed. However, the basic hypothesis is incom plete for independence of feature set. From the view of feature selection, a new independent measure between features is designed, by which a feature selection algorithm is given to ob rain a feature subset. The selected subset is high in relevance with category and strong in independence between features, satisfies the basic hypothesis at maximum degree. Compared with other traditional feature selection method in TC (which is only taken into the relevance account), the performance of feature subset selected by our method is prior to others with experiments on the benchmark dataset of 20 Newsgroups. 展开更多
关键词 feature selection independency CHI square test text categorization
在线阅读 下载PDF
Comparison of Text Categorization Algorithms 被引量:4
3
作者 SHIYong-feng ZHAOYan-ping 《Wuhan University Journal of Natural Sciences》 EI CAS 2004年第5期798-804,共7页
This paper summarizes several automatic text categorization algorithms in common use recently, analyzes and compares their advantages and disadvantages. It provides clues for making use of appropriate automatic classi... This paper summarizes several automatic text categorization algorithms in common use recently, analyzes and compares their advantages and disadvantages. It provides clues for making use of appropriate automatic classifying algorithms in different fields. Finally some evaluations and summaries of these algorithms are discussed, and directions to further research have been pointed out. Key words text categorization - naive bayes - KNN - SVM - neural network CLC number TP 391 Foundation item: Supported by the National Natural Science Foundation of China (70031010) and the Research Foundation of Beijing Institute of TechnologyBiography: SHI Yong-feng (1980-), male, Master candidate, research direction: web information mining. 展开更多
关键词 text categorization naive bayes KNN SVM neural network
在线阅读 下载PDF
A Two-Stage Feature Selection Method for Text Categorization by Using Category Correlation Degree and Latent Semantic Indexing 被引量:2
4
作者 王飞 李彩虹 +2 位作者 王景山 徐娇 李廉 《Journal of Shanghai Jiaotong university(Science)》 EI 2015年第1期44-50,共7页
With the purpose of improving the accuracy of text categorization and reducing the dimension of the feature space,this paper proposes a two-stage feature selection method based on a novel category correlation degree(C... With the purpose of improving the accuracy of text categorization and reducing the dimension of the feature space,this paper proposes a two-stage feature selection method based on a novel category correlation degree(CCD)method and latent semantic indexing(LSI).In the first stage,a novel CCD method is proposed to select the most effective features for text classification,which is more effective than the traditional feature selection method.In the second stage,document representation requires a high dimensionality of the feature space and does not take into account the semantic relation between features,which leads to a poor categorization accuracy.So LSI method is proposed to solve these problems by using statistically derived conceptual indices to replace the individual terms which can discover the important correlative relationship between features and reduce the feature space dimension.Firstly,each feature in our algorithm is ranked depending on their importance of classification using CCD method.Secondly,we construct a new semantic space based on LSI method among features.The experimental results have proved that our method can reduce effectively the dimension of text vector and improve the performance of text categorization. 展开更多
关键词 text categorization feature selection latent semantic indexing(LSI) category correlation degree(CCD)
原文传递
Lazy learner text categorization algorithm based on embedded feature selection 被引量:1
5
作者 Yan Peng Zheng Xuefeng +1 位作者 Zhu Jianyong Xiao Yunhong 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2009年第3期651-659,共9页
To avoid the curse of dimensionality, text categorization (TC) algorithms based on machine learning (ML) have to use an feature selection (FS) method to reduce the dimensionality of feature space. Although havin... To avoid the curse of dimensionality, text categorization (TC) algorithms based on machine learning (ML) have to use an feature selection (FS) method to reduce the dimensionality of feature space. Although having been widely used, FS process will generally cause information losing and then have much side-effect on the whole performance of TC algorithms. On the basis of the sparsity characteristic of text vectors, a new TC algorithm based on lazy feature selection (LFS) is presented. As a new type of embedded feature selection approach, the LFS method can greatly reduce the dimension of features without any information losing, which can improve both efficiency and performance of algorithms greatly. The experiments show the new algorithm can simultaneously achieve much higher both performance and efficiency than some of other classical TC algorithms. 展开更多
关键词 machine learning text categorization embedded feature selection lazy learner cosine similarity.
在线阅读 下载PDF
A Text Categorization System with Soft Real-Time Guarantee 被引量:1
6
作者 WANG Hua-yong CHEN Yu DAI Yi-qi 《Wuhan University Journal of Natural Sciences》 EI CAS 2006年第1期226-229,共4页
In order to provide predictable runtime performante for text categorization (TC) systems, an innovative system design method is proposed for soft real time TC systems. An analyzable mathematical model is established... In order to provide predictable runtime performante for text categorization (TC) systems, an innovative system design method is proposed for soft real time TC systems. An analyzable mathematical model is established to approximately describe the nonlinear and time-varying TC systems. According to this mathematical model, the feedback control theory is adopted to prove the system's stableness and zero steady state error. The experiments result shows that the error of deadline satisfied ratio in the system is kept within 4 of the desired value. And the number of classifiers can be dynamically adjusted by the system itself to save the computa tion resources. The proposed methodology enables the theo retical analysis and evaluation to the TC systems, leading to a high-quality and low cost implementation approach. 展开更多
关键词 information retrieval text categorization soft real-time system feedback control theory
在线阅读 下载PDF
A formal study of feature selection in text categorization 被引量:15
7
作者 XU Yan 《通讯和计算机(中英文版)》 2009年第4期32-41,共10页
关键词 特征分类 约束 文本分类 信息
在线阅读 下载PDF
The Role of Rare Terms in Enhancing the Performance of Polynomial Networks Based Text Categorization
8
作者 Mayy M. Al-Tahrawi 《Journal of Intelligent Learning Systems and Applications》 2013年第2期84-89,共6页
In this paper, the role of rare or infrequent terms in enhancing the accuracy of English Text Categorization using Polynomial Networks (PNs) is investigated. To study the impact of rare terms in enhancing the accuracy... In this paper, the role of rare or infrequent terms in enhancing the accuracy of English Text Categorization using Polynomial Networks (PNs) is investigated. To study the impact of rare terms in enhancing the accuracy of PNs-based text categorization, different term reduction criteria as well as different term weighting schemes were experimented on the Reuters Corpus using PNs. Each term weighting scheme on each reduced term set was tested once keeping the rare terms and another time removing them. All the experiments conducted in this research show that keeping rare terms substantially improves the performance of Polynomial Networks in Text Categorization, regardless of the term reduction method, the number of terms used in classification, or the term weighting scheme adopted. 展开更多
关键词 POLYNOMIAL NETWORKS text categorization Document Classification Infrequent TERMS RARE TERMS
暂未订购
Smart Approaches to Efficient Text Mining for Categorizing Sexual Reproductive Health Short Messages into Key Themes
9
作者 Tobias Makai Mayumbo Nyirenda 《Open Journal of Applied Sciences》 2024年第2期511-532,共22页
To promote behavioral change among adolescents in Zambia, the National HIV/AIDS/STI/TB Council, in collaboration with UNICEF, developed the Zambia U-Report platform. This platform provides young people with improved a... To promote behavioral change among adolescents in Zambia, the National HIV/AIDS/STI/TB Council, in collaboration with UNICEF, developed the Zambia U-Report platform. This platform provides young people with improved access to information on various Sexual Reproductive Health topics through Short Messaging Service (SMS) messages. Over the years, the platform has accumulated millions of incoming and outgoing messages, which need to be categorized into key thematic areas for better tracking of sexual reproductive health knowledge gaps among young people. The current manual categorization process of these text messages is inefficient and time-consuming and this study aims to automate the process for improved analysis using text-mining techniques. Firstly, the study investigates the current text message categorization process and identifies a list of categories adopted by counselors over time which are then used to build and train a categorization model. Secondly, the study presents a proof of concept tool that automates the categorization of U-report messages into key thematic areas using the developed categorization model. Finally, it compares the performance and effectiveness of the developed proof of concept tool against the manual system. The study used a dataset comprising 206,625 text messages. The current process would take roughly 2.82 years to categorise this dataset whereas the trained SVM model would require only 6.4 minutes while achieving an accuracy of 70.4% demonstrating that the automated method is significantly faster, more scalable, and consistent when compared to the current manual categorization. These advantages make the SVM model a more efficient and effective tool for categorizing large unstructured text datasets. These results and the proof-of-concept tool developed demonstrate the potential for enhancing the efficiency and accuracy of message categorization on the Zambia U-report platform and other similar text messages-based platforms. 展开更多
关键词 Knowledge Discovery in text (KDT) Sexual Reproductive Health (SRH) text categorization text Classification text Extraction text Mining Feature Extraction Automated Classification Process Performance Stemming and Lemmatization Natural Language Processing (NLP)
在线阅读 下载PDF
基于TextRank算法和互信息相似度的维吾尔文关键词提取及文本分类 被引量:9
10
作者 阿力甫.阿不都克里木 李晓 《计算机科学》 CSCD 北大核心 2016年第12期36-40,共5页
针对维吾尔语文本的分类问题,提出一种基于TextRank算法和互信息相似度的维吾尔文关键词提取及文本分类方法。首先,对输入文本进行预处理,滤除非维吾尔语的字符和停用词;然后,利用词语语义相似度、词语位置和词频重要性加权的TextRank... 针对维吾尔语文本的分类问题,提出一种基于TextRank算法和互信息相似度的维吾尔文关键词提取及文本分类方法。首先,对输入文本进行预处理,滤除非维吾尔语的字符和停用词;然后,利用词语语义相似度、词语位置和词频重要性加权的TextRank算法提取文本关键词集合;最后,根据互信息相似度度量,计算输入文本关键词集和各类关键词集的相似度,最终实现文本的分类。实验结果表明,该方案能够提取出具有较高识别度的关键词,当关键词集大小为1250时,平均分类率达到了91.2%。 展开更多
关键词 维吾尔语 文本分类 关键词提取 textRank算法 互信息相似度
在线阅读 下载PDF
A Novel Active Learning Method Using SVM for Text Classification 被引量:26
11
作者 Mohamed Goudjil Mouloud Koudil +1 位作者 Mouldi Bedda Noureddine Ghoggali 《International Journal of Automation and computing》 EI CSCD 2018年第3期290-298,共9页
Support vector machines(SVMs) are a popular class of supervised learning algorithms, and are particularly applicable to large and high-dimensional classification problems. Like most machine learning methods for data... Support vector machines(SVMs) are a popular class of supervised learning algorithms, and are particularly applicable to large and high-dimensional classification problems. Like most machine learning methods for data classification and information retrieval, they require manually labeled data samples in the training stage. However, manual labeling is a time consuming and errorprone task. One possible solution to this issue is to exploit the large number of unlabeled samples that are easily accessible via the internet. This paper presents a novel active learning method for text categorization. The main objective of active learning is to reduce the labeling effort, without compromising the accuracy of classification, by intelligently selecting which samples should be labeled.The proposed method selects a batch of informative samples using the posterior probabilities provided by a set of multi-class SVM classifiers, and these samples are then manually labeled by an expert. Experimental results indicate that the proposed active learning method significantly reduces the labeling effort, while simultaneously enhancing the classification accuracy. 展开更多
关键词 text categorization active learning support vector machine (SVM) pool-based active learning pairwise coupling.
原文传递
TextCNN文本分类技术在OA系统中的应用研究 被引量:3
12
作者 皎海军 廖晨阳 +1 位作者 杜胜贤 于劲松 《办公自动化》 2020年第14期45-48,共4页
随着大数据的发展,传统的办公软件迎来新的发展趋势。本文将Text CNN深度学习网络引入政务便民服务的全电子化系统中,研究自然语言处理领域的文本分类技术与协同型OA系统融合的方法,以实现政府公文的分发推荐服务。本着辅助而不干预的原... 随着大数据的发展,传统的办公软件迎来新的发展趋势。本文将Text CNN深度学习网络引入政务便民服务的全电子化系统中,研究自然语言处理领域的文本分类技术与协同型OA系统融合的方法,以实现政府公文的分发推荐服务。本着辅助而不干预的原则,计算机的智能决策结果将清晰地反馈给公文分派员,以辅助其做出最终的判断。该服务解决了政府部门人员短缺,公文分发出错率高的问题有效减少退回率,加快了公文的流转效率。 展开更多
关键词 textCNN 协同型OA 自然语言处理(NLP) 文本分类
在线阅读 下载PDF
A fuzzy method to learn text classifier from labeled and unlabeled examples
13
作者 刘宏 黄上腾 《Journal of Harbin Institute of Technology(New Series)》 EI CAS 2004年第1期98-102,共5页
In text classification, labeling documents is a tedious and costly task, as it would consume a lot of expert time. On the other hand, it usually is easier to obtain a lot of unlabeled documents, with the help of some ... In text classification, labeling documents is a tedious and costly task, as it would consume a lot of expert time. On the other hand, it usually is easier to obtain a lot of unlabeled documents, with the help of some tools like Digital Library, Crawler Programs, and Searching Engine. To learn text classifier from labeled and unlabeled examples, a novel fuzzy method is proposed. Firstly, a Seeded Fuzzy c-means Clustering algorithm is proposed to learn fuzzy clusters from a set of labeled and unlabeled examples. Secondly, based on the resulting fuzzy clusters, some examples with high confidence are selected to construct training data set. Finally, the constructed training data set is used to train Fuzzy Support Vector Machine, and get text classifier. Empirical results on two benchmark datasets indicate that, by incorporating unlabeled examples into learning process, the method performs significantly better than FSVM trained with a small number of labeled examples only. Also, the method proposed performs at least as well as the related method-EM with Nave Bayes. One advantage of the method proposed is that it does not rely on any parametric assumptions about the data as it is usually the case with generative methods widely used in semi-supervised learning. 展开更多
关键词 text categorization FUZZY CLUSTERING
在线阅读 下载PDF
基于角度-振幅混合编码的量子神经网络及其应用研究
14
作者 杨帆 程学云 +3 位作者 朱鹏程 姜一博 顾晖 管致锦 《电子科技大学学报》 北大核心 2025年第5期789-800,共12页
传统量子神经网络与自注意机制结合的模型需消耗较高的量子位资源,针对其在当前NISQ设备上运行效率低和设计复杂性高的问题,提出了一种混合编码方式,将数据集特征通过特定的方式嵌入量子态中,从而实现角度编码与振幅编码的有效混合;基... 传统量子神经网络与自注意机制结合的模型需消耗较高的量子位资源,针对其在当前NISQ设备上运行效率低和设计复杂性高的问题,提出了一种混合编码方式,将数据集特征通过特定的方式嵌入量子态中,从而实现角度编码与振幅编码的有效混合;基于该编码方法设计出一种结构独特的双环Ansatz,借鉴自注意机制中的分而治之思想,构建出具备更高表现力的量子神经网络。在鸢尾花分类任务中训练损失值收敛于0,证明模型有效捕捉到鸢尾花特征之间的内在联系;在文本分类任务中与已有方法相比,分类精确度平均提升了8.9%,且在保证效果良好的前提下,成功减少了训练参数的数量。基于角度-振幅混合编码的量子神经网络的轻量化和低复杂度特性使其更适用于当前的NISQ设备。 展开更多
关键词 量子神经网络 混合编码 自注意机制 文本分类
在线阅读 下载PDF
Accelerated k-nearest neighbors algorithm based on principal component analysis for text categorization 被引量:3
15
作者 Min DU Xing-shu CHEN 《Journal of Zhejiang University-Science C(Computers and Electronics)》 SCIE EI 2013年第6期407-416,共10页
Text categorization is a significant technique to manage the surging text data on the Internet.The k-nearest neighbors(kNN) algorithm is an effective,but not efficient,classification model for text categorization.In t... Text categorization is a significant technique to manage the surging text data on the Internet.The k-nearest neighbors(kNN) algorithm is an effective,but not efficient,classification model for text categorization.In this paper,we propose an effective strategy to accelerate the standard kNN,based on a simple principle:usually,near points in space are also near when they are projected into a direction,which means that distant points in the projection direction are also distant in the original space.Using the proposed strategy,most of the irrelevant points can be removed when searching for the k-nearest neighbors of a query point,which greatly decreases the computation cost.Experimental results show that the proposed strategy greatly improves the time performance of the standard kNN,with little degradation in accuracy.Specifically,it is superior in applications that have large and high-dimensional datasets. 展开更多
关键词 k-nearest neighbors(kNN) text categorization Accelerating strategy Principal COMPONENT analysis(PCA)
原文传递
Non-Independent Term Selection for Chinese Text Categorization 被引量:2
16
作者 李景阳 孙茂松 《Tsinghua Science and Technology》 SCIE EI CAS 2009年第1期113-120,共8页
Chinese text categorization differs from English text categorization due to its much larger term set (of words or character n-grams), which results in very slow training and working of modern high-performance classi... Chinese text categorization differs from English text categorization due to its much larger term set (of words or character n-grams), which results in very slow training and working of modern high-performance classifiers. This study assumes that this high-dimensionality problem is related to the redundancy in the term set, which cannot be solved by traditional term selection methods. A greedy algorithm framework named "non-independent term selection" is presented, which reduces the redundancy according to string-level correlations. Several preliminary implementations of this idea are demonstrated. Experiment results show that a good tradeoff can be reached between the performance and the size of the term set. 展开更多
关键词 Chinese text categorization term selection dimentionality
原文传递
Stochastic Variational Inference-Based Parallel and Online Supervised Topic Model for Large-Scale Text Processing 被引量:1
17
作者 Yang Li Wen-Zhuo Song Bo Yang 《Journal of Computer Science & Technology》 SCIE EI CSCD 2018年第5期1007-1022,共16页
Topic modeling is a mainstream and effective technology to deal with text data, with wide applications in text analysis, natural language, personalized recommendation, computer vision, etc. Among all the known topic m... Topic modeling is a mainstream and effective technology to deal with text data, with wide applications in text analysis, natural language, personalized recommendation, computer vision, etc. Among all the known topic models, supervised Latent Dirichlet Allocation (sLDA) is acknowledged as a popular and competitive supervised topic model. How- ever, the gradual increase of the scale of datasets makes sLDA more and more inefficient and time-consuming, and limits its applications in a very narrow range. To solve it, a parallel online sLDA, named PO-sLDA (Parallel and Online sLDA), is proposed in this study. It uses the stochastic variational inference as the learning method to make the training procedure more rapid and efficient, and a parallel computing mechanism implemented via the MapReduce framework is proposed to promote the capacity of cloud computing and big data processing. The online training capacity supported by PO-sLDA expands the application scope of this approach, making it instrumental for real-life applications with high real-time demand. The validation using two datasets with different sizes shows that the proposed approach has the comparative accuracy as the sLDA and can efficiently accelerate the training procedure. Moreover, its good convergence and online training capacity make it lucrative for the large-scale text data analyzing and processing. 展开更多
关键词 topic modeling large-scale text classification stochastic variational inference cloud computing online learning
原文传递
文本分类数据增强技术对图书馆用户服务的赋能机制研究
18
作者 林慧娟 《移动信息》 2025年第10期253-255,共3页
文中针对图书馆用户服务中文本分类效率低、多语言支持不足等问题,提出基于数据增强技术的解决方案,通过构建回译转换、语义替换和样本合成三位一体的增强技术体系,结合4层架构(原始数据层-增强层-计算层-应用层)实现数据价值转化。实... 文中针对图书馆用户服务中文本分类效率低、多语言支持不足等问题,提出基于数据增强技术的解决方案,通过构建回译转换、语义替换和样本合成三位一体的增强技术体系,结合4层架构(原始数据层-增强层-计算层-应用层)实现数据价值转化。实验表明,该技术使咨询工单分类精确率达91.2%(提升19.5%),多语言识别覆盖,处理效率较传统模式提升17倍。在古籍数字化服务场景中,响应时间从72 h压缩至4 h,年度运营成本减少41万元。该成果为公共文化机构突破人力资源瓶颈、实现服务智能化转型提供了可复用的技术范式。 展开更多
关键词 数据增强技术 文本分类 图书馆用户服务
在线阅读 下载PDF
An incremental learning approach for the text categorization using hybrid optimization
19
作者 Mamta Kayest Sanjay Kumar Jain 《International Journal of Intelligent Computing and Cybernetics》 EI 2019年第3期333-351,共19页
Purpose–Document retrieval has become a hot research topic over the past few years,and has been paid more attention in browsing and synthesizing information from different documents.The purpose of this paper is to de... Purpose–Document retrieval has become a hot research topic over the past few years,and has been paid more attention in browsing and synthesizing information from different documents.The purpose of this paper is to develop an effective document retrieval method,which focuses on reducing the time needed for the navigator to evoke the whole document based on contents,themes and concepts of documents.Design/methodology/approach–This paper introduces an incremental learning approach for text categorization using Monarch Butterfly optimization–FireFly optimization based Neural Network(MB–FF based NN).Initially,the feature extraction is carried out on the pre-processed data using Term Frequency–Inverse Document Frequency(TF–IDF)and holoentropy to find the keywords of the document.Then,cluster-based indexing is performed using MB–FF algorithm,and finally,by matching process with the modified Bhattacharya distance measure,the document retrieval is done.In MB–FF based NN,the weights in the NN are chosen using MB–FF algorithm.Findings–The effectiveness of the proposed MB–FF based NN is proven with an improved precision value of 0.8769,recall value of 0.7957,F-measure of 0.8143 and accuracy of 0.7815,respectively.Originality/value–The experimental results show that the proposed MB–FF based NN is useful to companies,which have a large workforce across the country. 展开更多
关键词 text categorization Monarch butterfly optimization Neural network Holoentropy Cluster-based indexing Data mining
在线阅读 下载PDF
An up -to -date comparative analysis of the KNN classifier distance metrics for text categorization
20
作者 Onder Coban 《Data Science and Informetrics》 2023年第2期67-78,共12页
Text categorization(TC)is one of the widely studied branches of text mining and has many applications in different domains.It tries to automatically assign a text document to one of the predefined categories often by ... Text categorization(TC)is one of the widely studied branches of text mining and has many applications in different domains.It tries to automatically assign a text document to one of the predefined categories often by using machine learning(ML)techniques.Choosing the best classifier in this task is the most important step in which k-Nearest Neighbor(KNN)is widely employed as a classifier as well as several other well-known ones such as Support Vector Machine,Multinomial Naive Bayes,Logistic Regression,and so on.The KNN has been extensively used for TC tasks and is one of the oldest and simplest methods for pattern classification.Its performance crucially relies on the distance metric used to identify nearest neighbors such that the most frequently observed label among these neighbors is used to classify an unseen test instance.Hence,in this paper,a comparative analysis of the KNN classifier is performed on a subset(i.e.,R8)of the Reuters-21578 benchmark dataset for TC.Experimental results are obtained by using different distance metrics as well as recently proposed distance learning metrics under different cases where the feature model and term weighting scheme are different.Our comparative evaluation of the results shows that Bray-Curtis and Linear Discriminant Analysis(LDA)are often superior to the other metrics and work well with raw term frequency weights. 展开更多
关键词 text categorization k-nearest neighbor distance metric distance learning algorithms
原文传递
上一页 1 2 50 下一页 到第
使用帮助 返回顶部