Abstract
This paper improves the key stages of the text classification process. In the Chinese word segmentation stage, both the speed and the accuracy of segmentation are improved. In the feature selection stage, several feature selection methods are combined. Finally, the classifier is optimized, and the results of experimental tests are reported. The results show that the efficiency of text classification is indeed improved.
Source
《情报理论与实践》
CSSCI
PKU Core Journals (北大核心)
2009, No. 5, pp. 95-98 (4 pages)
Information Studies: Theory & Application
Keywords
Chinese word segmentation
feature selection
text classification