期刊文献+

基于集成合并的文本特征提取方法 被引量:1

A TEXT FEATURE SELECTION METHOD BASED ON INTEGRATION AND COMBINATION
在线阅读 下载PDF
导出
摘要 文本分类是在给定的分类体系下,根据文本的内容自动确定文本类别的过程。在文本分类中,特征的提取对于分类的结果相当重要。从特征提取这一阶段出发,提出了一个集成合并的特征提取方法,该方法主要集成多种特征提取方法并合并关系密切的特征,并利用支持向量机SVM(Support Vector Machine)分类的高准确率,能够求出全局最优方法等优点来对得到的特征向量进行分类评估。实验证明,此种特征提取能够降低分类时间和提高分类的准确率。 Text categorization is the process that determines the category of the given text depends on its contents automatically. In text categorization, the feature selection is a very important process. So from the stage of feather selection, we post a feature selection method with integration and combination that assembles main methods for feature selection and gathers the correlated similar features together. At last we use the SVM ( Support Vector machine) to classify and evaluate the feature vectors we get, whose advantages are that of high accuracy and of getting best point by global optimized approach in whole space. In our experiment, we can reduce the classification time and get higher accuracy of classification using our selection method with SVM.
作者 褚力 张世永
出处 《计算机应用与软件》 CSCD 北大核心 2008年第10期212-213,233,共3页 Computer Applications and Software
关键词 文本特征 特征提取 支持向量机 Text feature Feature selection SVM
  • 相关文献

参考文献11

二级参考文献62

  • 1张先飞,李弼程,刘安斐.基于改进KNFL算法的海量文本分类研究[J].微计算机信息,2005,21(11S):159-160. 被引量:4
  • 2黄萱青 吴立德.独立于语种的文本分类方法[M].,2000.37-43.
  • 3鲁松 白硕 等.文本中词语权重计算方法的改进[M].,2000.31-36.
  • 4卜东波.聚类/分类理论研究及其在大模型文本挖掘的应用:博士论文[M].,2000..
  • 5李国正 王蒙 曾华军译.支持向量机导论[M].北京:电子工业出版社,2004-03..
  • 6Vladimir N Vapnik.An Overview of Statistical Learning Theory[J]. IEEE Transactions on Neural Networks, 1999; 10(5) :988-999.
  • 7Hsu C-W,Lin C-J.A Comparison of Methods for Muhiclass Support Vector Machine[J].lEEE Transactions on Neural Networks,2002; (13) : 415-425.
  • 8Boonserm Kijsirkul,Nitiwut Ussivakul.Muhiclass Support Vector Machines Using Adaptive Directed Acyclic Graph[C].In:IEEE/INNS International Joint Conference on Neural Networks( IJCNN-2002 ), 2002 : 980-985.
  • 9Vojtech Franc,Vaclav Hlavac.Muhi-class Support Vector Machine [C].In:16th International Conference on Pattern Recognition(lCPR'02), 2002 : 236-239.
  • 10黄萱菁,2000 International Conference on Multilingual Information Processing,2000年,37页

共引文献2953

同被引文献7

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部