期刊文献+

文本分类技术研究 被引量:13

Study on Text Categorization Techniques
在线阅读 下载PDF
导出
摘要 文本分类是文本挖掘的基础和核心。文中系统地介绍了文本分类过程中涉及的各种关键技术,对特征表示、特征提取、文本分类方法及分类模型评估进行了较为详细地论述。最后,提出了文本分类中存在的问题及今后的发展。 Text categorization is the basis and core of textual data mining. This survey introduces key techniques to every step of categorization process, including feature representation, feature extraction, the algorithms of text classification and model evaluation. Lastly, it presents some existing problems and future developments in text categorization field.
出处 《微计算机信息》 北大核心 2006年第05X期209-211,共3页 Control & Automation
基金 四川省重大基础研究项目(04JY029-001-4)
关键词 文本挖掘 文本分类 特征表示 特征抽取 模型评估 text mining text categorization feature representation feature extraction model evaluation
  • 相关文献

参考文献7

  • 1AH-HWEE TAN.Text Mining:The state of the art and the challenges [C].PAKDD'99 Workshop on Knowledge discovery from Advanced Databases (KDAD'99),Beijing,1999.
  • 2Fabrizio Sebastiani.Machine Learning in Automated Text Categorization[J].ACM Computing Sruveys,2002,34(1):1-47.
  • 3Yang Yiming,Pederson J O.A Comparative Study on Feature Selection in Text Categorization[C].Proceedings of the 14th International Conference on Machine learning.Nashville:Morgan Kanfmann,1997: 412-420.
  • 4Mlademnic,D.,Grobelnik,M.Feature Selection for unbalanced class distribution and Native Bayees [C].Proceedings of the Sisteenth International Conference on Machine Learning.Bled:Morgan Kanfmann, 1999:258-267.
  • 5Belur V D.Nearest Neighbor(NN)Norms:NN pattern Classification Techniques [J].IEEE Computer Society Press,New York:IEEE press, 1991.59.
  • 6Joachims T.Text Categorization with Support Vector Machines:Learning with Many Relevant Features [J].Machine Learning,1998,11398:137-142.
  • 7张先飞,李弼程,刘安斐.基于改进KNFL算法的海量文本分类研究[J].微计算机信息,2005,21(11S):159-160. 被引量:4

二级参考文献1

共引文献3

同被引文献93

引证文献13

二级引证文献62

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部