期刊文献+

基于树桩网络的贝叶斯文本分类算法 被引量:4

Bayes Text Classification Algorithm Based on Stump Network
在线阅读 下载PDF
导出
摘要 分析贝叶斯文本分类算法的不足,提出相应的改进算法。放宽朴素贝叶斯文本分类模型中的属性独立性假设,采用一种改进的基于贝叶斯定理的文本分类模型"树桩网络",改进朴素贝叶斯文本分类模型。实验证明,改进后的文本分类模型适合于文本分类的需要,改善了原有分类器的性能。 This paper analyzes the shortcomings of Bayes and puts forward a better method to improve it. It releases attribute independence assumption of Naive Bayes text classifier. An improved text classification model based on Bayes theorem called stump network is presented to amend the Naive Bayes text classifier. Experiment shows that the revised text categorization model meets the need of text categorization, and improves the performance of former one.
出处 《计算机工程》 CAS CSCD 北大核心 2009年第16期201-202,205,共3页 Computer Engineering
关键词 文本分类 朴素贝叶斯 属性独立性假设 树桩网络 text classification Naive Bayes attribute independence assumption stump network
  • 相关文献

参考文献4

二级参考文献15

  • 1Mitchell T M.Machine Learning[M].McGraw-Hill Companies,1997:129-132.
  • 2Slonim N,Tishby N.The Power of Word Clusters for Text Classification[C]//Proc.of the 23th European Colloquium on Information Retrieval Research.2001.
  • 3Baker L D,McCallum A.Distributional Clustering of Words for Text Classification[C]//Proceedings of the 21th Annual International ACM SIGIR.1998:96-103.
  • 4Bekkerman R,El-Yaniv R,Winter Y,et al.On Feature Distributional Clustering for Text Categorization[C]//Proc.of ACM SIGIR'01.2001:146-153.
  • 5Dhillon I S,Mallela S,Kumar R.A Divisive Information-theoretic Feature Clustering Algorithm for Text Classification[J].Journal of Machine Learning Research,2003,3:1265-1287.
  • 6Davidson I,Satyanarayana A.Speeding Up k-means Clustering by Bootstrap Averaging[C]//Proceedings of the 3rd IEEE International Conference on Data Mining.2003:16-25.
  • 7Pereira F,Tishby N,Lee L.Distributional Clustering of English Words[C]//Proc.of the 31th Annual Meeting of the ACL.1993:183-190.
  • 8Lang K.Newsweeder:Learning to Filter News[C]//Proceedings of the 12th International Conference on Machine Learning.1995:331-339.
  • 9McCallum A K.Bow:A Toolkit for Statistical Language Modeling,Text Retrieval,Classification and Clustering[Z].(1996).http://www.cs.cmu.edu/~mccallum/bow.
  • 10Lewis D D.An Evaluation of Phrasal and Clustered Representations on a Text Categorization Task[C]//Proceedings of the 15th ACM International Conference on Research and Development in Information Retrieval.New York,USA:ACM Press,1992.

共引文献4

同被引文献37

引证文献4

二级引证文献30

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部