期刊文献+

多标记文本分类中信息增益特征选择方法研究

Study on Information Gain Feature Selection in Multi-labeled Text Categorization
在线阅读 下载PDF
导出
摘要 针对信息增益特征选择方法没有很好考虑多标记的分布,在多标记文本分类中表现不佳的问题,用标记矩阵的协方差改善特征选择时标记之间的关联产生的影响,提高分类效果。最后通过实验证明,改进的信息增益特征选择方法具有可行性和有效性。 To solve the problem of the poor effect of information gain feature selection on the multi-labeled text categorization,which does not consider multi labeled distribution.Class corelation is taken into account and use multi-label covariance to improve IG feature selection in classification.The experimental results verify the efficiency and probability of the improved information gain feature selection in Multi-labeled text categorization.
作者 许朝阳
机构地区 莆田学院
出处 《廊坊师范学院学报(自然科学版)》 2012年第5期46-48,共3页 Journal of Langfang Normal University(Natural Science Edition)
基金 莆田市科技项目[2011G04(2)]支持
关键词 文本分类 多标记分类 信息增益 特征选择 协方差 text categorization Multi-labeled classification information gain feature selection Covariance
  • 相关文献

参考文献6

  • 1Yu K, Yu S, Tresp V. Multi-label informed latent semanticindexing[ C]. In Proceedings of the 28th annual interna-tional ACM SIGIR Conference on Research And develop-ment in information retrieval( SIGIR, 05 ) , Salvador, Brazil,2005:258-265.
  • 2M.-L. Zhang and Z.-H. Zhou. ML-kNN: A lazy learningapproach to multi-label learning[ J]. Pattern Recognition,2007,40(7):2038-2048.
  • 3Park C H. Dimension reduction using least squares regres-sion in multi-labeled text categorization[ C]. In Proceedingsof the 8th IEEE International Conference on Computer andInformation Technology (CIT) ,2008:71 - 76.
  • 4Zhang Min-ling, Pena J M, Robles V. Feature selection formulti-label naive Bayes classification [ J]. Information Sci-ences ,2009 ,(179):3218-3229.
  • 5G. Nadia, A. McCallum. Collective Multi-Label Classifica-tion [C]. In Proceedings of the 14Th ACM InternationalConference on Information and Knowledge Management,2005:195-200.
  • 6周志华,杨强.机器学习及其应用(2011)[M].北京:清华大学出版社,2011.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部