
Design of a Text Classifier for the Mechanical Domain (cited by: 3)

Design of Mechanical Information Text Classifier
Abstract: This paper proposes a text classifier for the mechanical domain. Feature selection combines document-frequency-based feature extraction with grey relational degree calculation, which effectively reduces the feature dimensionality and weakens the correlation between feature words, creating the conditions for applying Bayesian classification. In the classification stage, a weighting factor based on class discrimination is introduced to optimize the naive Bayes classifier. Experiments show that the classifier effectively improves the recall and precision of text classification in the mechanical domain and works well in practice.
Source: Microelectronics & Computer (《微电子学与计算机》), indexed in CSCD and the Peking University Core Journals list, 2012, Issue 4, pp. 142-145 (4 pages)
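Funding: Natural Science Foundation of Shaanxi Province (2009JM8006); Special Scientific Research Project of the Shaanxi Provincial Department of Education (2010JK620)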
Keywords: mechanical domain; grey relational analysis; Bayesian classifier; feature selection
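The abstract names three building blocks (document-frequency feature selection, grey relational degree, and a weighted naive Bayes classifier) but gives no formulas. The following is only a minimal sketch of how such pieces might look, not the authors' implementation. Everything in it is an assumption made for illustration: the function names, the toy corpus, the `min_df` threshold, the single-pair form of Deng's grey relational degree with `rho=0.5`, and in particular the weighting rule in `class_discrimination_weights`, which is a hypothetical stand-in for the paper's class-discrimination factor.

```python
import math
from collections import Counter, defaultdict

def select_by_document_frequency(docs, min_df=2):
    """Keep terms that appear in at least min_df documents (min_df is an assumed threshold)."""
    df = Counter()
    for tokens, _label in docs:
        df.update(set(tokens))  # count each term once per document
    return sorted(t for t, n in df.items() if n >= min_df)

def grey_relational_degree(reference, comparison, rho=0.5):
    """Deng-style grey relational degree for one pair of equal-length sequences.

    Simplification: the min/max differences are taken over this pair only,
    not over a whole family of comparison sequences.
    """
    deltas = [abs(r - c) for r, c in zip(reference, comparison)]
    d_min, d_max = min(deltas), max(deltas)
    if d_max == 0:
        return 1.0  # identical sequences
    coeffs = [(d_min + rho * d_max) / (d + rho * d_max) for d in deltas]
    return sum(coeffs) / len(coeffs)

def train_naive_bayes(docs, vocab, alpha=1.0):
    """Multinomial naive Bayes with Laplace smoothing; returns log priors and log likelihoods."""
    vocab_set = set(vocab)
    class_docs = Counter()
    term_counts = defaultdict(Counter)
    for tokens, label in docs:
        class_docs[label] += 1
        term_counts[label].update(t for t in tokens if t in vocab_set)
    n_docs = sum(class_docs.values())
    log_prior = {c: math.log(n / n_docs) for c, n in class_docs.items()}
    log_like = {}
    for c in class_docs:
        total = sum(term_counts[c].values()) + alpha * len(vocab)
        log_like[c] = {t: math.log((term_counts[c][t] + alpha) / total) for t in vocab}
    return log_prior, log_like

def class_discrimination_weights(log_like, vocab):
    """Hypothetical weight: 1 plus the spread of a term's log likelihood across classes."""
    weights = {}
    for t in vocab:
        values = [per_class[t] for per_class in log_like.values()]
        weights[t] = 1.0 + (max(values) - min(values))
    return weights

def classify(tokens, log_prior, log_like, weights):
    """Pick the class with the highest weighted log posterior score."""
    scores = {
        c: log_prior[c] + sum(weights[t] * log_like[c][t] for t in tokens if t in weights)
        for c in log_prior
    }
    return max(scores, key=scores.get)

# Toy corpus (invented for illustration only): (tokens, class label)
docs = [
    (["gear", "shaft", "bearing"], "transmission"),
    (["gear", "bearing", "lubricant"], "transmission"),
    (["weld", "arc", "electrode"], "welding"),
    (["weld", "electrode", "flux"], "welding"),
]
vocab = select_by_document_frequency(docs, min_df=2)
log_prior, log_like = train_naive_bayes(docs, vocab)
weights = class_discrimination_weights(log_like, vocab)
predicted = classify(["gear", "bearing"], log_prior, log_like, weights)
```

In this sketch the weight multiplies each term's log likelihood, so terms whose likelihood differs strongly between classes contribute more to the decision. How the paper actually defines the weighting factor, and how it uses grey relational degree to prune correlated feature words, is not stated in the abstract.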
