期刊文献+

基于分类器性能评价的Bagging文本分类算法 被引量:5

Bagging Text Classification Algorithm Based on Classifier Performance Evaluation
在线阅读 下载PDF
导出
摘要 提出一个文本分类器性能评价模型,对文本分类结果的可信度进行了估计,给出计算可信度的公式。将每一个子分类器的可信度指标用于Bagging集成学习算法,得到了改进的基于子分类器性能评价的Bagging算法(PBagging)。应用支持向量机作为子分类器基本模型,对日本共同社大样本新闻集进行分类。实验表明,与Bagging算法相比,PBagging算法分类准确率有了明显提高。 This paper presents an evaluation model for the text classifier. The reliability of classifying result of a classifier is computed according to its learning result and naive Bayesian. Based on the performance evaluation model, Performance Bagging(PBagging), an improved text classification algorithm is proposed. In the algorithm, the reliability is served as the weight of sub-classifier's result when using Bagging, an ensemble learning method. Using SVM as the sub-classifier model, it applies the PBagging algorithm to classify news corpus in kyodo news agent, the result shows that PBagging performs better than Bagging with more accuracy.
出处 《计算机工程》 CAS CSCD 北大核心 2008年第1期61-63,共3页 Computer Engineering
关键词 文本分类 分类器性能 评价模型 BAGGING算法 text classification classifier performance evaluation model Bagging algorithm
  • 相关文献

参考文献6

  • 1Dietterich T G.Ensemble Methods in Machine Learning[M].2nd ed.Cambridge,MA:MIT Press,2002.
  • 2Peter B,Yu Bin.Analysing Bagging[J].Annals of Statistics,2002,30(4):927-961.
  • 3林果为.诊断试验的研究与评价[J].诊断学理论与实践,2003,2(1). 被引量:8
  • 4Pauker S G,Kopelman R I.Interpreting Hoofbeats:Can Bayes Help Clear the Haze?[J].The New England Journal of Medicine,1993,328(4):290.
  • 5贾银山,贾传荧.一种加权支持向量机分类算法[J].计算机工程,2005,31(12):23-25. 被引量:20
  • 6Leo B.Bagging Predictors[J].Machine Learning,1996,24(2):123.

二级参考文献13

  • 1[2]Knottnerus JA,van Weel C,Muris JW.Evaluation of diagnostic procedures[J].BMJ,2002,324(7335):477-480.
  • 2[4]Sackett DL, Richardson WS,Rosenberg W,et al.Evidencebased Medicine, How to practice & teach EBM [M].Churchill Livingstone, 1997,P81-84.
  • 3[5]Sackett DL, Haynes RB. The architecture of diagnostic research[J].BNJ,2002,324(7336):539-541.
  • 4[7]Empson MB. Statistics in the pathology laboratory:characteristics of diagnostic tests [J].Pathology,2001,33 (11):93-95.
  • 5[12]Fletcher RH,Fletcher SW,Wagner EH. Clinical Epidemiology The Essentials. 3rd ed [M]. Baltimore:Williams & wilkins, 1996,P43-74.
  • 6Vapnik V. The Nature of Statistical Learning Theory[M].Springer-Verlag, 1995.
  • 7Cortes C, Vapnik V. Support Vector Networks[J]. Machine learning,1995, 20(3):273-297.
  • 8Scholkopf B, Smola A J. Williamson R C, et al. New Support Vector Algorithms[J]. Neural Computation, 2000, 12(5):1207-1245.
  • 9Scholkopf B, Smola A J. Learning with Kernels[M]. MIT Press, 2002.
  • 10Chew H G, Bogner R E, Lim C C. Dual-nu Support Vector Machine with Error Rate and Training Size Biasing[A]. Proceedings of the 26th International Conference on Acoustics, Speech and Signal Processing [C], IEEE, 200 1 :1269-1272.

共引文献26

同被引文献81

引证文献5

二级引证文献21

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部