期刊文献+

嵌入误分类代价和拒识代价的二元分类算法 被引量:3

Binary Classification with Misclassification Cost and Reject Cost
在线阅读 下载PDF
导出
摘要 传统分类算法隐含的假设(每个样本的误差都具有相同的代价,且接受每个样本的分类结果)并不适用于医疗诊断、故障诊断、欺诈检测等领域的实际需求。在定义拒识代价的基础上,本文提出一种嵌入非对称误分类代价和非对称拒识代价的二元分类算法(CSVM-CM C2RC),包括以下4个步骤:学习代价敏感支持向量机、估计每个样本的后验概率、计算每个样本的分类可靠性、确定每类样本的最优拒识阈值。基于标准数据集的试验研究表明,CSVM-CM C2RC能有效地降低误分类率和平均代价,提高分类结果的可靠性。 To minimize "0-1" loss,most of conventional classification algorithms non-explicitly assume that all results of classification are accepted. However,the assumption is inappliable to knowledge extraction in such fields as medical diagnosis, fault diagnosis and fraud detection. In this paper ,the algorithm Cost-sensitive SVM with Class-dependent Misclassification Cost and Class-dependent Reject Cost (CSVM-CMC2RC) is proposed. In CSVM-CMCZRC algorithm,firstly,a cost-sensitive SVM is trained to obtain the preliminary classification results. Secondly, the post probability of each sample is computed. Thirdly,the classification reliability of each sample is estimated. Finally,the optimal reject threshold and the final reject decision are determined based on minimizing the average cost. Experimental results demonstrate that the proposed CSVM-CMC2RC algorithm can reduce the misclassification rate and average cost,and the classification reliability is improved.
出处 《广西师范大学学报(自然科学版)》 CAS 北大核心 2010年第3期104-108,共5页 Journal of Guangxi Normal University:Natural Science Edition
基金 国家自然科学基金资助项目(60905034) 浙江省自然科学基金资助项目(Y1080950) 国家公益行业专项资助(2007GYJ016) 云南省教育厅科学研究基金资助项目(08C0019)
关键词 非对称误分类代价 非对称拒识代价 代价敏感支持向量机 class-dependent misclassification cost class-dependent reject cost cost-sensitive SVM
  • 相关文献

参考文献17

  • 1VAPNIK V N. The nature of statistical learning theory[M]. New York:Springer-Verlag,1999:138-167.
  • 2王修信,秦丽梅,罗涟玲,农京辉,朱启疆.提高城市TM图像分类精度的两种方法比较[J].广西师范大学学报(自然科学版),2009,27(4):19-22. 被引量:7
  • 3DOMINGOS P. Metacost :a general method for making classifiers cost-sensitive [C]//Proceedings of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York:ACM Press, 1999:155-164.
  • 4SaNCHEZ D,VILA M A ,CERDA L ,et al. Association rules applied to credit card fraud detection[J]. Expert Systems with Applications, 2009,36 (2) : 3630-3640.
  • 5SRIVASTAVA A,KUNDU A,SURAL S,et al. Credit card fraud detection using hidden Markov model[J]. IEEE Transactions on Dependable and Secure Computing, 2008,5 (1) : 37-48.
  • 6周生明,廖元秀.Cost-Sensitive学习的一个新课题[J].广西师范大学学报(自然科学版),2007,25(4):55-58. 被引量:2
  • 7ELKAN C. The foundations of cost-sensitive learning[C]//Proeeedings of the 7th International Joint Conference on Artificial Intelligence. California : AAAI Press, 2001: 937-978.
  • 8凌晓峰,SHENG Victor S..代价敏感分类器的比较研究(英文)[J].计算机学报,2007,30(8):1203-1212. 被引量:35
  • 9ZHOU Zhi-hua,LIU Xu-ying. On multi-class cost-sensitive learning[C]//Proceedings of the 21st National Conference on Artificial Intelligence. California : AAAI Press, 2006,1 : 567-572.
  • 10LIU Xu-ying,ZHOU Zhi-hua. The influence of class imbalance on cost-sensitive learning:an empirical study[C]// Proceedings of the Sixth International Conference on Data Mining. Las Vegas :CSREA Press ,2006:970-974.

二级参考文献44

共引文献41

同被引文献34

  • 1郑恩辉,李平,宋执环.代价敏感支持向量机[J].控制与决策,2006,21(4):473-476. 被引量:35
  • 2杨晓伟,杨祖元,余舒.支持向量机在鼻咽癌患者5年生存状态预测中的应用[J].计算机应用与软件,2007,24(6):17-19. 被引量:1
  • 3Han J, Kamber M. Data mining: concepts and techniques[M~]. San Francisco CA= Morgan Kanfnann, 2001 = 29-31.
  • 4Huang G B, Zhu Q Y, Slew C K. Extreme learning machine : theo- ry and applications[J]. Neurocomputing, 2006 (70) : 489-501.
  • 5Huang G B, Zhu Q Y, Siew C K. Extreme learning machine: a new learning scheme of feedforward neural networks[C]~ // Pro- ceedings of International Joint Conference on Neural Networks. 2004:25-29.
  • 6Huang G B,Wang D H, Lan Y. Extreme Learning Machines: A Survey[J]. International Journal of Machine Learning and Cy bernetics,2011,2(2) : 107-122.
  • 7Huang G B, Ding X J, Zhou H M. Optimization method based extreme learning machine for classification[J]. Neurocompu- ting, 2010,74 : 155-163.
  • 8Lan Yuan,Yeng C-S, Huang Guang-bin. Ensemble of online se- quential extreme learning machine[J]. Neurocomputing, 2009, 73 (13 15)..3391-3395.
  • 9Tian Hui-xin,Meng Bo. A new modeling method based on bag- ging ELM for day-ahead electricity price prediction[C]//2010 IEEE Fifth International Conference Bio-Inspired Computing.. Theories and Applications (BIC-TA). Changsha, China, 2010.. 1076-1079.
  • 10Tian Hui-xin, Mao Zhi-zhong. An Ensemble ELM Based on Modified AdaBoost. RT Algorithm for Predicting the Tempera- lure of Molten Steel in Ladle Furnace[J]. Automation Science and Engineering, 2009,7 ( 1 ) ; 73-80.

引证文献3

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部