期刊文献+

最小二乘支持向量机的一种非均衡数据分类算法 被引量:3

Research on Classification of Imbalanced Data Based on Least Squares Support Vector Machine
在线阅读 下载PDF
导出
摘要 为了提高支持向量机的非平衡数据分类能力,分析了最小二乘支持向量机的本质特征,提出了一种基于中心距离比的非平衡数据分类算法,同时通过修剪边界样本,解决了最小二乘支持向量机缺失稀疏性的问题.在UCI标准数据集上进行的试验表明:该算法能够有效地提高支持向量机对非均衡分布数据的正确性,且该算法在不影响训练精度的前提下,可以得到稀疏解,算法的训练速度也有了一定的提高. To improve the classification performance of unbalanced datasets, the nature characteristics of sparse least squares support vector machines (LS-SVM) was analyzed and an algorithm based on center distance ratio for the unbalanced samples was proposed. Meanwhile, the problem of sparseness lacking in the least squares support vector machines was solved by pruning the boundary samples. The new algorithm was tested on the UCI datasets. The results indicate that this method can effectively improve the classification accuracy of LS-SVM for the unbalanced samples, the proposed algorithm can properly obtain the sparse solutions without affecting the capacity of classification, and the speed of classifiers is also improved.
出处 《微电子学与计算机》 CSCD 北大核心 2010年第4期33-37,共5页 Microelectronics & Computer
基金 国家自然科学基金项目(70671035) 河南省重点科技攻关项目(082102210015)
关键词 最小二乘支持向量机 非均衡数据分类 稀疏性 least squares support vector machines unbalanced data classification sparseness
  • 相关文献

参考文献10

二级参考文献35

  • 1曾文华,马健.支持向量机增量学习的算法与应用[J].计算机集成制造系统-CIMS,2003,9(Z1):144-148.
  • 2Vapnik V N,The nature of statistical learning [M].Theory Second Edition,New York : Springer,2000.
  • 3Chew Hong-Gunn,Crisp D J,Bogner R E et al.Target detection in radar imagery using support vector machines with training size biasing[C].In: Proceedings of the sixth international conference on control,Automation,Robotics and Vision,Singapore,CD-ROM,2000.
  • 4Lin Chun-Fu,Wang Sheng-De.Fuzzy Support Vector Machines[J]. IEEE Transactions on Neural Networks, 2002; 13 ( 2 ) : 464-471.
  • 5Xu P,Chan A K.An efficient algorithm on multi-class support vector machine model selection [C].In :Proceedings of the International Joint Conference on Neural Networks 2003,Portland,2003:3229-3232.
  • 6Jen-Hao Lee,Chih-Jen Lin.Automatic model selection for support vector machines [R].Department of Computer Science and Information Engineering,National Taiwan University,2000.
  • 7Blake C,Merz C.UCI repository of machine learning databases[DB/ OL].University of California,Irvine, http ://www.ics.uci.edu/mlearn/ML- Repository.html, 1998.
  • 8VapnikVN著 张学工译.统计学习理论的本质[M].北京:清华大学出版社,2000..
  • 9NelloCristianini JohnShawe-Taylor 李国正 王猛 曾华军译.支持向量机导论[M].北京:电子工业出版社,2004..
  • 10Vapnik V N. The Nature of Statistical Learning Theory[M]. New York: Springer Verlag, 1995.

共引文献109

同被引文献17

  • 1凌晓峰,SHENG Victor S..代价敏感分类器的比较研究(英文)[J].计算机学报,2007,30(8):1203-1212. 被引量:35
  • 2Friedman J H, Olshen R A, Stone C J, et al. Classifica- tion and regression trees[M]. American Statistical Asso- ciation: The Film House, 1986.
  • 3Elkan (2. The foundations of cost- sensitive learning [C]//Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence (IJCAI' 01). Wash- ington DC, 2001 : 973-978.
  • 4Ciraco M, Rogalewski M, Weiss G. Improving classifier utility by altering the misclassification cost ratio[C]//the 1st International Workshop on Utility-based Data Mining. New York, 2005 : 46-52.
  • 5Fan W, Stofol S, Zhang J X. Ada cost: misclassification cost--sensitive boosting[C]//Proc of the 16th lnt' 1 Conf on Machine Lming. Slovenia: Bled, 1999 : 97-105.
  • 6Maloof M. Learning when data sets are imbalanced and when costs are unequal and unknown[C]// Working Notes of the ICML'03 Workshop on Learning from Im- balanced Data Sets. Washingtzon, DC. 2003.
  • 7The Center for Machine Learning and Intelligent Systems. UC irvine machine learning repository[DB/OL]. [1989-01-01]. http://archive, ics. uci. edu/ml/dataset: html.
  • 8Chawla N V, Bowyer K W, Hall L O, et al. Smote: synthetic minority over-sampling technique[J]. Journal of Artificial Intelligence Research, 2002, 16 (3) : 321 - 357.
  • 9Pawlak Z. Rough sets[J]. International Journal of In- formation and Computer Science, 1982, 11(5): 314- 356.
  • 10An A J, Stefanowski, Ramanna S, et al. Rough sets, fuzzy sets, data mining and granular computing[C]// RSFDGrC 2007. Toronto: Springer, 2007.

引证文献3

二级引证文献33

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部