期刊文献+

一种不平衡支持向量机的校正方法

Revision method for imbalanced support vector machines
在线阅读 下载PDF
导出
摘要 针对支持向量机中两类不平衡数据的分离超平面的偏移问题提出一种校正方法:先对两类样本数据在核空间中进行核主成分分析,分别求出两类样本数据的在特征空间中的主要特征值;然后根据两样本容量以及各自的特征值所提供的信息,对两类数据给出惩罚因子比例;最后通过优化训练产生一个新的分离超平面。该分类面校正了标准支持向量机的分类误差,与标准的支持向量机相比,该方法不仅平衡了错分率,同时还能减少错分率。实验结果验证了方法的有效性。 A revision method was proposed for the offset of separation hyperplane of binary-classification imbalaneed data in Support Vector Machine (SVM). Firstly, the principal values were found respectively of the two classes of samples in feature space by using Kernel Principal Component Analysis (KPCA). Secondly, one penalty proportion was given based on the information provided by the sizes of the two sample data and their values. Finally, a new separation hyperplane was generated through the optimization training. The hyperplane revised the error of the standard support vector machines. Experiment results prove the validity of the method. Compared with standard support vector machines, the proposed method can not only balance but also decrease the classification error.
出处 《计算机应用》 CSCD 北大核心 2007年第12期2896-2898,共3页 journal of Computer Applications
基金 国家自然科学基金资助项目(60574075)
关键词 不平衡数据 核主成分分析 支持向量机 偏移 imbalanced data kernel principal component analysis Support Vector Machines (SVM) offset
  • 相关文献

参考文献9

  • 1JAPKOWICZ N, STEPHEN S. The class imbalance problem: A systematic study [J]. Intelligent Data Analysis, 2002, 6(5): 429 -449.
  • 2CHAWLA N V, BOWYER K W, HALL L O, et al. Smote: synthetic minority over-sampling technique [J]. Jounlal of Artificial Intelligence Research, 2002, 16(3) : 321 -357.
  • 3LING, C, LI C. Data mining for direct marketing problems and solutions [C]//Proceedings of the Fourth International Conference on knowledge Discovery and Data Ming. New York: AAAI Press, 1998 : 73 - 79.
  • 4KUBAT M, MATWIN S. Addressing the curse of imbalanced datasets [C] // One-sided Sampling Proceedings of the Fourteenth International Conference on Machine Learning. Nashville: Tennessee, 1997:178 - 186.
  • 5REHAN A, STEPHEN K, NATHALIE J. Applying support vector machines to imbalanced datasets [ C]// Fifteenth European Conference on Machines Learning. Berlin: Springer-Verlag, 2004:39 - 50.
  • 6郑恩辉,李平,宋执环.不平衡数据知识挖掘:类分布对支持向量机分类的影响[J].信息与控制,2005,34(6):703-708. 被引量:17
  • 7LIN Y, GRACEW Y L. Support vector machines for classification in nonstandard situations [J]. Machine Learning, 2002, 46(2) : 191 - 202.
  • 8贾银山,贾传荧.一种加权支持向量机分类算法[J].计算机工程,2005,31(12):23-25. 被引量:20
  • 9BLAKE C L, MERZ C J. UCI repository of machine learning databases [EB/OL]. [2007-05-02]. http://www. ics. uci. edu/-mleam/MLRepository.html.

二级参考文献21

  • 1Vapnik V. The Nature of Statistical Learning Theory[M].Springer-Verlag, 1995.
  • 2Cortes C, Vapnik V. Support Vector Networks[J]. Machine learning,1995, 20(3):273-297.
  • 3Scholkopf B, Smola A J. Williamson R C, et al. New Support Vector Algorithms[J]. Neural Computation, 2000, 12(5):1207-1245.
  • 4Scholkopf B, Smola A J. Learning with Kernels[M]. MIT Press, 2002.
  • 5Chew H G, Bogner R E, Lim C C. Dual-nu Support Vector Machine with Error Rate and Training Size Biasing[A]. Proceedings of the 26th International Conference on Acoustics, Speech and Signal Processing [C], IEEE, 200 1 :1269-1272.
  • 6Lin Chunfu, Wang Shengde. Fuzzy Support Vector Machines[J]. IEEE Transactions on Neural Networks, 2002, 13(2 ):464-471.
  • 7Wu Xiaoyun, Rohini S. New v-Support Vector Machines and Their Sequential Minimal Optimization[A]. Proceedings of the TwentiethInternational Conference on Machine Learning[C], AAAI Press,2003:824-831.
  • 8Chih-Chang, Chih-Jen Lin. LIBSVM: a Library for Support Vector Machine[CP/OL]. http:∥www, csie.ntu.tw/~cjlin/libsvm,2001.
  • 9Vapnik V N. The Nature of Statistical Learning Theory [ M ].New York, USA: Springer-Verlag, 1999.
  • 10Burges C. A tutorial on support vector machines from pattern recognition [J]. Data Mining and Knowledge Discovery, 1998, 2 (2) : 121 -167.

共引文献33

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部