期刊文献+

基于样本差异度的SVM训练样本缩减算法 被引量:6

Algorithm for reduction SVM training sample based on sample dissimilarity
在线阅读 下载PDF
导出
摘要 为了对大规模训练样本进行缩减,提出了k近邻向量,给出了一种新的样本差异度的计量方法,证明了该差异度关于噪声识别和类边界距离的几个性质。依据此性质提出了一个高效的SVM训练样本缩减算法,算法首先根据样本差异度的性质剔除噪声样本,然后用类间差异度近似表示类边界距离,结合样本相似性,直接从原始样本空间剔除次要的训练样本。仿真结果表明,减样算法可以有效缩减样本,提高训练效率。 To reduce large-scale training sample set, the concept of k-nearest vectors is proposed, and a new account method for dissimilarity is given accordingly. Then, the paper proposes and proves the methods of noise identification and boundaries distance description. Based on these methods, an efficient sample reduction algorithm is proposed. The algorithm removes noise samples according to the dissimilarity at first step, then according to the similarity of samples, and the dissimilarity which describes the distance between sample and classification boundary, the algorithm removes minor training samples from the original sample space directly. Experiments indicate that the reduction algorithm can effectively reduce the sample, and improve the training efficiency.
出处 《计算机工程与应用》 CSCD 2012年第7期20-22,共3页 Computer Engineering and Applications
基金 国家自然科学基金(No.61005010) 安徽省高校省级自然基金(No.KJ2012B149) 合肥学院人才科研基金(No.11RC06)
关键词 大规模样本集 减样 去噪 支持向量机 样本差异度 large-scale sample set samples reduction de-noising support vector machine sample dissimilarity
  • 相关文献

参考文献9

  • 1Lina H J, Yeh J EOptimal reduction of solutions for support vector machines[J].Applied Mathematics and Computation,2009, 214(2) :329-335.
  • 2李红莲,王春花,袁保宗,朱占辉.针对大规模训练集的支持向量机的学习策略[J].计算机学报,2004,27(5):715-719. 被引量:53
  • 3Chan Guangxi, Xu Jian, Xiang Xiaolin.Neighborhood preprocessing SVM for large-scale data sets classification[C]//Fifth International Conference on Fuzzy Systems and Knowledge Discovery, Shandong, 2008,2: 245-249.
  • 4Wang Jigang, Neskovic P, Cooper L N.Training data selection for support vector machines[C]//lst International Conference on Advances in Natural Computation,ICNC,2005:554-564.
  • 5罗瑜,易文德,王丹琛,何大可.大规模数据集下支持向量机训练样本的缩减策略[J].计算机科学,2007,34(10):211-213. 被引量:13
  • 6刘万里,刘三阳,薛贞霞.基于距离核函数的除噪和减样方法[J].系统工程理论与实践,2008,28(7):160-164. 被引量:4
  • 7Li Yuangui,Hu Zhonghui, Cai Yunze, et al.Support vector based prototype selection method for nearest neighbor rules[C]//Lectute Notes in Computer Science 3610: ICNC.Berlin: Springer, 2005: 528-535.
  • 8Zhang Ling, Zhang Bo.Relational between support vector set and kernel functions in SVM[J].Joumal of Computer Science & Technology,2002,17(5) :549-555.
  • 9李永丽,任辉明,董立岩,李威,陈思国,赵宇.基于数据模式聚类算法的离群点检测[J].吉林大学学报(理学版),2007,45(3):435-437. 被引量:3

二级参考文献31

  • 1李红莲,王春花,袁保宗,朱占辉.针对大规模训练集的支持向量机的学习策略[J].计算机学报,2004,27(5):715-719. 被引量:53
  • 2Hearst M.A., Dumais S.T., Osman E., Platt J., Scholkopf B.. Support vector machines. IEEE Intelligent Systems, 1998, 13(4): 18~28
  • 3Vapnik V.N.. An overview of statistical learning theory. IEEE Transactions on Neural Networks, 1999, 10(5): 988~999
  • 4Vapnik V.N.. Statistical Learning Theory.2nd ed..New York: Springer-Verlag, 1999
  • 5Müller Klaus-Robert, Mika Sebastian, Rtsch Gunnar, Tsuda Koji, Schlkopf Bernhard. An introduction to kernel-based learning algorithms. IEEE Transactions on Neural Networks, 2001, 12(2): 181~201
  • 6Burges C.J.C.. A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery, 1998, 2(2): 121~167
  • 7Ke Hai-Xin,Zhang Xue -Gong.Editing support vector machines. In: Proceedings of the International Joint Conference on Neural Networks, Washington, DC, 2001, 2: 1464~1467
  • 8Charu C A,Philip S Y.Outlier Detection for High Dimensional Data[C]//Proceedings of the ACM Sigmod International Conference on Management of Data.Santa Barbara,CA:ACM Press,2001:37-47.
  • 9Edwin M K,Raymond T N,Vladimir Tucakov.Distance-based Outliers:Algorithms and Applications[J].VLDB Journal,2000,324(8):237-253.
  • 10VapnikVN.统计学习理论的本质[M].清华大学出版社,2000

共引文献66

同被引文献58

  • 1雷绍兰,孙才新,周湶,张晓星.电力短期负荷的多变量时间序列线性回归预测方法研究[J].中国电机工程学报,2006,26(2):25-29. 被引量:96
  • 2葛海峰,林继鹏,刘君华,丁晖.基于支持向量机和小波分解的气体识别研究[J].仪器仪表学报,2006,27(6):573-578. 被引量:13
  • 3高建良,徐勇军,李晓维.基于加权中值的分布式传感器网络故障检测(英文)[J].软件学报,2007,18(5):1208-1217. 被引量:39
  • 4VAPNIK V N.统计学习理论[M].许建华,张学工,译.北京:电子工业出版社,2004.
  • 5Shi B, Li Y X, Yu X H, et al.A modified particle swarm optimization and radial basis function neural network hybrid algorithm model and its application[C]~~2009 WRI Global Congress on Intelligent Systems, 2009,1 : 134-138.
  • 6Wang H,Li B S,Han X Y,et al.Study of neural networks for electric power load forecasting[C]//The 3rd International Symposium on Neural Networks,2010: 1277-1283.
  • 7Chuang Li-Yeh, Tsai Sheng-Wei, Yang Cheng-hong.Chaotic catfish particle swarm optimization for solving global numeri- cal optimization problems[J].Applied Mathematics and Com- putation, 2011,217 : 6900-6916.
  • 8Shi B, Li Y X, Yu X H, et al.A modified particle swarm optimization and radial basis function neural network hybrid algorithm model and its application[C]~~2009 WRI Global Congress on Intelligent Systems, 2009,1 : 134-138.
  • 9Wang H,Li B S,Han X Y,et al.Study of neural networks for electric power load forecasting[C]//The 3rd International Symposium on Neural Networks,2010: 1277-1283.
  • 10Chuang Li-Yeh, Tsai Sheng-Wei, Yang Cheng-hong.Chaotic catfish particle swarm optimization for solving global numeri- cal optimization problems[J].Applied Mathematics and Com- putation, 2011,217 : 6900-6916.

引证文献6

二级引证文献75

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部