期刊文献+

一种安全的基于分歧的半监督分类算法 被引量:2

A Safe Semi-supervised Classification Algorithm Based on Disagreement
在线阅读 下载PDF
导出
摘要 为提高半监督分类的性能,提出一种安全的基于分歧的半监督分类算法Safe Co-SSC。通过有标记样本训练3个有监督分类器,利用无标记样本的信息增加分类器的差异性,采取3个分类器加权投票的策略实现对无标记样本的伪标记;对伪标记样本进行二次验证,选用能使分类器误差减小的新增标记样本扩充标记样本集。保证新样本的添加既减小了分类器的分类误差,又提高了分类器的分歧性。对UCI数据集进行分类实验的结果表明,该算法具有较高的分类率和样本标记率。 In order to improve the performance of semi-supervised classifier , a safe disagreement-based semi-supervised classifica-tion algorithm named Safe Co-SSC was proposed .The limited labeled samples were divided into three equal training sets and used to train three classifiers by a supervised learning algorithm .A large number of unlabeled samples were used to increase the differences be-tween the classifiers and the weighted voting strategy was used to achieve pseudo -labeled for unlabeled samples .Passing through sec-ondary verification, the ones making classifier error minimum were added into the labeled samples set .Finally, the experiment was car-ried out on the UCI data set , the results showed that the proposed algorithm had higher classification rate and sample labeling rate .
作者 赵建华
出处 《西华大学学报(自然科学版)》 CAS 2014年第5期1-6,共6页 Journal of Xihua University:Natural Science Edition
基金 陕西省教育厅科研计划项目(12JK0748) 商洛学院科研基金(10sky1001)
关键词 半监督学习 分类 安全性 分歧 semi-supervised learning classification safety disagreement
  • 相关文献

参考文献18

  • 1ZHU X J. Semi-supervised Learning Literature Survey[ R/OL]. University of Wisconsin, Madison Department of Computer Sciences ,2012 -03 - 15 [ 2014 - 03 - 15 ] http://diqital, library, wisc. edu/ 1793/60444.
  • 2李昆仑,曹铮,曹丽苹,张超,刘明.半监督聚类的若干新进展[J].模式识别与人工智能,2009,22(5):735-742. 被引量:51
  • 3Zhou Z H, Li M. Semi-supervised Learning by Disagreement [ J ]. Knowledge and Information Systems, 2010, 24 (3) : 415 - 439.
  • 4周志华.基于分歧的半监督学习[J].自动化学报,2013,39(11):1871-1878. 被引量:89
  • 5Blum A, Mitchell T. Combining Labeled and Unlabeled Data with Co-training. [ C ]//Proceedings of the 11 th Annual Conference on Computational Learning Theory ( COLT' 98 ). Wisconsin, USA: ACM, 1998:92 - 100.
  • 6Zhou Z H, Li M. Tri-training : Exploiting Unlabeled Data using Three Classifiers [ J ]. IEEE Transactions on Knowledge and Data Engi- neering,2005,17 ( 11 ) : 1529 - 1541.
  • 7Wang W, Zhou Z H. Co-Training with Insufficient Views [ C ]// Asian Conference on Machine Learning. Dallas, TX, USA : 1EEE, 2013 : 467 - 482.
  • 8Zhou Z H. Unlabeled Data and Multiple Views [ J ]. Partially Supervised Learning Lecture Notes in Computer Science, 2012,7081 : 1 - 7.
  • 9Darnsta dt M, Simon H U, Szo renyi B. Supervised Learning and Co-training[ J ]. Theoretical Computer Science, 2014, 519:68 -87.
  • 10Cheng J, Wang K. Active Learning for Image Retrieval with Co-SVM[ J]. Pattern Recognition, 2007, 40( 1 ) : 330 - 334.

二级参考文献163

  • 1杨燕,靳蕃,Mohamed Kamel.一种基于蚁群算法的聚类组合方法[J].铁道学报,2004,26(4):64-69. 被引量:39
  • 2王卫平,杨杰.基于蚁群智能的客户群偏好分析方法[J].管理科学,2005,18(4):54-57. 被引量:2
  • 3Olivier C, Bernhard S, Alexander Z. Semi-Supervised Learning. Cambridge, USA : MIT Press, 2006 : 3 - 10.
  • 4Blum A, Mitchell T. Combining Labeled and Unlabeled Data with Co-Training//Proe of the 11th Annual Conference on Computational Learning Theory. Madison, USA, 1998 : 92 - 100.
  • 5Zhong Shi. Semi-Supervised Model-Based Document Clustering: A Comparative Study. Machine Learning, 2006, 65 ( 1 ) : 3 - 29.
  • 6Wagstaff K, Cardie C, Rogers S, et al. Constrained K-means Clustering with Background Knowledge // Proc of 18th International Conference on Machine Learning. San Francisco, USA, 2001:577 -584.
  • 7Wagstaff K, Cardie C. Clustering with Instance-Level Constraints// Proc of the 17th International Conference on Machine Learning. SanFrancisco, USA, 2000:1103 - 1110.
  • 8Huang Desheng, Pan Wei. Incorporating Biological Knowledge into Distance-Based Clustering Analysis of Micro Array Gene Expression Data. Bioinformatics, 2006, 22 (10) : 1259 - 1268.
  • 9Tari L, Baral C, Kim S. Fuzzy C-Means Clustering with Prior Biological Knowledge. Journal of Biomedical Informatics, 2009, 42 (1): 74-81.
  • 10Ceccarelli M, Maratea A. Improving Fuzzy Clustering of Biological Data by Metric Learning with Side Information. International Journal of Approximate Reasoning, 2008, 47 ( 1 ) : 45 - 57.

共引文献155

同被引文献30

  • 1ZHU X J.Semi-supervised Learning Literature Survey[R].Madison:University of Wisconsin,2008.
  • 2孙雁飞,张顺颐,亓晋,等.一种基于半监督学习的GASOM聚类方法:中国,201010576193[P].2011-04-20.
  • 3Shen Furao,Yu Hui,Sakurai Keisuke,et al.An Incremental Online Semi-supervised Active Learning Algorithm Based on Self-organizing Incremental Neural Network[J].Neural Computing&Applications,2011,20(7):1061-1074.
  • 4Astudillo Cesar A,John Oommen B.On Achieving Semi-supervised Pattern Recognition by Utilizing Tree-based SOMs[J].Pattern Recognition,2013,46:293-304.
  • 5HAGAN M T,DEMUTH H B,BEALE M H.Neural Network Design[M].Beijing:China Machine Press,2002:64-85.
  • 6ZHOU Z H,LI M.Tri-Training:Exploiting Unlabeled Data using Three Classifiers[J].IEEE Transactions on Knowledge and Data Engineering,2005,17(11):1529-1542.
  • 7李昆仑,曹铮,曹丽苹,张超,刘明.半监督聚类的若干新进展[J].模式识别与人工智能,2009,22(5):735-742. 被引量:51
  • 8赵建华,李伟华.有监督SOM神经网络在入侵检测中的应用[J].计算机工程,2012,38(12):110-111. 被引量:15
  • 9唐明珠,阳春华,桂卫华.基于改进的QBC和CS-SVM的故障检测[J].控制与决策,2012,27(10):1489-1493. 被引量:17
  • 10文志强,胡永祥,朱文球.流形上的k最近邻分类方法[J].计算机应用,2012,32(12):3311-3314. 被引量:3

引证文献2

二级引证文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部