期刊文献+

不平衡入侵检测数据的代价敏感分类策略 被引量:6

Cost-sensitive classification strategy for imbalanced datasets of intrusion detection
在线阅读 下载PDF
导出
摘要 提出一种新的预处理算法AdaP,不仅有效避免了数据过度拟合,且可独立使用。针对不平衡的入侵检测数据集,引入代价敏感机制,基于权值矩阵最小化误分类代价的思想,去除部分训练密集区域、拓展稀疏区域的同时再过滤噪声,最终实现了AdaP算法与AdaCost算法相结合的策略。实验证明此策略充分体现了提升算法有效提升前端弱分类算法分类精度和预处理算法平衡稀有类数据的优势,且可有效提高不平衡入侵检测数据的分类性能。 This paper proposed a new data preprocessing algorithm called AdaP for avoiding over-fitting effectively while processed independently. In view of the imbalanced datasets, introduced the cost-sensitive mechanism into the intrusion detection system by the ideas: weighting matrix to minimize the miselassification costs, removing some datum in dense region expanding in rare region, as well as filtering noises. At last combined the date preprocessing algorithm AdaP with the boosting algorithm AdaCost successfully. The experiment fully reflects the advantages of boosting the classification precision with weak leaner to balance rare classes and shows the strategy which can improve classification performance of the intrusion detention in terms of the imbalanced datasets immensely.
作者 边婧 彭新光
出处 《计算机应用研究》 CSCD 北大核心 2009年第8期3036-3038,3043,共4页 Application Research of Computers
基金 山西省自然科学基金资助项目(2009011022-2)
关键词 不平衡数据 数据预处理 代价敏感 入侵检测 imbalanced data data preprocessing cost-sensitive intrusion detection
  • 相关文献

参考文献2

二级参考文献34

  • 1Turney P D.Types of cost in inductive concept learning//Proceedings of the Workshop on Cost-Sensitive Learning at the Seventeenth International Conference on Machine Learning.Stanford University,California,2000:15-21
  • 2Domingos P.MetaCost:A general method for making classifiers cost-sensitive//Proceedings of the 5th International Conference on Knowledge Discovery and Data Mining.San Diego,CA,USA,1999:155-164
  • 3Elkan C.The foundations of cost-sensitive learning//Proceedings of the 17th International Joint Conference of Artificial Intelligence.Seattle,WA,USA,2001:973-978
  • 4Zadrozny B,Elkan C.Learning and making decisions when costs and probabilities are both unknown//Proceedings of the 7th International Conference on Knowledge Discovery and Data Mining.San Francisco,CA,USA,2001:204-213
  • 5Zadrozny B,Langford J,Abe N.Cost-sensitive learning by cost-proportionate example weighting//Proceedings of the 3th International Conference on Data Mining.2003
  • 6Ting K M.Inducing cost-sensitive trees via instance weighting//Proceedings of the 2nd European Symposium on Principles of Data Mining and Knowledge Discovery.Lecture Notes in Computer Science 1510.London,UK:Springer-Verlag,1998:139-147
  • 7Drummond C,Holte R.Exploiting the cost (in)sensitivity of decision tree splitting criteria//Proceedings of the 17th International Conference on Machine Learning.2000:239-246
  • 8Drummond C,Holte R C.C4.5,Class imbalance,and cost sensitivity:Why under-sampling beats over-sampling//Proceedings of the Workshop on Learning from Imbalanced Datasets Ⅱ,Washington,DC,USA,2003
  • 9Turney P D.Cost-sensitive classification:Empirical evaluation of a hybrid genetic decision tree induction algorithm.Journal of Artificial Intelligence Research,1995,2:369-409
  • 10Ling C X,Yang Q,Wang J,Zhang S.Decision trees with minimal costs//Proceedings of the 2004 International Conference on Machine Learning (ICML'2004).2004

共引文献49

同被引文献36

  • 1郑恩辉,李平,宋执环.代价敏感支持向量机[J].控制与决策,2006,21(4):473-476. 被引量:35
  • 2Paolo S. A Multi-objective Optimization Approach for Class Imbalance Learning[J]. Pattern Recognition, 2011, 44(8): 1801- 1810.
  • 3Tan Songbo. Neighbor-weighted K-nearest Neighbor for Unbalanced Text Corpus[J]. Expert Systems with Applications, 2005, 28(4): 667-671.
  • 4Jason V H, Taghi K. Knowledge Discovery from Imbalanced and Noisy Data[J]. Knowledge and Data Engineering, 2009, 68(12): 1513-1542.
  • 5Holland J H. Adaptation in Nature and Artificial Systems[M]. Ann Arbor, USA: The University of Michigan Press, 1975.
  • 6Joo Daejoon,Hong Taeho,Han Ingoo.The neural networkmodels for IDS based on the asymmetric costs of falsenegative errors and false positive errors[J].Expert Systemswith Applications,2009:69-75.
  • 7López V,del Río S,Benítez J M,et al.Cost-sensitivelinguistic fuzzy rule based classification systems underthe MapReduce framework for imbalanced big data[J].Fuzzy Sets and Systems,2015,258:5-38.
  • 8Aslantas V,Dogru M.A new SVD based fragile imagewatermarking by using genetic algorithm[C].Sixth InternationalConference on Graphic and Image Processing(ICGIP 2014),2015.
  • 9Wu Tianfu,Zhu Songchun.Learning near-optimal costsensitivedecision policy for object detection[J].PatternAnalysis and Machine,2015,37(5):1013-1027.
  • 10姚全珠,田元,王季,杨增辉,张楠.基于最小二乘支持向量机的非平衡分布数据分类[J].计算机工程与应用,2008,44(5):166-169. 被引量:5

引证文献6

二级引证文献45

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部