期刊文献+

基于快速SVM的大规模网络流量分类方法 被引量:5

Large-scale network traffic classification with fast support vector machine method
在线阅读 下载PDF
导出
摘要 支持向量机方法具有良好的分类准确率、稳定性与泛化性,在网络流量分类领域已有初步应用,但在面对大规模网络流量分类问题时却存在计算复杂度高、分类器训练速度慢的缺陷。为此,提出一种基于比特压缩的快速SVM方法,利用比特压缩算法对初始训练样本集进行聚合与压缩,建立具有权重信息的新样本集,在损失尽量少原始样本信息的前提下缩减样本集规模,进一步利用基于权重的SVM算法训练流量分类器。通过大规模样本集流量分类实验对比,快速SVM方法能在损失较少分类准确率的情况下,较大程度地缩减流量分类器的训练时间以及未知样本的预测时间,同时,在无过度压缩前提下,其分类准确率优于同等压缩比例下的随机取样SVM方法。本方法在保留SVM方法较好分类稳定性与泛化性能的同时,有效提升了其应对大规模流量分类问题的能力。 SVM has been applied for network traffic classification preliminarily because of its high classification accuracy, sta- bility and generalization. However, scaling up SVM to large-scale network traffic classification is still an open problem because of the high computation complexity as well as long training and prediction time. This paper proposed a hit-reduction based fast SVM. Firstly, it applied the bit-reduction algorithm to reduce the cardinality of the samples by weighting representative exam- ples, and reduced the scale of training dataset with minimum loss of initial sample information. Then it developed SVM trained on weighted samples. The experiment results of large-scale network traffic classification show that bit-reduction SVM produces a significant reduction in the time required for both classifier training and prediction of unknown samples with minimum loss in accuracy. Meanwhile, its results in more accurate classifiers than random sampling based SVM when the dataset are not overcompressed. This method scales up SYM to large-scale network traffic classification with retaining the stability and generalization performance of SVM.
作者 王涛 程良伦
出处 《计算机应用研究》 CSCD 北大核心 2012年第6期2301-2305,共5页 Application Research of Computers
基金 国家自然科学基金-广东省联合基金重点资助项目(U0935002) 广东省重大科技专项资助项目(2009A080207008) 广州市科技计划资助项目(2010Z1-D00061) 广东省高校优秀青年创新人才培养计划资助项目(LYM11057)
关键词 支持向量机 大规模流量分类 比特压缩 权重SVM 分类器 分类准确率 support vector machine (SVM) large-scale network traffic classification bit reduction weighted SVM classifi- er classification accuracy
  • 相关文献

参考文献15

  • 1MADI-IUKAR A, WILLIAMSON C. A longitudinal study of P2P traffic classification[ C]//Proc of the 14th IEEE Int'l Symposium on Modeling, Analysis, and Simulation. 2006.
  • 2CALLADO A, KAMIENSKI C, SZABO G, et al. A survey on internet traffic identification[ J]. IEEE Communications Surveys & Tutorials,2009,11 ( 3 ) : 37- 52.
  • 3NGUYEN T, ARMITAGE G. A survey of techniques for Intemet traffic using machine learning[ J]. IEEE Communications Surveys & Tutorials, 2008,10 (4) : 56- 76.
  • 4ROUGHAN M, SEN S, SPATSCHECK O, et al. Class-of-service mapping for QoS: a statistical signature-based approach to IP traffic classification[ C ]//Proc of ACM/SIGCOMM Internet Measurement Conference (IMC). 2004.
  • 5MOORE A W, ZUEV D. Internet traffic classification using Bayesian analysis techniques [ C ]//Proc of ACM International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS). 2005.
  • 6AULD T, MOORE A W, GULL S F. Bayesian neural networks for Intemet traffic classification[ J]. IEEE Trans on Neural Networks, 2007,18( 1 ) :223-239.
  • 7王宇,余顺争.网络流量的决策树分类[J].小型微型计算机系统,2009,30(11):2150-2156. 被引量:8
  • 8徐鹏,林森.基于C4.5决策树的流量分类方法[J].软件学报,2009,20(10):2692-2704. 被引量:171
  • 9徐鹏,刘琼,林森.基于支持向量机的Internet流量分类研究[J].计算机研究与发展,2009,46(3):407-414. 被引量:59
  • 10ZANDER S, NGUYEN T, ARMITAGE G. Automated traffic classification and application identification using machine learning [ C ]// Proc of the 30th IEEE Conference on Local Computer Networks. 2005.

二级参考文献31

  • 1Madhukar A, Williamson C. A longitudinal study of P2P traffic classification [C]//Proc of the 14th IEEE Int Syrup on Modeling, Analysis, and Simulation. Washington, DC IEEE Computer Society, 2006:179-188
  • 2Moore A W, Papagiannaki K. Toward the accurate identification of network applications [G]//Dovrolis C. LNCS 3431: Proc of the PAM 2005. Heidelberg: Springer, 2005:41-54
  • 3Karagiannis T, Papagiannaki K, Faloutsos M. BLINC: Multilevel traffic classification in the dark [C]//Proc of ACM SIGCOMM. New York: ACM, 2005.. 229-240
  • 4Roughan M, Sen S, Spatscheck O, et al. Class of service mapping for QoS: A statistical signature-hased approach to IP traffic classification [C]//Proc of ACM SIGCOMM Internet Measurement Conf 2004. New York: ACM, 2004: 135-148
  • 5Zuev D. Moore A W. Traffic classification using a statistical approach [G]//Dovrolis C. LNCS 3431: Proc of the PAM. Heidelberg, Germany: Springer, 2005:321-324
  • 6Moore A W, Zuev D. Internet traffic classification using Bayesian analysis techniques [C] //Proc of the 2005 ACM SIGMETRICS Int Conf on Measurement and Modeling of Computer Systems. New York: ACM, 2005: 50-60
  • 7Tan P N, Steinbach M, Kumar V. Introduction to Data Mining [M]. Boston: Addison Wesley, 2006
  • 8Moore A W, Zuev D, Crogan M. Discriminators for use in flow-based classification, RR-05-13 [R]. London: Queen Mary University of London, 2005
  • 9Witten I H, Frank E. Data Mining: Practical Machine Learning Tools and Techniques [M]. 2nd ed. Amsterdam: Elsevier Inc. , 2005
  • 10Chang C C, Lin C J. LIBSVM: A library for support vector machines[EB/OL]. 2001 [2007-08-06]. http://www.csie. ntu. edu. tw/-ejlin/libsvm

共引文献213

同被引文献50

  • 1杨斌,路游.基于统计学习理论的支持向量机的分类方法[J].计算机技术与发展,2006,16(11):56-58. 被引量:17
  • 2VAPNIK V N.统计学习理论[M].许建华,张学工,译.北京:电子工业出版社,2004.
  • 3Vapnik V N. The Nature of Statistical Learning Theory [M]. Berlin: Springer, 1995.
  • 4B. Scholkopf, A. J. Smola. Leaming with kernels. Cambridge, Massachusetts [M]. London, England: The MIT Press, 2002.
  • 5王敬宇.支持向量机方法及其在企业信用风险评级中的应用[D].长春:吉林大学,2010:4.
  • 6Keerthi S S, Lin C J. Asymptotic behaviors of support vector machine with gaussian kernel [J]. Neural Computation, 2003, 15(7): 1667-1689.
  • 7Hsu C, Lin C J. A comparison of methods for multi-class support vector machines [J]. IEEE Transactions on Neural Networks, 2002, 13(2): 415-425.
  • 8Astofino A, Gorgone E, Gandioso M, et al. Data preprocessing in semi - supervised SVM classification [ J ]. Optimization, 2011,60 ( 1-2 ) : 143-151.
  • 9Hsu C W, Lin C J. Acomparison of methods for multi-class support vector machines [ J ]. IEEE transactions on neural net- works,2002 ( 13 ) : 415-425.
  • 10Chang C C, Lin C J. LIBSVM:A library for support vector ma- chines[ EB/OL]. 2001 [ 2013-03-04 ]. http://www, csie. ntu. edu. tw! - cjlin/papers/libsvm, pdf.

引证文献5

二级引证文献41

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部