期刊文献+

基于半监督聚类的Web流量分类 被引量:3

Web Traffic Classification Based on Semi-supervised Clustering
在线阅读 下载PDF
导出
摘要 提出了一种基于半监督学习的方法对Web流量进行聚类分析,使用隐马尔可夫模型对用户流量进行描述和聚类分析。该方法通过对少量数据进行人工标识,利用已标识数据对无监督聚类结果进行调整,以得到与人工分类匹配的聚类结果。使用真实的Web流量对提出的方法进行验证,实验结果表明该方法能有效地对Web流量进行分类,并得到相应的描述模型。 This paper presented a Web traffic classification method based on semi-supervised clustering, which uses HMM (Hidden Markov Model) to model and analyze Web client traffic. The method first runs an unsupervised clustering process on the whole data set,and then uses the pre-labeled data to adjust the result clusters. The paper also presented the experiment result on real network data to validate the purposed method.
出处 《计算机科学》 CSCD 北大核心 2009年第2期90-94,共5页 Computer Science
基金 国家高技术研究发展计划(863)资助项目(批准号:2007AA01Z449) 国家自然科学基金-广东联合基金重点项目(U0735002) 国家自然科学基金项目(90304011)资助
关键词 半监督聚类 隐马尔可夫模型 WEB流量 Semi-supervised clustering, Hidden markov model, Web traffic
  • 相关文献

参考文献10

  • 1Mah B, An empirical model of http network traffic //Proceedings of the INFOCOM' 97. Vol 2. IEEE Computer Society Press, 1997:592-600
  • 2Rabiner L. A tutorial on hidden Markov models and selected applications in speech recognition//Proc, of IEEE. 1989,77 (2): 257-286
  • 3Lu wz, Yu sz. Clustering Web Traffic of Request Bursts//Proceedings of the IEEE TENCON. 2006 : 1-4
  • 4李和平,胡占义,吴毅红,吴福朝.基于半监督学习的行为建模与异常检测[J].软件学报,2007,18(3):527-537. 被引量:30
  • 5Basu S, Banerjee A, Mooney R. Semi-supervised clustering by seeding//Proc. 19th Int. Conf. Machine Learning. 2002 : 19-26
  • 6Wagstaff K , et al. Constrained k- means clustering with back - ground knowledge//Proc. 18th Int. Conf. Machine Learning. 2001:577-584
  • 7Cohn D, Caruana R, McCallum A. Semi-supervised clustering with user feedback. Technical Report TR2003-1892. Cornell University, 2003
  • 8Shi Z. Semi-supervised model- based document clustering: A comparative study. Machine Learning, 2006, 65(1) 3-29
  • 9孙广玲,唐降龙.基于分层高斯混合模型的半监督学习算法[J].计算机研究与发展,2004,41(1):156-161. 被引量:24
  • 10Frigui H, Mahdi R. Semi-Supervised Clustering and Feature Discrimination with Instance-Level Constraints//Proceedings of IEEE International Fuzzy Systems Conference 2007. 2007 : 1-6

二级参考文献9

  • 1[1]M Weber, M Welling, P Perona. Towards automatic discovery of object categories. IEEE Conf on Computer Vision and Pattern Recognition, Hilton Head Island, 2000
  • 2[2]K P Bennett, A Demiriz. Semi-supervised support vector machines. In: D C M Kearns, S Solla, eds. Advances in Neural Information Processing Systems. Cambridge, MA: MIT Press, 1998. 368~374
  • 3[3]K Nigam, A McCallum, S Thrun .et al.. Text classification from labeled and unlabeled documents using EM. Machine Learning, 1999, 39: 103~134
  • 4[4]W Ying, H Thomas S. Self-supervised learning for object recognition based on kernel discriminant-EM algorithm. The IEEE Int'l Conf on Computer Vision, Vancouver, 2001
  • 5[5]B Sugato. Semi-supervised clustering by seeding. The 19th Int'l Conf on Machine Learning, Sydney, 2002
  • 6[6]Kristin P Bennett, Ayhan Derniriz .et al.. Exploiting unlabeled data in ensemble methods. The 8th ACM SIGKDD Int'l Conf on Knowledge Discovery and Data Mining, Edmonton, 2002
  • 7[7]A P Dempster, N M Laired, D B Rubin. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, 1977, B(39): 1~38
  • 8[8]Nei Kato. A handwritten character recognition system using directional element feature and asymmetric Mahalanobis distance. IEEE Trans on Pattern Analysis and Machine Intelligence, 1999, 21(3): 258~262
  • 9胡卫明,谢丹,谭铁牛,沈俊.轨迹分布模式学习的层次自组织神经网络方法[J].计算机学报,2003,26(4):417-426. 被引量:15

共引文献52

同被引文献23

  • 1俞研,黄皓.一种半聚类的异常入侵检测算法[J].计算机应用,2006,26(7):1640-1642. 被引量:17
  • 2蔡嘉荣 印鉴 刘玉葆.一种有效的文本聚类算法.计算机科学,2006,.
  • 3Rasmussen E.Clustering algorithms.In William B.Frakes and Ricardo Baeza-Yates[C]//Information Retrieval:Data Structures and Algorithms.Englewood Cliffs,New Jersey:Prentice Hall,1992,419-422.
  • 4Wagstaff K.Constrained k-means clustering with back-ground knowledge[C]∥Proceedings of the 18th International Conference on Machine Learning.San Francisco,CA,USA:Morgan Kaufmann Publishers Inc,2001,577-584.
  • 5Cohn D,Caruana R,McCallum A.Semi-supervised clustering with user feedback[J].Technical Report TR2003-1892.Cornell U-niversity,2003.
  • 6Shi Z.Semi-supervised model-based document clustering:A comparative study[J].Machine Learning,2006,65(1):3-29.
  • 7Basu S,Banerjee A,Mooney R.Semi-supervised clustering by seeding[C].Proceedings of the 19th International Conference on Machine Learning.Machine Learning.San Fransisco:Morgan Kaufmann Publishers,2002,19-26.
  • 8CHAPELLE O,SCHOLKOPF B,ZIEN. A semi-supervised learning[M]. Cambridge :MIT Press, 2006..12-27.
  • 9MILLER D J,UYAR H S. A mixture of experts classifier with learning based on both labelled and unlabelled data [C]//MOZER M,JORDAN M I,PETSCHE T,et al. Advances in Neural Information Processing Systems 9. Cam- bridge :MIT Press, 1997 : 571-577.
  • 10PEDRYCZ W,WALETZKY J. Fuzzy clustering with partial supervision[J]. IEEE Transaction on Systems ,Man,and Cybernetics ~ Part B,1997,27(5) :787-795.

引证文献3

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部