期刊文献+

基于Boosting算法的垃圾邮件过滤方法研究 被引量:7

A Filtering Method Against Junk Mail Using Boosting Algorithm
在线阅读 下载PDF
导出
摘要 为解决垃圾邮件过滤的精确度和有效性问题,提出了一种基于邮件内容过滤的垃圾邮件过滤方法,该方法采用Boosting算法构造了一种垃圾邮件过滤器,利用该垃圾邮件过滤器实现对垃圾邮件的过滤.本文借鉴文本分类和信息检索领域所使用的评价指标,构建了垃圾邮件过滤器的评价体系,利用该评价体系,针对基于Boosting算法所构造的垃圾邮件过滤器对垃圾邮件的过滤实验所得到的数据进行了测试和评估,测试和评估的结果验证了Boosting算法在垃圾邮件过滤中的有效性,其性能优于传统的贝叶斯算法. To heighten the accuracy and effectiveness of filtering junk mail, this paper presents a mail filtering method based on filtering mail contents. The junk mail filter was constructed using Boosting algorithm. Referring to the evaluation index adopted in such fields as text classification and information retrieval, the evaluation system of presented junk mail filter was also established. The experiment and performance evaluation were implemented. Results show that, compared with traditional Bias algorithm, the performance of filtering method using Boosting algorithm is obviously superior to the method using traditional Bias algorithm.
出处 《北京理工大学学报》 EI CAS CSCD 北大核心 2013年第1期79-83,共5页 Transactions of Beijing Institute of Technology
基金 黑龙江齐齐哈尔市科学技术计划项目(GYGG2010-06-02)
关键词 BOOSTING算法 垃圾邮件 过滤 分类器 评价 Boosting algorithm junk mail filtering classifier evaluation
  • 相关文献

参考文献10

  • 1Valiant L G. A theory of leamable[J].Communications of the ACM,1984.1134-1142.
  • 2Schapire R E. The strength of weak leamability[J].Machine Learning,1990.197-227.
  • 3Freund Y. Boosting a weak algorithm by majority[J].Information and Computation,1995,(02):256-285.doi:10.1006/inco.1995.1136.
  • 4Freund Y,Schapire R E. A decision-theoretic generaiization of online learning and an application to Boosting[J].Journal of Computer and Systems Sciences,1997,(01):119-139.
  • 5Duffy N,Helmbold D. Boosting methods for regression[J].Machine Learning,2002,(2-3):153-200.
  • 6杨峰,曹麒麟,段海新,李星.基于DNS Blocklist的反垃圾邮件系统的设计与实现[J].计算机工程与应用,2003,39(7):11-12. 被引量:11
  • 7潘文峰.基于内容的垃圾邮件过滤研究[D]北京:中国科学院计算技术研究所,2004.
  • 8Freund Yoav,Schapire R E. A short introduction to boosting[J].Journal of Japanese Society for Artificial Intelligence,1999,(05):771-780.
  • 9曾春,邢春晓,周立柱.基于内容过滤的个性化搜索算法[J].软件学报,2003,14(5):999-1004. 被引量:118
  • 10Andnoutsopoulos I,Koutisias J,Chnadrinos K V. An evaluation of Narve Bayesine anti-spam filtering[A].Bracelone,Spain,2000.9-17.

二级参考文献6

  • 1[1]Jon Postel. On the Junk Mail Problem[S].RFC706,1975-11
  • 2[2]James Niccolai.China seen as a growing source of spam.http://www.infoworld.com/articles/en/xml/00/04/04/000404enspam.xml, 2000-04
  • 3[3]Joseph Neubauer. Is Your Exchange Server Relay-Secure?http://www.exchangeadmin.com/Articles/Index.cfm?ArticleID=7696,1999-11-29
  • 4[4]Travis Berkerly.E-mail please,hold the spam[J].Network World,2000-02;17(7) :71~75
  • 5[5]G Linderberg. Anti-Spam Recommendations for SMTP MTAs[S].RFC2505,1999-02
  • 6曾春,邢春晓,周立柱.个性化服务技术综述[J].软件学报,2002,13(10):1952-1961. 被引量:396

共引文献127

同被引文献54

  • 1王斌,潘文锋.基于内容的垃圾邮件过滤技术综述[J].中文信息学报,2005,19(5):1-10. 被引量:129
  • 2RICHARD O D.贝叶斯决策论模式分类[M].北京:机械工业出版社,2003:80-95.
  • 3董振东.[EB/OL].知网http://www.keenage.com,1999.
  • 4Vapnik V N. The nature of statistical learning theory[M]. New York: Spring-Verlag Press, 1995.
  • 5Dumais S, PlattJ, Heckerman D, et al. Inductive learning algorithms and representations for text categorization[CJ II Proceedings of the 7th International Conference on Information and Knowledge Manage?ment. Bethesda, USA:[so n.J, 1998.
  • 6Osuna E, Freund R, Girosi F. Training support vector machines: an application to face detection[CJ II Proceedings of IEEE Conf. Computer Vision and Pattern Recognition.[So 1.J: IEEE, 1997: 130 - 136.
  • 7Gretton A. Support vector regression for black-box system identificationj C] II Proceedings of the 11 th IEEE Signal Processing Workshop on Statistical Signal Processing. Singapore:[so n.J, 2001: 341 - 344.
  • 8Tay F E H,Cao LJ. Modified support vector machines in financial time series forecasting[]]' Neurocomputing, 2002,48 :847 - 86l.
  • 9Kim H,Park H. Prediction of protein relative solvent ac?cessibility with support vector machines and long-range interaction 3D local descriptor[J]. Proteins: Structure. Function. And Bioinformatics , 2004.54(3) :557 - 562.
  • 10Brown M, Grundy W, Lin D, et al. Support vector machine classification of microarray gene expression data, UCSC-CRL-99-09[R]. Santa Cruz: University of California, 1999.

引证文献7

二级引证文献28

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部