期刊文献+

基于词条时序的朴素贝叶斯垃圾邮件过滤方法 被引量:2

Method of Spam Filtering Based on Naive Bayesian Algorithm with Token Time-Series
在线阅读 下载PDF
导出
摘要 朴素贝叶斯分类算法是一种有效的垃圾邮件过滤技术.互联网上的信息随着时间推移产生概念的变迁,最近出现的垃圾邮件词条可作为判定垃圾邮件的重要依据.将新近的垃圾邮件词条单独记录,在进行邮件分类时,对于最近出现的垃圾词条,提高其对垃圾邮件判定的先验概率.通过实验对比,提出的垃圾邮件过滤方法较传统的朴素贝叶斯垃圾邮件过滤具有更高的准确性、精确性和召回率. Naive Bayesian classification algorithm is an effective spam filtering method. Information on the Internet changed over time and generated new concepts. The recent spam tokens were an important basis for spam filtering. In this approach,new spam tokens were recorded separately according to their time series and those tokens got higher prior probability in the email classification. Experimental results show that the proposed algorithm has higher accuracy,precision and recall than the traditional Nave Bayesian spam filtering method.
作者 常青
机构地区 辽宁医学院
出处 《微电子学与计算机》 CSCD 北大核心 2010年第5期212-216,共5页 Microelectronics & Computer
关键词 朴素贝叶斯 垃圾邮件过滤 概念漂移 时间序列 Nave Bayesian spam filtering concept drift time series
  • 相关文献

参考文献7

  • 1Androutsopoulos I, Koutsias J, Cbandrinos G, et al. An evaluation of naive bayesian anti- spam filtering[C]//Proceedings of the Workshop on Machine Learning in the New Information Age,11th European Conference on Machine Learning. Barcelona, Spain, 2000: 9 - 17.
  • 2Androutsopoulos I, Koutsias J, Chandrinos K, et al. An experimental comparison of naive bayesian and keyword - based anti- spare filtering with encrypted personal email messages[C]//Procs of 23rd International ACM SIGIR Conference on Research and Development in Information Retrieval. Greece, Athens, 2000:160 - 167.
  • 3Sakkis G, Androutsopoulos I, Paliouras G, et al. A memory- based approach to anti - spare filtering for mailing lists [J ]. Information Retrieval, 2003,6( 1 ) : 49 - 73.
  • 4李翔鹰,叶枫.一种基于多贝叶斯算法的垃圾邮件过滤方法[J].计算机工程与应用,2006,42(31):114-116. 被引量:7
  • 5廖明涛,张德运,李金库.基于朴素贝叶斯和层次聚类的两阶段垃圾邮件过滤方法[J].微电子学与计算机,2007,24(8):1-3. 被引量:5
  • 6Sarah J D, Padraig C, Alexey T, et al. A case - based technique for tracking concept drift in spare filtering[C]// Applications and Innovations in Intelligent Systems Ⅻ. Procs. of AI, England:Cambridge, 2004:3- 16.
  • 7Androutsopoulos I, Koutsial J, Cbandrinos K V, et al. An experimental comparison of naive bayesian and keyword- based anti - spam filtering with encrypted personal e-mail messages[C]//Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2000 ). Athens, Greece,2000:160 - 167.

二级参考文献13

  • 1欧阳,韩逢庆.基于多Bayes网的垃圾邮件智能过滤研究[J].计算机科学,2004,31(8):61-63. 被引量:2
  • 2李惠娟,高峰,管晓宏,黄亮.基于贝叶斯神经网络的垃圾邮件过滤方法[J].微电子学与计算机,2005,22(4):107-111. 被引量:21
  • 3SAHAMI M,DUMAIS S,HECKERMAN D,et al.A Bayesian approach to filtering Junk e-mail[C]//AAAI Workshop on Learning for Text Categorization.Madison,Wisconsin:[s.n.],1998:55-62.
  • 4GRAHAM P.A plan for spam[EB/OL].URL:http://paulgraham.com/spam.html.
  • 5GRAHAM P.Better bayesian filtering[EB/OL].URL:http://paulgraham.com/better.html.
  • 6ROBINSON G.A statistical approach to the spam problem[J/OL].Linux Journal,2003 (107).URL:http:/ /www.linuxjournal.com/ article.php?sid=6467.
  • 7SEGAL R,CRAWFORD J,KEPHART J,et al.SpamGuru:an enterprise anti-spam filtering system[C/OL]//Proceedings of First Conference on Email and Anti-Span (CEAS).Mountain View,CA:[s.n.],2004.URL:http://www.ceas.cc/papers-2004/126.pdf.
  • 8ANDROUTSOPOULOS I,KOUTSIAS J,CBANDRINOS K V,et al.An experimental comparison of naive bayesian and keyword-based anti-spam filtering with encrypted personal e-mail messages[C]//Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval(SIGIR 2000).Athens,Greece:[s.n.],2000:160-167.
  • 9MCCALLUM A,NIGAM K.A comparison of event models for naive bayes text classification[C]//Proceedings of AAAI-98 Workshop on Learning for Text Categorization.Menlo Park,CA:AAAI Press,1998:41-48.
  • 10Sahami M,Dumais S,Heckerman D,et al.A Bayesian approach to filtering junk email[A].Proc.of AAAI Workshop on Learning for Text Categorization[C].1998

共引文献10

同被引文献21

引证文献2

二级引证文献13

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部