期刊文献+

基于内容过滤的反垃圾邮件系统模型研究 被引量:2

THE STUDY ON CONTENT-BASED ANTI-SPAM FILTERING SYSTEM MODEL
原文传递
导出
摘要 在反垃圾邮件系统中,基于内容的过滤技术得到了广泛应用,将贝叶斯统计方法应用于邮件文本过滤取得了较好的性能。本文利用贝叶斯过滤算法构建了反垃圾邮件系统模型,应用系统性能评价指标对贝叶斯过滤算法的多变量贝努利事件模型和基于词频的多项式事件模型进行验证及评价,实验结果表明,若保证有充足的训练样本条件下,并根据Spam的变异调整训练样本集,贝叶斯过滤算法会获得较高的Spam过滤性能,反垃圾邮件效果显著。 In the anti -spare systems, content- based filtering technology has been widely used and. Bayesian statistical method is used in the message text filter to obtain a better performance. In this paper, Bayesian filtering algorithm is applied to construct the anti - spam system model. The application performance evaluation of Bayesian filtering algorithm multivariate Bernoulli event model and e- vent model based on word frequency polynomial verification and evaluation of experimental results indicated that if suffleient training sample conditions, the training sample set and in accordance with the variation adjustment of spam, Bayesian filtering algorithm will be high spare filtering performance and anti - spare results are obvious.
出处 《内蒙古农业大学学报(自然科学版)》 CAS 北大核心 2013年第3期163-167,共5页 Journal of Inner Mongolia Agricultural University(Natural Science Edition)
基金 内蒙古工业大学基金项目(X200801)
关键词 垃圾邮件 贝叶斯过滤 训练样本集 贝努利事件模型 多项式事件模型 Spam bayesian filtering training sample set bernoulli event model polynomial event model
  • 相关文献

参考文献8

二级参考文献50

  • 1李渝勤,孙丽华.基于规则的自动分类在文本分类中的应用[J].中文信息学报,2004,18(4):9-14. 被引量:20
  • 2刘建毅,张鹏飞,王枞,郭燕慧,李赟.高性能电子邮件过滤系统的设计与实现[J].计算机应用研究,2005,22(4):224-225. 被引量:9
  • 3吕锋,张炜玮.4种序列模式挖掘算法的特性研究[J].武汉理工大学学报,2006,28(2):57-60. 被引量:14
  • 4Androutsopoulos I, Koutsias J, Chandrinos K V, et al.An evaluation of naive bayesian anti-spam fihering[C]//Proceedings of the Workshop on Machine Learning in the New Information Age, 11th European Conference on Machine Learning.Barcelona,Spain,2000:9-17.
  • 5Zorkadis V,Karras D A,Panayotou M.Efficient information theoretic strategies for classifier combination//feature extraction and performance evaluation in improving false positives and false nega- tives for spam e-mail filtering[J].Neural Networks,2005,18:799-807.
  • 6Lai Chih-Chin.An empirical study of three machine learning methods for spam fiherlng[J].Knowledge-Based System, doi : 10.1016/j. knosys.
  • 7Stone T.Parameterization of naive bayes for spam fihering[R/OL].University of Colorado at Boulder,2003.http://trevorstone.org/school/ spamfiltering.pdf.
  • 8[5]Secker,A.,Freitas,A.A.and Timmis,J.,AISEC:an artificial immune system for e-mail classification,The 2003 Congresson Evolutionary Computation,200S(1):131-139.
  • 9[6]Srinivasan P,Pant G,Menczer F.Target seeking crawlers and their topical performance[C].In:Proc of SIGIR Conference on Research and Development in lntormation Retrieval,ACM press,2002.
  • 10[7]H.Lim.An improved kNN learning based Korean text classifier with heuristic information.In Proceedings of the 9th International Conference on Neural Information proceeding,Singapore,Nov,2005:731-735.

共引文献135

同被引文献21

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部