期刊文献+

基于粗糙集和朴素贝叶斯的垃圾邮件过滤系统 被引量:5

Spam Filtering System Based on Rough Set and Naive Bayes
在线阅读 下载PDF
导出
摘要 提出了基于粗糙集理论和贝叶斯分类算法的垃圾邮件过滤方法。利用粗糙集约简算法对邮件样本集进行特征约简,删除对邮件过滤结果影响不大的冗余特征,从而降低了输入样本集的维数,解决了贝叶斯分类器训练时间长,样本集占用的存储空间过大的问题。实验证明,该方法可以提高邮件过滤的准确性和训练的速度。 This paper proposed a spam filtering method based on Rough set theory and Bayesian classifier algo-rithm. Then the amount of features are reduced by deleting redundant features with little significance on filtering effect based on rough set theory, resulting in a input sample with reduced number of dimension. Using this method, it can overcome the shortages of Bayies classifier-time-consuming of training and massive dataset storage. Experiments proved that this mechanism could greatly boost both the system' s accuracy and the training speed.
出处 《南昌大学学报(工科版)》 CAS 2009年第1期45-48,共4页 Journal of Nanchang University(Engineering & Technology)
基金 江西省教育厅科技计划资助项目(赣教技字[2007]23号 赣教技字[2007]344号)
关键词 垃圾邮件 粗糙集 朴素贝叶斯分类器 spam rough set naive bayesian classifier
  • 相关文献

参考文献7

二级参考文献21

  • 1戴劲松,白英彩.基于贝叶斯理论的垃圾邮件过滤技术[J].计算机应用与软件,2006,23(1):110-111. 被引量:16
  • 2闫龙,王文杰.基于贝叶斯方法的一种垃圾邮件过滤的实现[J].微电子学与计算机,2006,23(2):86-88. 被引量:10
  • 3翟凤文,赫枫龄,左万利.字典与统计相结合的中文分词方法[J].小型微型计算机系统,2006,27(9):1766-1771. 被引量:42
  • 4殷海波,宁绍军,王东.基于内容的贝叶斯自学习邮件过滤模型[J].计算机应用与软件,2007,24(1):177-179. 被引量:7
  • 5王国胤.Rough集理论和知识获取[M].西安:西安交通大学出版社,2001..
  • 6Massey B, Thomure M, Budrevich R, et al. Learning spam: simple techniques for freely-available software[C] // USENIX Annual Technical Conference. Berkeley: USENIX Association, 2003 : 63 - 76.
  • 7Karlberger C, Bayler G, Kruegel C, et al. Exploiting redundancy in natural language to penetrate bayesian spare filters[C] //Proceedings of the first USENIX workshop. Berkeley: USENIX Association, 2007: 1- 7.
  • 8Ramachandran A, Feamster N, Vempala S. Filtering spam with behavioral blacklisting [ C ]//Conference on Computer and Communications Security. New York: ACM, 2007:342 - 351.
  • 9Brodsky A, Brodsky D. A distributed content independent method for spam detection[C] // Proceedings of the first conference. Berkeley: USENIX Association, 2007: 1 - 10.
  • 10Cheng D, Kannan R, Vempala S, et al. A divide-and-merge methodology for clustering [ J ]. ACM, 2006, 31 (4) :1499 - 1525.

共引文献635

同被引文献40

引证文献5

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部