
Using a Negative-Selection Algorithm to Improve Text Filtering (cited by: 2)
Abstract: This paper explores the design of text filters based on association rules in three ways: (1) to handle the characteristics of Chinese web language, the n-Gram method is introduced to extract text features; (2) the concept of a boundary (edge) sample is proposed; (3) a negative-selection algorithm is introduced into the design of the association-rule-based filter and applied to the filter's detector set to make it self-tolerant, yielding a high-accuracy text filter. Experiments show that filters whose detector sets have undergone self-tolerance achieve higher filtering accuracy.
Source: Computer Engineering & Science (《计算机工程与科学》), CSCD, 2008, No. 8, pp. 61-64 (4 pages)
Keywords: text filtering; negative-selection algorithm; n-Gram; association rule
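The abstract names two mechanisms, n-Gram feature extraction and self-tolerance of a detector set via negative selection, without giving their details. The following is only a minimal sketch of the general idea, not the paper's method. It assumes that a detector is a set of character n-grams (standing in for a rule antecedent) and that a detector fires when all of its n-grams occur in a text. The names `char_ngrams`, `matches`, `tolerize`, and `is_filtered` are hypothetical, and the sample strings are made up for illustration.

```python
from typing import FrozenSet, Iterable, List, Set


def char_ngrams(text: str, n: int = 2) -> Set[str]:
    """Return the set of character n-grams in text, ignoring whitespace."""
    chars = "".join(text.split())
    return {chars[i:i + n] for i in range(len(chars) - n + 1)}


def matches(detector: FrozenSet[str], features: Set[str]) -> bool:
    """Assumed matching rule: a detector fires if all its n-grams are present."""
    return detector <= features


def tolerize(detectors: Iterable[FrozenSet[str]],
             self_texts: Iterable[str],
             n: int = 2) -> List[FrozenSet[str]]:
    """Negative selection: drop every detector that fires on a self sample.

    'Self' here means legitimate text that the filter must not block.
    """
    self_features = [char_ngrams(t, n) for t in self_texts]
    return [d for d in detectors
            if not any(matches(d, f) for f in self_features)]


def is_filtered(text: str, detectors: Iterable[FrozenSet[str]], n: int = 2) -> bool:
    """A text is filtered if any surviving detector fires on it."""
    features = char_ngrams(text, n)
    return any(matches(d, features) for d in detectors)


# Hypothetical candidate detectors and one legitimate ("self") sample.
candidates = [frozenset({"fr", "ee"}), frozenset({"wi", "in"})]
legitimate = ["free software license"]

tolerant = tolerize(candidates, legitimate)
```

In this sketch the detector `{"fr", "ee"}` is discarded because it fires on the legitimate sample, while `{"wi", "in"}` survives. Character n-grams are used here because they need no word segmentation, which is one common reason to apply them to Chinese text.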

