Document Feature and Feature Extraction in Web Text Mining
(Original title: Web文本挖掘中的特征表示和特征提取; cited by: 2)
Abstract: This paper introduces the concept and general processing workflow of Web text mining. It then examines the common methods used in the early stages of Web text mining, namely Chinese word segmentation, feature representation, and feature extraction, and gives a preliminary comparison of the different methods.
Source: Computer Knowledge and Technology (《电脑知识与技术》), 2006, No. 5, pp. 67-68 (2 pages)
Keywords: feature extraction; Web mining; text mining; word segmentation
