期刊文献+

A Chinese Web Page Clustering Algorithm Based on the Suffix Tree 被引量:4

A Chinese Web Page Clustering Algorithm Based on the Suffix Tree
在线阅读 下载PDF
导出
摘要 In this paper, an improved algorithm, named STC-I, is proposed for Chinese Web page clustering based on Chinese language characteristics, which adopts a new unit choice principle and a novel suffix tree construction policy. The experimental results show that the new algorithm keeps advantages of STC, and is better than STC in precision and speed when they are used to cluster Chinese Web page. Key words clustering - suffix tree - Web mining CLC number TP 311 Foundation item: Supported by the National Information Industry Development Foundation of ChinaBiography: YANG Jian-wu (1973-), male, Ph. D, research direction: information retrieval and text mining. In this paper, an improved algorithm, named STC-I, is proposed for Chinese Web page clustering based on Chinese language characteristics, which adopts a new unit choice principle and a novel suffix tree construction policy. The experimental results show that the new algorithm keeps advantages of STC, and is better than STC in precision and speed when they are used to cluster Chinese Web page. Key words clustering - suffix tree - Web mining CLC number TP 311 Foundation item: Supported by the National Information Industry Development Foundation of ChinaBiography: YANG Jian-wu (1973-), male, Ph. D, research direction: information retrieval and text mining.
作者 YANGJian-wu
出处 《Wuhan University Journal of Natural Sciences》 EI CAS 2004年第5期817-822,共6页 武汉大学学报(自然科学英文版)
基金 theNationalInformationIndustryDevelopmentFoundationofChina
关键词 CLUSTERING suffix tree Web mining clustering suffix tree Web mining
  • 相关文献

同被引文献25

引证文献4

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部