期刊文献+

一种新型的Web挖掘数据采集模型 被引量:5

A New Web Mining Data Integration Model Based on XML
在线阅读 下载PDF
导出
摘要 本文在简要论述了当前Web挖掘采用的数据源不足后,分析了XML文档结构与Web挖掘算法结构的相似性,提出了采用XML技术在应用服务层采集用户访问数据的数据源模型X-DIM,并分析了它的优越性。该模型克服了以往基于Web访问日志在数据预处理中的一系列问题,具有数据完备、准确度高、便于为挖掘算法使用等优点,有较高的应用价值。 The paper briefly describes the demerits of insufficient data sources adopted in the current Web mining,analyses the similarity between the XML document structure and the Web mining algorithm structure,proposes a data source model X-DIM of adopting the XML technology in the application service layer to sample users' access data,and analyes its advantages.The model overcomes a series of problems previously encountered in data preprocessing based on the Web access log,and features the merits of data completeness,high accuracy,ease of use in mining algorithms,and high application value.
出处 《计算机工程与科学》 CSCD 2007年第2期36-39,共4页 Computer Engineering & Science
关键词 XML X-DIM WEB挖掘 电子商务 XML,X-DIM,Web mining,E-commerce
  • 相关文献

参考文献5

二级参考文献36

  • 1[1]Cooley R,Mobasher B,Srivastava J.Grouping Web Page References into Transactions for Mining World Wide Web Browsing Patterns[R].Minneapolis,USA,Dept.of Computer Science,Univ.of Minnesota,1997.
  • 2[2]Cooley R,Mobasher B,Srivastava J.Data Preparation for Mining World Wide Web Browsing Patterns[J].Knowledge and Information Systems,1999,1(1):5- 32.
  • 3[3]Berent B,Mobasher B,Spiliopoulou M,et al.Measuring the Accuracy of Sessionizers for Web Usage Analysis[C].Workshop on Web Mining at the First SIAM International Conference on Data Mining,Chicago,USA,2001.7- 14.
  • 4[4]The Common Logfile Format[EB/OL].http://www.w3.org/Daemon/User/Config/Logging.html#common-logfile-format,2003.
  • 5[5]Ansari S,Kohavi R,Mason L,et al.Integrating E-commerce and Data Mining:Architecture and Challenges[C].The 2001 IEEE International Conference on Data Mining,San Jose,California,USA,2001.27- 34.
  • 6[6]Kohavi R.Mining E-commerce Data:The Good,the Bad,and the Ugly[C].Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,2001.8- 13.
  • 7[7]Shahabi,Zarkesh A M,Adibi J,et al.Knowledge Discovery from Users Web-page Navigation[C].Workshop on Research Issues in Data Engineering,Birmingham,England,1997.
  • 8[8]Zarkesh,Adibi J,Shahabi C,et al.Analysis and Design of Server Informative WWW-sites[C].Sixth International Conference on Information and Knowledge Management,Las Vegas,Nevada,1997.254- 261.
  • 9[9]Han J W,Pei J,Yin Y W.Mining Frequent Patterns without Candidate Generation[C].2000 ACM SIGMOD Intl.Conference on Management of Data,USA,2000.1- 12.
  • 10[10]Mao R.Adaptive-FP:An Efficient and Effective Method for Multi-Level Multi-Dimensional Frequent Pattern Mining[M].Master Thesis.BC,Canada: Simon Fraser University,2001.

共引文献41

同被引文献27

引证文献5

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部