
Mining Web Logs to Discover a 3G WAP Sub-site (Web日志挖掘中3GWAP子网的获取研究)
Abstract: With the arrival of the 3G era, browsing the Web from mobile phones has become widespread. Because phone screens are small and mobile bandwidth is limited, mobile visitors are better served by a WAP sub-site that keeps only the main branches of the original Web site. Users' access paths on the WWW are recorded in Web server logs; analyzing these logs and mining the users' dominant behavior patterns makes it possible to extract the frequently visited backbone of a site. This paper first converts the raw log sequence into a User Visit Path Session Dataset (UVPSD), then builds a constrained (reduced) Weighted Web Site Structure Graph (WWSSG), and from it discovers the frequently visited backbone sub-site. The algorithm is applied to the Shanghai community services Web site to extract a 3G WAP sub-site, and the experimental data indicate that this sub-site covers most of the site's popular column pages.
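Author: Bao Yu (鲍钰)
Source: Computer Engineering and Applications (《计算机工程与应用》), CSCD, Peking University Core Journal, 2009, No. 18, pp. 132-134, 197 (4 pages)
Funding: National Key Basic Research Development Program of China (973 Program), No. 2005CB321904
Keywords: Web log; Discover User Visit Path Session Dataset (DUVPSD) algorithm; Weighted Web Site Structure Graph (WWSSG) generation algorithm; 3G Wireless Application Protocol (3G WAP)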
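
The pipeline outlined in the abstract (log records to visit-path sessions, sessions to a weighted site graph, graph to a frequent backbone) can be illustrated with a minimal Python sketch. This is not the paper's DUVPSD or WWSSG algorithm, whose details the abstract does not give. The function names, the `(user, timestamp, url)` record format, the 30-minute session timeout, and the `min_weight` edge threshold are all assumptions made for illustration.

```python
from collections import defaultdict
from datetime import datetime, timedelta

# Assumed inactivity timeout for splitting sessions; not taken from the paper.
SESSION_GAP = timedelta(minutes=30)


def build_sessions(records, gap=SESSION_GAP):
    """Group (user, timestamp, url) records into per-user visit paths."""
    by_user = defaultdict(list)
    for user, ts, url in sorted(records, key=lambda r: (r[0], r[1])):
        by_user[user].append((ts, url))
    sessions = []
    for visits in by_user.values():
        path = [visits[0][1]]
        for (prev_ts, _), (ts, url) in zip(visits, visits[1:]):
            if ts - prev_ts > gap:  # long pause: close the current session
                sessions.append(path)
                path = []
            path.append(url)
        sessions.append(path)
    return sessions


def build_weighted_graph(sessions):
    """Count how often each directed link (src -> dst) is traversed."""
    weights = defaultdict(int)
    for path in sessions:
        for src, dst in zip(path, path[1:]):
            if src != dst:  # ignore page reloads
                weights[(src, dst)] += 1
    return dict(weights)


def frequent_subsite(weights, root, min_weight):
    """Keep edges with weight >= min_weight that are reachable from root."""
    adjacency = defaultdict(list)
    for (src, dst), w in weights.items():
        if w >= min_weight:
            adjacency[src].append(dst)
    kept, stack, seen = {}, [root], {root}
    while stack:
        node = stack.pop()
        for nxt in adjacency[node]:
            kept[(node, nxt)] = weights[(node, nxt)]
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return kept
```

Pruning by a fixed edge-weight threshold and keeping only what stays reachable from the home page is one simple way to obtain a connected backbone. The paper's constrained WWSSG may apply different or additional constraints.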


