摘要
随着3G时代的到来,手机上网已逐步普及,由于手机屏幕较小及上网带宽限制,需要为手机访问者提供只需保留原Web站点主干分支的WAP子网。WWW上用户的访问路径信息会被记录在Web服务器的日志记录中,分析这些日志并挖掘出用户的主要行为模式,可以提取出Web网站被频繁访问的主干部分。首先将原始日志序列转化成用户访问路径会话集UVPSD,然后通过约束的加权网站结构图WWSSG,最终实现了此Web站点的频繁主干子网的发现。在上海社区网上采用此算法提取出的3GWAP子网,实验数据表明,该子网覆盖了上海社区网的大部分热门栏目页面。
With the age of 3G,it is popular to visit WWW using mobile phone.Because of the small screen and slow net speed, it is better to provide a major sub Web site for the mobile phone visitors.The behavior of the Web page readers is imprinted in the Web server log files.Analyzing and exploring regularities in this behavior can find the high frequency visit path.Firstly,in this paper,converts the original sequence of log data into User Visit Path Session Dataset(UVPSD),then implements the discovery of major sub Web site structure by using reduced Weighted Web Site Structure Graph(WWSSG).This paper applies the algorithm on Shanghai community services Web site to get the 3G WAP major sub net.The experiment data indicates the sub net covers the major popular pages of the Web site.
出处
《计算机工程与应用》
CSCD
北大核心
2009年第18期132-134,197,共4页
Computer Engineering and Applications
基金
国家重点基础研究发展规划(973)No.2005CB321904~~