摘要
Web日志挖掘是将数据挖掘技术应用到Web服务器的日志中,发现Web用户的行为模式。在介绍了典型的数据预处理技术的基础上,指出Frame页面降低了挖掘结果的兴趣性,并提出相应的解决方法--Frame页面过滤算法消除其影响。通过实验数据对该算法进行验证,说明Frame页面过滤算法可以显著地提高Web日志挖掘结果的兴趣性。
Web usage mining is the application of data mining to Web server logs in order to discover the behavior patterns of Web site visitors. After introduction of some typical Web log preprocessing techniques, it is pointed out that the frame pages in a Web site can reduce the interestingness of the result page groups. Then, a frame-filtering algorithm is proposed to solve this problem. Our experiments show that this algorithm can effectively reveal new interesting page groups, which would not be found without frame filtering.
出处
《计算机工程》
CAS
CSCD
北大核心
2001年第2期76-77,共2页
Computer Engineering