摘要
用户频繁访问模式的发现是Web日志挖掘的重要研究内容。提出了一种先求两两用户访问模式的交集结果再生成候选频繁访问模式,然后扫描数据库,统计各个候选频繁访问模式的支持度计数的GITC算法。经过理论分析和实验验证,该算法能有效地发现用户频繁访问模式。
The user frequent access patterns discovery is an important task of Web log mining study.The paper proposes GITC algorithm.The algorithm first gets the intersections of each two user access patterns and gives birth to candidate frequent access patterns,then takes count of the number of each candidate frequent access pattern by scanning the original database.Theory anal- ysis and experimental results show that the GITC algorithm can discover user frequent access patterns effectively.
出处
《计算机工程与应用》
CSCD
北大核心
2007年第7期191-194,共4页
Computer Engineering and Applications
基金
合肥工业大学科研发展基金资助项目(No.030503F)