The Study of Frequent Access Pages Mining Based on Web Log
WANG Tao-wei
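Abstract
Mining maximal frequent itemsets is a key problem in many data mining applications. Building on the classical Apriori algorithm, this paper presents an SQL-based Apriori algorithm. After the Web log data have been preprocessed, the algorithm is used to mine the maximal frequent access page sets. Experimental results show that the algorithm is efficient and that its output can help improve website construction.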
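The idea of mining maximal frequent page sets with a level-wise Apriori pass can be sketched as follows. This is only a minimal in-memory illustration in Python, not the SQL-based formulation the paper proposes, which the abstract does not detail. The function names `frequent_itemsets` and `maximal_itemsets`, the sample sessions, and the support threshold are all assumptions introduced for this example.

```python
from itertools import combinations


def frequent_itemsets(sessions, min_support):
    """Level-wise Apriori: return {frozenset(pages): support count}."""
    sessions = [frozenset(s) for s in sessions]

    # Level 1: count how many sessions contain each single page.
    counts = {}
    for s in sessions:
        for page in s:
            key = frozenset([page])
            counts[key] = counts.get(key, 0) + 1
    current = {k: c for k, c in counts.items() if c >= min_support}
    frequent = dict(current)

    k = 2
    while current:
        # Join step: union pairs of frequent (k-1)-sets that form a k-set.
        candidates = set()
        for a, b in combinations(list(current), 2):
            union = a | b
            if len(union) != k:
                continue
            # Prune step: every (k-1)-subset must itself be frequent.
            if all(frozenset(sub) in current
                   for sub in combinations(union, k - 1)):
                candidates.add(union)

        # Count step: one pass over the sessions for this level.
        current = {}
        for c in candidates:
            support = sum(1 for s in sessions if c <= s)
            if support >= min_support:
                current[c] = support
        frequent.update(current)
        k += 1
    return frequent


def maximal_itemsets(frequent):
    """Keep only frequent sets that have no frequent proper superset."""
    return {s: c for s, c in frequent.items()
            if not any(s < other for other in frequent)}


# Hypothetical sessions, as might result from Web log preprocessing.
sessions = [
    ["/index", "/news", "/about"],
    ["/index", "/news"],
    ["/index", "/news", "/contact"],
    ["/about", "/contact"],
]
maximal = maximal_itemsets(frequent_itemsets(sessions, min_support=2))
```

With these sample sessions and a minimum support of 2, the only frequent pair is `/index` together with `/news`, so the single pages `/index` and `/news` are absorbed into that pair, while `/about` and `/contact` remain maximal on their own.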
Source
Computer Systems & Applications (《计算机系统应用》), 2006, No. 10, pp. 30-34 (5 pages)