期刊文献+

快速关联规则挖掘算法 被引量:15

A Fast Algorithm for Mining of Association Rules
在线阅读 下载PDF
导出
摘要 关联规则挖掘是数据挖掘及知识发现领域的重要研究内容之一,其核心任务是挖掘数据库中的频繁项集。Apriori及其改良算法是频繁项集挖掘的有效算法。在类Apriori的算法中,它们都采用哈希树来存储频繁项集的候补项集以便快速计算其支持度。该文在仔细分析这些算法所存在的效率瓶颈的基础上,提出了另一个有效的改进算法。所提算法通过利用一个一维数组替代已有算法中的复杂的哈希树来达到改善它们效率瓶颈的目的。通过多个实验评估,该文所提算法的挖掘效率很高,比Apriori及其改良算法要快2到5倍。 Mining of association rules is considered to be on e of the most important data mining tasks.Frequent itemset mining plays an esse ntial role in mining association rules.A lot of previous studies adopt an Apri ori-like approach,in which hash-tree is used to store candidate itemsets base d on analyzing the bottleneck of performance for Apriori-like algorithm,an eff icient algorithm for faster mining of frequent itemsets is proposed in this pa per.It adopts one-dimension array instead of the complex hash-tree structure to expedite the mining process.The several experiments assess the relative perf ormance of the algorithm in comparison with the Apriori and its extended algorit hm.The experiment evaluation shows that the algorithm is faster than both alg orithms by a factor from two to five.
出处 《计算机工程与应用》 CSCD 北大核心 2002年第11期1-4,47,共5页 Computer Engineering and Applications
基金 国家973重点基础研究发展规划项目(编号:G1999032705) 留学回国人员科研启动基金资助
关键词 频繁项集 关联规则 算法 数据挖掘 数据库 知识发现 Frequent Itemset,Associatio n Rule,Algorithm,Data Mining,Database
  • 相关文献

参考文献6

  • 1[1]C C Aggarwal,P S Yu. Mining Large Itemsets for Association Rules[J].Data Engineering Bulletin, 1998 ;21 ( 1 ) :23~31
  • 2[2]Eui-Hong Han,George Karypis,Vipin Kumar. Scalable Parallel Data Mining for Association Rules[J],IEEE Transactions on Knowledge and Data Engineering,2000; 12(3) :377~352
  • 3[3]R Agrawal,S Srikant.Fast Algorithms for Mining Association Rules[C].Proc.20th Int Conf on VLDB,Santiago,Chile,1994:487~499
  • 4[4]J S Park,M S Chen,P S Yu. An Effective Hash-Based Algorithm for Mining Association Rules[C].Proc ACM SIGMOD Int Conf Management of Data,San Jose,CA, 1995:175~186
  • 5[5]J Liu,J Yin. Towards Efficient Data Re-mining(DRM)[C].Proc PAKDD,5th Pacific-Asia Conf. Hong Kong,China,2001:406~412
  • 6[6]S D Lee,D W Cheung,B Kao.Is Sampling Useful in Data Mining?A Case in the Maintenance of Discovered Association Rules[J].Data Mining and Knowledge Discovery,Kluwer Academic Publishers,1998;2 (3): 233~262

同被引文献79

引证文献15

二级引证文献66

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部