期刊文献+

关联规则算法的实现与改进 被引量:14

Realization and Optimization of Association Rule Mining Algorithm
在线阅读 下载PDF
导出
摘要 关联规则作为一种数据挖掘的工具,它能够发现数据项集之间有趣的关联。在关联规则的算法中,Apriori算法是其中的关键算法之一。面对大量复杂的数据集,怎样选择数据结构,怎样优化处理过程,对于此算法的性能将会十分重要。该文首先介绍了关联规则的原理和Apriori算法的实现,然后提出了对该算法的若干改进,例如:采用树型结构存取频繁项集,使用三种缓存优化的方法等。这些优化都能够在整体上提高算法的效率。对于大数据项,试验显示,这些改进能够正确、有效、快速地实现Apriori算法。 Association rule mining searches for interesting relationships among items.Association rule is now used as one of efficient data mining tools.In all association rule algorithms ,the apriori algorithm is one of the keys.To huge scale and complex of data sets,how to choose the data structure,how to optimize the process,are all very important for the performance of the algorithm.This paper first introduces the principle of association rule and its realization by apri-ori algorithm.Then some improvements of apriori algorithm are proposed,such as creating a tree structure to store all frequent itemsets and using three kinds of cache optimization methods.These optimization can improve the algorithm ef-ficiency at the whole.The experiments show these improvements proposed in this paper can realize the Apriori algorithm accurately,efficiently and quickly in large data sets.
出处 《计算机工程与应用》 CSCD 北大核心 2002年第24期190-192,共3页 Computer Engineering and Applications
关键词 关联规则算法 数据挖掘 频繁项集 数据库 APRIORI算法 Data mining,Association rule,Frequent Itemset
  • 相关文献

参考文献8

  • 1R Agrawal,T Imielinski,A Swami.Mining association rules between sets of items in large databases[C].In:Proc 1993 ACM-SIGMO D Int Conf Management of Data(SIGMOD'93),Washington,DC, 1993
  • 2R Agrawal,R Srikant. Fast Algorithms for Mining Association Rules [C].In: Proceedings of the VLDB Conference,Santiago, Chile, 1994:487~499
  • 3Lebeck A R,Wood D A.Cache Profiling and the SPEC Benchmarks:A Case[J].IEEE Computer, 1994; 27 (10): 15~26
  • 4J Han,M Kamber. Data Mining Concepts and Techniques[M].Morgan Kaufmann Publisher,2001
  • 5J S Park,M S Chen,P S Yu.An effective hash-based algorithm for mining association rules[C].In:Proc 1995 ACM-SIGMOD Int Conf Management of Data,San Jose,CA, 1995:175~186
  • 6M J Zaki.Parallel and distributed association mining:A survey[J].IEEE Concurrency,Special Issue on Parallel Mech-anisms for Data Mining, 1999 ;7(4): 14~25
  • 7A Savasere,E Omiecinski,S Navathe. An efficient algorithm for mining association rules in large databases[C].In:Proceedings of the 21st VLDB Conference ,Zurich,Switzerland, 1995:432~444
  • 8Srikant R,Agrawal R.Mining Quantitative Association Rules in Large Relational Tables[J].Proceedings of ACM SIGMOD International conference on Management of Data, 1996:1~12

同被引文献63

引证文献14

二级引证文献62

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部