期刊文献+

改进的最大频繁项集挖掘算法 被引量:2

Improved algorithm for mining maximal frequent itemsets
在线阅读 下载PDF
导出
摘要 为了提高挖掘关联规则的效率,提出基于改进FP-Tree结构的最大频繁项集挖掘算法。介绍并分析了挖掘最大频繁项集的过程和现有算法,指出现有算法中耗时的关键步骤。克服了MMFI算法中需要反复从头表出发沿相同项目结点链搜索右侧结点的缺点,提出一种改进的最大频繁项集挖掘算法IMMFI。通过在有序FP-Tree中引入叶子链,用沿叶子链搜索取代沿同层结点链搜索,有效地减少了搜索的次数,提高了算法的效率。实验结果表明了该算法的性能良好。 To improve the efficiency of mining the association rules, an algorithm for mining maximal frequent itemsets based on improved frequent pattern tree is presented. Firstly, the process of mining maximal frequent itemsets and existing algorithms are introduced and analyzed, and the primary approach costing time in existing algorithms is indicated. An improved algorithm IMMFI (improved mining maximal frequent itemsets) for mining maximal frequent itemsets is proposed overcoming the disadvantage of searching the right nodes along the same nodes link from head table in algorithm MMFI repeatedly. The times of searching is reduced and the efficiency is improved through introducing the leaf nodes link to the improved frequent pattern tree and replacing searching along the same nodes link with along the leaf nodes link. Finally, the performance of the algorithm IMMFI is demonstrated by instance and experiment.
作者 陈晨 鞠时光
出处 《计算机工程与设计》 CSCD 北大核心 2010年第18期4009-4011,4032,共4页 Computer Engineering and Design
基金 国家自然科学基金项目(60773049)
关键词 数据挖掘 关联规则 最大频繁项集 超集检测 频繁模式树 data mining association rules maximal frequent itemset superset checking frequent pattern tree
  • 相关文献

参考文献8

二级参考文献34

  • 1Agrawal R,Imietinski T, Swami A.Mining association rules between sets of items in large database[C].Washington:Proceeding of the ACM SIGMOD International Conference on Management of Data, 1993:207-216.
  • 2Agrawal R,Srikant.Fast algorithms for mining association rules [C]. Proceeding of the 20th International Conference on Very Large Databases, 1994:487-499.
  • 3Han J, Pei J,Yin Y.Mining frequent patterns without candidate generation[C].Dallas:Proceeding of the ACM SIGMOD Intema- tional Conference on Management of Data,2000:1-12.
  • 4Bayardo R.Efficiently mining long patterns from databases[C]. New York: Proceeding of 1998 ACM SIGMOD International Conference on Management of Data,1998:85-93.
  • 5Burdick D,Calimlim M,Flannick J,et al.MAFIA:A maximal frequent itemset algorithm [J]. IEEE Transactions on Knowledge and Data Engineering,2005(11): 1490-1504.
  • 6Gouda K,Zaki MJ.Efficiently mining maximal frequent itemsets [C].Proceeding of the IEEE International Conference on Data Mining,2001:163 - 170.
  • 7Zhou QH,Wesley C,Lu BJ.SmartMiner:A depth 1st algorithm guided by tail information for mining maximal frequent itemsets [C].Proceeding of the IEEE International Conference on Data Mining,2002:570-577.
  • 8Grahne G,Zhu JF.High performance mining of maximal frequent itemsets[C].Proceeding of the 6th SIAM International Workshop on High Performance Data Mining,2003:135-143.
  • 9Grahne G.,Zhu JF.Fast algorithms for frequent itemset mining using FP-trees [J]. IEEE Transactions on Knowledge and Data Engineering,2005(10): 1347-1362.
  • 10http://www.almaden.ibm.com/cs/people/bayardo/resources. html[EB/OL].

共引文献239

同被引文献21

引证文献2

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部