期刊文献+

基于图的四叉链表存储结构的最大频繁项集挖掘算法

The maximum frequent item set mining algorithm based on the four-fork linked storage structure
在线阅读 下载PDF
导出
摘要 虽然已有的最大频繁项集挖掘算法在结构和技术上已经做了很多改进,但还是存在挖掘速度慢、效率低的缺点,在此提出了图的四叉链表存储结构和基于该存储结构的最大频繁项集挖掘算法,该结构具有一次生成多次使用,不必耗用额外的存储空间等特点,基于该存储结构的最大频繁项集挖掘算法充分利用了该存储结构的特点以及频繁扩展集的性质,有效地减少了冗余候选集的生成,降低了串的冗余存储,将串集合间的比较转化为整型数组的比较,从而使得它比已有的最大频繁项集挖掘算法在挖掘效率上有了明显的提高,最后通过实验证明了该算法较其他已有算法效率有了较大的提高. Although a variety of improvements have been done on the existing maximum frequent item mining algorithms in terms of structures and technologies, they still suffer from low efficiency. Given these shortcomings of the existing algorithms, we propose the quad-pointer linked list structure for graph and the maximum frequent item mining algorithm based on this structure. This structure possesses once-created-multiple-used property, without the need for extra storage space. This structure property and the characteristics of the frequent extension set are utilized fully by our algorithm, which effectively reduce the redundancy for the candidate generation and storage. Besides, we convert the comparison between strings into the comparison between integer arrays, which improves the efficiency greatly for the maximum frequent item mining algorithm. Through the experiments, the efficiency of our algorithm is proved to outperform the other existing algorithms.
出处 《应用科技》 CAS 2013年第1期76-79,共4页 Applied Science and Technology
基金 国家自然科学基金资助项目(60975071) 黑龙江省教育厅科学技术研究资助项目(12513055)
关键词 四叉链表 频繁项集 存储结构 挖掘算法 four-fork link frequent item set storage structure mining algorithm
  • 相关文献

参考文献9

二级参考文献48

  • 1李庆华,王卉,蒋盛益.挖掘最大频繁项集的并行算法[J].计算机科学,2004,31(12):132-134. 被引量:5
  • 2颜跃进,李舟军,陈火旺.基于FP-Tree有效挖掘最大频繁项集[J].软件学报,2005,16(2):215-222. 被引量:69
  • 3宋余庆,朱玉全,孙志挥,杨鹤标.一种基于频繁模式树的约束最大频繁项目集挖掘及其更新算法[J].计算机研究与发展,2005,42(5):777-783. 被引量:21
  • 4王黎明,赵辉.基于FP树的全局最大频繁项集挖掘算法[J].计算机研究与发展,2007,44(3):445-451. 被引量:16
  • 5Agrawal R,Imietinski T, Swami A.Mining association rules between sets of items in large database[C].Washington:Proceeding of the ACM SIGMOD International Conference on Management of Data, 1993:207-216.
  • 6Agrawal R,Srikant.Fast algorithms for mining association rules [C]. Proceeding of the 20th International Conference on Very Large Databases, 1994:487-499.
  • 7Han J, Pei J,Yin Y.Mining frequent patterns without candidate generation[C].Dallas:Proceeding of the ACM SIGMOD Intema- tional Conference on Management of Data,2000:1-12.
  • 8Bayardo R.Efficiently mining long patterns from databases[C]. New York: Proceeding of 1998 ACM SIGMOD International Conference on Management of Data,1998:85-93.
  • 9Burdick D,Calimlim M,Flannick J,et al.MAFIA:A maximal frequent itemset algorithm [J]. IEEE Transactions on Knowledge and Data Engineering,2005(11): 1490-1504.
  • 10Gouda K,Zaki MJ.Efficiently mining maximal frequent itemsets [C].Proceeding of the IEEE International Conference on Data Mining,2001:163 - 170.

共引文献39

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部