期刊文献+

一种基于排序FP-TREE挖掘最大频繁模式的高效算法 被引量:1

An Efficient Algorithm for Mining Maximal Frequent Patterns Based on Sorted FP-TREE
在线阅读 下载PDF
导出
摘要 提出了一种挖掘最大频繁模式的有效算法SFP-MFP,给出了最大频繁模式树MFP-TREE的定义,并使用SFP-TREE结构存储挖掘结果,采用了有效的子集检查方法,极大地降低了算法的时空开销,提高了挖掘效率.理论分析和实验表明,该算法的执行效率较其他同类算法有明显改进. An efficient algorithm for mining maximal frequent patterns (SFP-MFP), based on sorted FP-TREE, is proposed. Maximal frequent patterns tree (MFP-TREE) is defined, A structure of SFP-TREE, which replaces MFP- TREE,was used to store all maximal frequent item sets, and some subset checking approaches were adopted to im- prove it. Therefore, the proposed algorithm greatly cut down the cost of space and memory, and improved the mining efficiency. Theoretical analysis and experimental results show that the executing performance of the new algorithm proposed is better than other similar algorithms.
出处 《广东工业大学学报》 CAS 2009年第2期64-68,共5页 Journal of Guangdong University of Technology
基金 国家科技支撑计划子课题(2006BAI08B01-03)
关键词 最大频繁模式 排序FP-TREE 关联分析 数据挖掘 maximal frequent pattern sorted FP-TREE association rule data mining
  • 相关文献

参考文献9

  • 1Agrawal R, Srikant R. Fast Algorithms for Mining Association Rules [ C ]//VLDB' 94.1994. 487 -499.
  • 2Han Jia-wei, PEI Ji-an, YIN Yi-wen. Mining frequent patterns without candidate genera-tion [ C ] ////Proceedings of ACM SIGMOD'00 International Conference on Management of Data. New York : ACM Press ,2000 : 1-12.
  • 3Bayardor. Efficiently mining long patterns from databases [ C ]//HAAS LM, ed. Proceedings of the ACM SIGMOD International Conference on Management of Data. New York: ACM Press, 1998.85-93.
  • 4Lindi, Kedem Z M. Pincer-Search:A new algorithm for discovering the maximum frequent set [ C ] //Schek H J, ed. Proceedings of the 6th European Conference on Extending Database Technology. Heidelberg: Springer-Verlag, 1998. 105-119.
  • 5路松峰,卢正鼎.快速开采最大频繁项目集[J].软件学报,2001,12(2):293-297. 被引量:113
  • 6宋余庆,朱玉全,孙志挥,陈耿.基于FP-Tree的最大频繁项目集挖掘及更新算法[J].软件学报,2003,14(9):1586-1592. 被引量:164
  • 7冯志新,钟诚.基于FP-tree的最大频繁模式挖掘算法[J].计算机工程,2004,30(11):123-124. 被引量:18
  • 8秦亮曦,李谦,史忠植.基于排序FP-树的频繁模式高效挖掘算法[J].计算机科学,2005,32(4):31-33. 被引量:13
  • 9Jiawei Han,Micheline Kamber.数据挖掘:概念与技术[M].范明,孟小峰等译.北京:机械出版社,2008:157-159.

二级参考文献15

  • 1[1]Han J, Kambe M. Data Mining: Concepts and Techniques. Morgaan Kaufmann Publishers, San Francisco, CA, 2001
  • 2[2]Agrawal R, Srikant R. Fast Algorithms for Mining Association Rules. Proc. 20th Int'l Conf. Very Large Databases, Santiago, Chile, 1994-09: 497-499
  • 3[3]Han J, Pei J, Yin Y. Mining Frequent Patterns Without Candidate Generation. In Proc. 2000 ACM-SIGMOD Int. Conf. Management of Data (SIGMOD'00), Dalas, TX, 2000-05: 1-12
  • 4[4]Pei J. Pattern Growth Methods for Frequent Pattern Mining [doctor thesis]. Simon Fraser University, 2002-06-13
  • 5[5]Burdick D, Calimlim M, Gehrke J. MAFIA: A Maximal Frequent Itemset Algorithm for Transactional Databases. In Int'l Conf. on Data Engineering, 2001-04
  • 6Lin Dao I,Proc the 6th European Conference on Extending Database Technology,1998年,105页
  • 7Agrawal R,Proc the 11th Inter Conference on Data Engineering,1995年,3页
  • 8http://www. cs. helsinki. fi/u/goethals/
  • 9http://www. ics. uci. edu/~mlearn/MLRepository. html
  • 10Agrawal R, Imielinski T, Swami A. Mining association rules between sets of items in large database. In: P Buneman, S Jajodia eds. Proc. of 1993 ACM SIGMOD Conf. on Management of Data. Washington DC: ACM Press, 1993. 207~216

共引文献244

同被引文献15

  • 1宋明,刘宗田.基于数据交叠分区的并行DBSCAN算法[J].计算机应用研究,2004,21(7):17-20. 被引量:9
  • 2何中胜,刘宗田,庄燕滨.基于数据分区的并行DBSCAN算法[J].小型微型计算机系统,2006,27(1):114-116. 被引量:16
  • 3贺玲,吴玲达,蔡益朝.数据挖掘中的聚类算法综述[J].计算机应用研究,2007,24(1):10-13. 被引量:235
  • 4郗洋.基于云计算的并行聚类算法研究[D].南京:南京邮电大学,2012.
  • 5孙吉贵,刘杰,赵连宇.聚类算法研究[J].软件学报,2008(1):48-61. 被引量:1109
  • 6CHEN M, HAN J, YU P S. Data mining:an overview from a database perspective [ J ]. IEEE Transactions on Knowl- edge and Data Engineering, 1996,8 (6) :866-883.
  • 7XU R, WUNSCH D. Survey of clustering algorithms [ J ]. IEEE Transaction on Neural Networks, 2005,16 ( 3 ) : 645-678.
  • 8ESTER M, KRIEGEL H, JORG S. A densith-based algo- rithm for discovering clusters in large spatial databases with noise [ C ] // Proceedings of International Conference on Knowledge Discovery & Data Mining. Portland: AAAI, 1996 : 226 -231.
  • 9BECKMANN N, KRIEGEL H, SCHNEIDER R, et al. The R* -tree : an efficient and robust access method for points and rectangles [ J ]. International Conference on Management of Data, 1990, 19(2):322-331.
  • 10DEAN J, GHEMAWAT S. MapReduce: simplified data processing on large clusters [ J ]. Communications of the ACM ,2008,51 ( 1 ) : 107-113.

引证文献1

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部