一种基于排序FP-TREE挖掘最大频繁模式的高效算法被引量：1

An Efficient Algorithm for Mining Maximal Frequent Patterns Based on Sorted FP-TREE

下载PDF

导出

摘要提出了一种挖掘最大频繁模式的有效算法SFP-MFP,给出了最大频繁模式树MFP-TREE的定义,并使用SFP-TREE结构存储挖掘结果,采用了有效的子集检查方法,极大地降低了算法的时空开销,提高了挖掘效率.理论分析和实验表明,该算法的执行效率较其他同类算法有明显改进. An efficient algorithm for mining maximal frequent patterns （SFP-MFP）, based on sorted FP-TREE, is proposed. Maximal frequent patterns tree （MFP-TREE） is defined, A structure of SFP-TREE, which replaces MFP- TREE,was used to store all maximal frequent item sets, and some subset checking approaches were adopted to im- prove it. Therefore, the proposed algorithm greatly cut down the cost of space and memory, and improved the mining efficiency. Theoretical analysis and experimental results show that the executing performance of the new algorithm proposed is better than other similar algorithms.

作者段仰广韦玉科

机构地区广东工业大学计算机学院

出处《广东工业大学学报》 CAS 2009年第2期64-68,共5页 Journal of Guangdong University of Technology

基金国家科技支撑计划子课题(2006BAI08B01-03)

关键词最大频繁模式排序FP-TREE 关联分析数据挖掘 maximal frequent pattern sorted FP-TREE association rule data mining

分类号 TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献9

1Agrawal R, Srikant R. Fast Algorithms for Mining Association Rules [ C ]//VLDB' 94.1994. 487 -499.
2Han Jia-wei, PEI Ji-an, YIN Yi-wen. Mining frequent patterns without candidate genera-tion [ C ] ////Proceedings of ACM SIGMOD'00 International Conference on Management of Data. New York : ACM Press ,2000 : 1-12.
3Bayardor. Efficiently mining long patterns from databases [ C ]//HAAS LM, ed. Proceedings of the ACM SIGMOD International Conference on Management of Data. New York: ACM Press, 1998.85-93.
4Lindi, Kedem Z M. Pincer-Search:A new algorithm for discovering the maximum frequent set [ C ] //Schek H J, ed. Proceedings of the 6th European Conference on Extending Database Technology. Heidelberg: Springer-Verlag, 1998. 105-119.
5路松峰,卢正鼎.快速开采最大频繁项目集[J].软件学报,2001,12(2):293-297. 被引量：113
6宋余庆,朱玉全,孙志挥,陈耿.基于FP-Tree的最大频繁项目集挖掘及更新算法[J].软件学报,2003,14(9):1586-1592. 被引量：164
7冯志新,钟诚.基于FP-tree的最大频繁模式挖掘算法[J].计算机工程,2004,30(11):123-124. 被引量：18
8秦亮曦,李谦,史忠植.基于排序FP-树的频繁模式高效挖掘算法[J].计算机科学,2005,32(4):31-33. 被引量：13
9Jiawei Han,Micheline Kamber.数据挖掘:概念与技术[M].范明,孟小峰等译.北京:机械出版社,2008:157-159.

二级参考文献15

1[1]Han J, Kambe M. Data Mining: Concepts and Techniques. Morgaan Kaufmann Publishers, San Francisco, CA, 2001
2[2]Agrawal R, Srikant R. Fast Algorithms for Mining Association Rules. Proc. 20th Int'l Conf. Very Large Databases, Santiago, Chile, 1994-09: 497-499
3[3]Han J, Pei J, Yin Y. Mining Frequent Patterns Without Candidate Generation. In Proc. 2000 ACM-SIGMOD Int. Conf. Management of Data (SIGMOD'00), Dalas, TX, 2000-05: 1-12
4[4]Pei J. Pattern Growth Methods for Frequent Pattern Mining [doctor thesis]. Simon Fraser University, 2002-06-13
5[5]Burdick D, Calimlim M, Gehrke J. MAFIA: A Maximal Frequent Itemset Algorithm for Transactional Databases. In Int'l Conf. on Data Engineering, 2001-04
6Lin Dao I，Proc the 6th European Conference on Extending Database Technology，1998年，105页
7Agrawal R，Proc the 11th Inter Conference on Data Engineering，1995年，3页
8http://www. cs. helsinki. fi/u/goethals/
9http://www. ics. uci. edu/～mlearn/MLRepository. html
10Agrawal R, Imielinski T, Swami A. Mining association rules between sets of items in large database. In: P Buneman, S Jajodia eds. Proc. of 1993 ACM SIGMOD Conf. on Management of Data. Washington DC: ACM Press, 1993. 207～216

共引文献244

1谢志强,朱孟杰,杨静.基于改进FP-树的最大项目集挖掘算法[J].计算机应用研究,2009,26(2):502-505. 被引量：1
2王华金,兰红.一种基于FP-tree挖掘最大频繁模式的改进算法[J].长春工程学院学报（自然科学版）,2007,8(1):59-62. 被引量：1
3姜晗,贾泂.基于标记域FP-Tree快速挖掘最大频繁项集[J].计算机研究与发展,2007,44(z2):334-349. 被引量：4
4杨种学.基于并行FP-growth算法挖掘网上关联交易规则[J].南京晓庄学院学报,2005,21(5):65-70.
5王盛,董黎刚,李群.一种基于逆序编码的关联规则挖掘研究[J].杭州电子科技大学学报（自然科学版）,2010,30(5):169-172. 被引量：1
6陈晴光,李际军.汽车ERP中关联规则挖掘与动态更新的实现策略[J].机械制造,2004,42(6):69-72. 被引量：2
7杨君锐.逆向启发式开采最大频繁项目集[J].计算机工程,2004,30(14):116-118. 被引量：1
8朱玉全,宋余庆,陈耿.约束最大频繁项目集的增量式更新算法[J].计算机工程,2004,30(18):31-32.
9杨君锐,赵群礼.一种不产生候选集的最大频繁集快速挖掘算法[J].微电子学与计算机,2004,21(11):125-128. 被引量：4
10张莹,韩芳溪,柴乔林.基于频繁模式树的AOI聚类算法[J].计算机工程与应用,2004,40(35):178-179.

同被引文献15

1宋明,刘宗田.基于数据交叠分区的并行DBSCAN算法[J].计算机应用研究,2004,21(7):17-20. 被引量：9
2何中胜,刘宗田,庄燕滨.基于数据分区的并行DBSCAN算法[J].小型微型计算机系统,2006,27(1):114-116. 被引量：16
3贺玲,吴玲达,蔡益朝.数据挖掘中的聚类算法综述[J].计算机应用研究,2007,24(1):10-13. 被引量：235
4郗洋.基于云计算的并行聚类算法研究[D].南京:南京邮电大学,2012.
5孙吉贵,刘杰,赵连宇.聚类算法研究[J].软件学报,2008(1):48-61. 被引量：1109
6CHEN M, HAN J, YU P S. Data mining:an overview from a database perspective [ J ]. IEEE Transactions on Knowl- edge and Data Engineering, 1996,8 (6) :866-883.
7XU R, WUNSCH D. Survey of clustering algorithms [ J ]. IEEE Transaction on Neural Networks, 2005,16 ( 3 ) : 645-678.
8ESTER M, KRIEGEL H, JORG S. A densith-based algo- rithm for discovering clusters in large spatial databases with noise [ C ] // Proceedings of International Conference on Knowledge Discovery & Data Mining. Portland: AAAI, 1996 : 226 -231.
9BECKMANN N, KRIEGEL H, SCHNEIDER R, et al. The R* -tree : an efficient and robust access method for points and rectangles [ J ]. International Conference on Management of Data, 1990, 19(2):322-331.
10DEAN J, GHEMAWAT S. MapReduce: simplified data processing on large clusters [ J ]. Communications of the ACM ,2008,51 ( 1 ) : 107-113.

引证文献1

1蔡永强,陈平华,李惠.基于云计算平台的并行DBSCAN算法[J].广东工业大学学报,2016,33(1):51-56. 被引量：3

二级引证文献3

1曾瑛,李星南,刘新展.电力通信大数据并行化聚类算法研究[J].电子技术应用,2018,44(5):1-4. 被引量：13
2史爱武,尹杰,范平.Spark并行化改进的SDKB-DBSCAN聚类算法[J].现代计算机,2021,27(14):14-20.
3罗绍辉,罗奕俊.融合KMeans++与DBSCAN算法的工程车辆轨迹聚类研究[J].城市勘测,2023(2):27-30. 被引量：2

广东工业大学学报

2009年第2期

浏览历史

内容加载中请稍等...

一种基于排序FP-TREE挖掘最大频繁模式的高效算法被引量：1

参考文献9

二级参考文献15

共引文献244

同被引文献15

引证文献1

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

一种基于排序FP-TREE挖掘最大频繁模式的高效算法 被引量：1

参考文献9

二级参考文献15

共引文献244

同被引文献15

引证文献1

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

一种基于排序FP-TREE挖掘最大频繁模式的高效算法被引量：1