期刊文献+

PFPonCanTree:一种基于MapReduce的并行频繁模式增量挖掘算法 被引量:9

PFPonCanTree:A parallel frequent patterns incremental mining algorithm based on MapReduce
在线阅读 下载PDF
导出
摘要 频繁模式挖掘是最重要的数据挖掘任务之一,传统的频繁模式挖掘算法是以"批处理"方式执行的,即一次性对所有数据进行挖掘,无法满足不断增长的大数据挖掘的需要。MapReduce是一种流行的并行计算模式,在并行数据挖掘领域已得到了广泛的应用。将传统频繁模式增量挖掘算法CanTree向MapReduce计算模型进行了迁移,实现了并行的频繁模式增量挖掘。实验结果表明,提出的算法实现了较好的负载均衡,执行效率有明显提升。 Frequent pattern mining is one of the most important data mining tasks.Traditional frequent pattern mining algorithmsare executed in a " batch" mode,that is,all the data are mined in one time,so they cannotmeet the needs of the ever-growing bigdata mining.MapReduce is a popular parallel computing modeland has been widely used in the field of parallel data mining.In this paper,we migrate the traditional frequent pattern incremental mining algorithm CanTree to the MapReduce computing model,achieving aparallel frequent pattern incremental miningalgorithm.The experimental results show that the proposed algorithm achievesbetterload balancing and improvesthe execution efficiency significantly.
出处 《计算机工程与科学》 CSCD 北大核心 2018年第1期15-23,共9页 Computer Engineering & Science
基金 安徽省高校自然科学研究项目(KJ2016A623)
关键词 数据挖掘 频繁模式挖掘 增量挖掘 MAPREDUCE HADOOP PFP data mining frequent pattern mining incremental mining MapReduce Hadoop PFP
  • 相关文献

参考文献5

二级参考文献58

  • 1施亮,钱雪忠.基于Hadoop的并行FP-Growth算法的研究与实现[J].微电子学与计算机,2015,32(4):150-154. 被引量:15
  • 2邹翔,张巍,刘洋,蔡庆生.分布式序列模式发现算法的研究[J].软件学报,2005,16(7):1262-1269. 被引量:19
  • 3刘德喜,何炎祥,邢显黎.一种新的频繁项集挖掘算法[J].计算机应用研究,2007,24(2):17-19. 被引量:8
  • 4Dean J, Ghemmawat S. MapReduce: simplied data processing on large clusters [ C ]//Proceedings of the 6th Sympesium on Operating System Design and Implementation. New York: ACM Press, 2004:137 -150.
  • 5Ranger C, Raghuraman R, Penmetsa A. Evaluating MapReduce for multicore and mutiprocessor systems [ C ] //Proceedings of the 2007 IEEE 13th International Symposium on High Performance Computer Architecture. Washington: IEEE Computer Society, 2007 : 13 -24.
  • 6Kruuf M D, Sankaralinggam K. MapReduce for the cell B.E. architecture [ R ]. Madison: University of Wisconsin - Madison, 2007.
  • 7He Bing - sheng, Fang Wen - bin, Naga K Govindaraju, et al. Mars : a MapReduce framework on graphics processors [ C ] // Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques. New York: ACM Press, 2008 : 260 "269.
  • 8Zaharia M, Konwinski A, Joseph A D. Improving MapReduce performance in heterogeneous environments [ C ] //Proceedings of the 8th USENIX Symposium on Operating Systems Design and Implementation. New York: ACM Press, 2008:29 -42.
  • 9Tomwhite.Hadoop权威指南:中文版[M].曾大聃,周傲英,译.北京:清华大学出版社,2010.
  • 10Chu Chen -tao, Kim S K, Lin Yian, et al. Map -Reduce for machine learning on muhicore [ C]//Twentieth Annual Conference on Neural Information Processing Systems, Vancouver: [ s. n. ], 2006 : 281 - 288.

共引文献69

同被引文献64

引证文献9

二级引证文献55

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部