期刊文献+

云环境下基于二进制编码聚类的并行频繁项集挖掘算法 被引量:1

A Parallel Frequent Itemsets Mining Algorithm Based on Binary Coding and Clustering under Cloud Environment
在线阅读 下载PDF
导出
摘要 本文提出了一种云环境下基于二进制编码的并行频繁项集挖掘算法,利用一种特殊的二进制编码的依赖度计量方法对原始数据集合进行编码转换及依赖度聚类,然后将数据集分布部署在云环境中,并采用共享多头表的FP-Growth并行改进算法挖掘频繁项集.实验表明,对于大规模数据集来说,本文算法可以取得良好的性能. This paper proposes a parallel frequent itemsets mining algorithm based on binary coding under cloud environment.A special binary coding dependency calculating method is adopted to transfer the raw data and cluster based on dependency,then the data is distributed deployed in cloud environment and the parallel improved algorithm of FP-Growth based on shared multi-head table is used to mine frequent item sets.Experiments show that the algorithm performed nicely with large scale of data sets.
出处 《微电子学与计算机》 CSCD 北大核心 2012年第11期62-65,共4页 Microelectronics & Computer
基金 国家自然科学基金(61070047 61070133) 江苏省自然科学基金(BK2010134) 江苏省教育厅自然科学基金(11KJD520011)
关键词 云计算 二进制编码 聚类 并行 频繁项集 cloud computing binary coding clustering parallel frequent itemsets
  • 相关文献

参考文献6

  • 1Agrawal Rakesh, Ramakrishnan Srikant. Fast algo-rithms for mining association rules in large databases[C] // Proceedings of 20th International Conference onVery Large Data Bases, 1994 ; 487-499.
  • 2Jiawei Han, Jian Pei,Yiwen Yin. Mining frequentpatterns without candidate generation[[C] // Proceed-ings of the ACM SIGMOD International Conference onManagement of Data, 2000,29(2): 1-12.
  • 3Zaiane O R,El-Hajj M,Lu P. Fast parallel associa-tion rule mining without candidacy generation [C] //Proceedings of IEEE International Conference on DataMining, 2001 : 665-668.
  • 4Javed A, Khokhar A. Frequent Pattern Mining onMessage Passing Muhiprocessor Systems[J]. Distrib-uted and Parallel Databases, 2004 ,16(3) : 321-334.
  • 5Haoyuan Li, Yi Wang, Dong Zhang,Ming Zhang,and Edward Y. Chang. PFP : Parallel FIMirowth forQuery Recommendation[C]// Proceedings of the 2008ACM Conference on Recommender Systems, 2008 :107-114.
  • 6McCormick W T,Sehweitzer P J. Problem decomposi-tion and data reorganization by a clustering technique[J]. Operations Research, 1972?20(5): 993-1009.

同被引文献9

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部