P2P网络中最大频繁项集挖掘算法研究被引量：1

Research on maximal frequent itemset mining algorithm over P2P network

下载PDF

导出

摘要为解决P2P网络频繁项集挖掘中存在的全体频繁项集数量过多和网络通信开销较大这两个问题,提出了一种在P2P网络中挖掘最大频繁项集的算法P2PMaxSet。首先,该算法只挖掘最大频繁项集,减少了结果的数量;其次,每个节点只需与邻居节点进行结果交互,节省了大量的通信开销;最后,讨论了网络动态变化时算法的调整策略。实验结果表明,算法P2PMaxSet具有较高的准确率和较少的通信开销。 The obstacles mainly lie in numerous frequent itemsets and huge communication cost. To solve the two problems, this paper proposed a maximal itemset mining algorithm P2PMaxSet. Firstly,only considered maximal itemset,which reduced the number of itemsets greatly. Secondly,only interchanged mining results between neighbor nodes,which saved communication cost. Finally,discussed adjust strategies for dynamic environment. Experimental results show P2PMaxSet is not only accurate but also with lower communication cost.

作者邓忠军宋威郑雪峰王少杰

机构地区北京科技大学信息工程学院北方工业大学信息工程学院国家信息技术安全研究中心

出处《计算机应用研究》 CSCD 北大核心 2010年第9期3490-3492,共3页 Application Research of Computers

基金国家“863”计划资助项目(2007AA012474) 北京市优秀人才培养资助项目(2009D005002000009)

关键词数据挖掘 P2P网络最大频繁项集关联规则 data mining P2P network maximal frequent itemset association rule

分类号 TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献8

1程舒通,徐从富.关联规则挖掘技术研究进展[J].计算机应用研究,2009,26(9):3210-3213. 被引量：14
2TAYLOR I J.From P2P to Web services and grids[M].London:Springer-Verlag,2005.
3KANTERE V,TSOUMAKOS D,SELLIS T K,et al.GrouPeer:dynamic clustering of P2P databases[J].Information Systems,2009,34(1):62-86.
4DATTA S,GIANNELLA C R,KARGUPT A,et al.Approximate distributed K-means clustering over a peer-to-peer network[J].IEEE Trans on Knowledge and Data Engineering,2009,21(10):1372-1388.
5WOLFF R,SCHUSTER A.Association rule mining in peer-to-peer systems[J].IEEE Trans on Systems,Man,and Cybernetics,Part B,2004,34(6):2426-2438.
6BOUTSINAS B,SIOTOS C,GEROLIMATOS A.distributed mining of association rules based on reducing the support threshold[J].International Journal on Artificial Intelligence Tools,2008,17(6):1109-1129.
7YI Xun,ZHANG Yan-chun.Privacy-preserving distributed association rule mining via semi-trusted mixer[J].Data & Knowledge Engineering,2007,63(2):550-567.
8SONG Wei,YANG Bing-ru,XU Zhang-yan.Index-MaxMiner:a new maximal frequent itemset mining algorithm[J].International Journal on Artificial Intelligence Tools,2008,17(2):303-320.

二级参考文献24

1王俊峰,杨建华,周虹霞,谢高岗,周明天.网络测量中自适应数据采集方法(英文)[J].软件学报,2004,15(8):1227-1236. 被引量：11
2AGRAWAL R, IMIELINSKI T, SWAMI A. Mining association rules between sets of items in large databases[ C]//Proc of ACM SIGMOD International Conference on Management of Data. New York: ACM Press, 1993 : 207-216.
3AGRAWAL R, SRIKANT R. Fast algorithms for mining association rules[C]//Proc of the 20th International Conference on Very Large Data Bases. San Francisco: Morgan Kaufmann Publishers, 1994: 478-499.
4PARK J S, CHEN M S, YU P S. Using a hash based method with transaction trimming for mining association rules [ J ]. IEEE Trans on Knowledge and Data Engineering, 1997, 9(5) :813-825.
5BRIN S, MOTWANI R, ULLMAN J D, et al. Dynamic itemset counting and implication rules for market basket data [ C ]//Proc of ACM SIGMOD International Conference on Management of Data. New York: ACM Press, 1997: 255-264.
6MANNILA H, TOIVONEN H, VERKAMO A I. Efficient algorithms for discovering association rules[ C ]//Proc of the AAAI Workshop on Knowledge Discovery in Databases. Washington: AAAI Press, 1994: 181-192.
7TOIVONEN H. Sampling large databases for association rules [ C ]// Proc of the 22nd International Conference on Very Large Data Bases. Sam Francisco: Morgan Kaufmann Publishers, 1996: 134-145.
8HAN Jia-wei, PEI Jian, YIN Yi-wen. Mining frequent patterns without candidate generation [ C ]//Proc of ACM SIGMOD International Conference on Management of Data. New York : ACM Press, 2000 : 1-12.
9GRAHNE G, ZHU Jian-fei. Efficiently using prefix-trees in mining frequent itemsets [ C ]//Proc of IEEE ICDM Workshop on Frequent Itemset Mining Implementations. 2003.
10PIEPRZYK J, MORZY M. Mining generalized association roles using prutax and hierarchical bitmap index [ EB/OL ]. [ 2009-02-19]. http://www, cs. put. poznan, pl/mmorzy/papers/admkd07, pdf.

共引文献13

1陶小红.Web数据挖掘在智能选课系统中的应用研究[J].办公自动化（综合月刊）,2010(1):27-29. 被引量：2
2王卫东,屈洋.数据挖掘理念在医院病历随访系统中的应用[J].计算机技术与发展,2010,20(7):199-202. 被引量：7
3陶小红.Web数据挖掘在智能选课系统中的应用研究[J].科协论坛（下半月）,2010(2):68-70. 被引量：1
4李爱凤.基于数据挖掘技术的购物篮模式研究[J].计算机应用与软件,2011,28(12):156-158. 被引量：9
5邓广彪,蒙祖强.一种快速获取候选3项集的Apriori改进算法[J].电脑与信息技术,2012,20(1):22-25. 被引量：1
6朱晓峰,李玲娟,徐小龙,陈建新.基于MapReduce的关联规则增量更新算法[J].计算机技术与发展,2012,22(4):115-118. 被引量：15
7宋钰,何小利,张刚园.关联规则在医药云数据定向中的应用与仿真[J].计算机仿真,2013,30(2):239-242. 被引量：3
8王飞,缑锦.基于多变异粒子群优化算法的模糊关联规则挖掘[J].计算机科学,2013,40(5):217-223. 被引量：12
9李为.基于数据挖掘技术的网络违法案件分析研究[J].现代计算机（中旬刊）,2013(12):10-13.
10颜宏文,谢启龙.基于多维关联规则的电网脆弱性识别研究[J].计算机应用与软件,2015,32(11):36-40. 被引量：4

同被引文献15

1李振宇,谢高岗.基于DHT的P2P系统的负载均衡算法[J].计算机研究与发展,2006,43(9):1579-1585. 被引量：26
2蒋君,邓倩妮.eMule系统中的非均匀性分布[J].微电子学与计算机,2007,24(10):153-156. 被引量：3
3边肇祺,张学工.模式识别[M].北京:清华大学出版社,2006.
4王建.基于KAD网络监督的关键技术研究与实现[D].成都:四川大学,2012.
5Maymounkov P, Mazieres D.Kademlia: a peer-to-peer infor- matics system based on the XOR metric[C]//Proceedings of the lth International Workshop on P2P Systems, 2002: 53-65.
6Cai Hua, Zhou Chunguang, Wang Zhe, et al.Algorithm research on community mining from dynamic social network[J].Jour- hal of Jinlin University,2008,26(4) : 380-382.
7Berger-Wolf T Y, Saia J.A framework for analysis of dynamic social networks[C]//Proceeding of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006,12 : 523-528.
8Sarkar EDynamic social network analysis using latent space models[C]//Proceedings of the ACM SIGKDD Explora- tions Newsletter, 2005 : 31-35.
9飞思科技产品研发中心神经网络理论与MATLAB7实现[M]//MATLAB应用技术.北京:电子工业出版社,2005:4-90.
10赫南,李德毅,淦文燕,朱熙.复杂网络中重要性节点发掘综述[J].计算机科学,2007,34(12):1-5. 被引量：138

引证文献1

1王建,冯伟森,邱兴超,刘继,卢林.基于BP模型的KAD网络核心节点识别算法研究[J].计算机工程与应用,2013,49(7):72-75.

1陈晨.最大频繁项集挖掘算法综述[J].电脑知识与技术,2008,0(11Z):1030-1031.
2黄松英.基于最大频繁项集挖掘的入侵检测研究[J].绍兴文理学院学报,2007,27(10):32-36. 被引量：1
3彭慧伶,舒云星,武新.基于FP-tree的最大频繁项集挖掘新算法[J].计算技术与自动化,2009,28(2):62-65.
4陈凤娟.基于FP树的最大频繁项集挖掘[J].电子世界,2014(17):119-119.
5陈慧萍,王建东,王煜.频繁项集挖掘的研究与进展[J].计算机仿真,2006,23(4):68-73. 被引量：10
6马志新,陈晓云,王雪,李龙杰.最大频繁项集挖掘中搜索空间的剪枝策略[J].清华大学学报（自然科学版）,2005,45(S1):1748-1752. 被引量：5
7张志刚,黄刘生,金宗安,项莉萍.基于父子等价剪枝策略的最大频繁项集挖掘[J].计算机工程,2013,39(4):219-221. 被引量：4
8张世玲,李艳,王熙腾.一种基于布尔矩阵的最大频繁项集挖掘算法[J].计算机光盘软件与应用,2013,16(1):192-193. 被引量：1
9刘琰,张进,陈静,尹美娟,张伟丽.基于最大频繁项集挖掘的微博炒作群体发现方法[J].计算机工程与应用,2017,53(4):90-97. 被引量：1
10刘慧婷,候明利,赵鹏,姚晟.不确定数据流最大频繁项集挖掘算法研究[J].计算机工程与应用,2016,52(19):72-77. 被引量：9

计算机应用研究

2010年第9期

浏览历史

内容加载中请稍等...

P2P网络中最大频繁项集挖掘算法研究被引量：1

参考文献8

二级参考文献24

共引文献13

同被引文献15

引证文献1

相关作者

相关机构

相关主题

浏览历史

P2P网络中最大频繁项集挖掘算法研究 被引量：1

参考文献8

二级参考文献24

共引文献13

同被引文献15

引证文献1

相关作者

相关机构

相关主题

浏览历史

P2P网络中最大频繁项集挖掘算法研究被引量：1