Abstract
Frequent itemset mining in P2P networks faces two problems: the full set of frequent itemsets is too large, and the network communication cost is high. To address both, this paper proposes P2PMaxSet, an algorithm for mining maximal frequent itemsets in P2P networks. First, the algorithm mines only maximal frequent itemsets, which reduces the number of results. Second, each node exchanges mining results only with its neighbor nodes, which saves substantial communication cost. Finally, the paper discusses strategies for adjusting the algorithm when the network changes dynamically. Experimental results show that P2PMaxSet achieves high accuracy with low communication cost.
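The first point of the abstract, that keeping only maximal frequent itemsets shrinks the result set, can be illustrated with a small single-machine sketch. This is not P2PMaxSet itself: the brute-force enumeration, the function names `frequent_itemsets` and `maximal_itemsets`, the toy transactions, and the support threshold are all assumptions made for illustration, and the neighbor-to-neighbor exchange between peers is not modeled here.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Brute-force enumeration of all frequent itemsets (illustration only)."""
    items = sorted({item for t in transactions for item in t})
    frequent = {}
    for size in range(1, len(items) + 1):
        found_any = False
        for candidate in combinations(items, size):
            cset = frozenset(candidate)
            # Support = number of transactions containing the candidate.
            count = sum(1 for t in transactions if cset <= t)
            if count >= min_support:
                frequent[cset] = count
                found_any = True
        if not found_any:
            # No frequent itemset of this size means none of a larger size.
            break
    return frequent


def maximal_itemsets(frequent):
    """Keep only frequent itemsets that have no frequent proper superset."""
    return {s for s in frequent if not any(s < other for other in frequent)}


# Hypothetical toy data: five transactions over items a, b, c, d.
transactions = [
    frozenset(t)
    for t in ({"a", "b", "c"}, {"a", "b", "c"}, {"a", "b"}, {"b", "d"}, {"a", "c"})
]

all_frequent = frequent_itemsets(transactions, min_support=2)
maximal = maximal_itemsets(all_frequent)

print(len(all_frequent), "frequent itemsets")
print(len(maximal), "maximal frequent itemset(s):", [sorted(s) for s in maximal])
```

On this toy data the seven frequent itemsets collapse to the single maximal itemset {a, b, c}. Every frequent itemset is a subset of some maximal one, so a node that shares only maximal itemsets still lets its neighbors recover which itemsets are frequent, although not their exact support counts.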
Source
《计算机应用研究》 (Application Research of Computers)
CSCD
PKU Core Journal (北大核心)
2010, No. 9, pp. 3490-3492 (3 pages)
Funding
National "863" Program of China (2007AA012474)
Beijing Municipal Excellent Talents Training Funding Project (2009D005002000009)