期刊文献+

基于数据挖掘的网络链接预测研究 被引量:5

Research on Network Link Prediction Based on Data Mining
在线阅读 下载PDF
导出
摘要 近年来社交网络日益火热,基于社交网络的数据挖掘也随之兴起。链接预测作为网络数据挖掘的重要课题,其借助已知的网络结构等信息来预测和估计尚未链接的两个节点间存在链接的可能性。社交网络的链接预测可以用于好友推荐,过滤冗余信息,提高用户的满意度、忠诚度,建立一个健康的社交网络环境。已有的链接预测算法集中研究网络结构信息或网络节点属性,以分析网络全局或局部特性。文章考虑到微博社交网络的本质,提出了融合多特征的链接预测方法,其中包括节点特征、拓扑特征、社交特征以及投票特征。基于这些特征,在微博社交网络数据上应用SVM、朴素贝叶斯、随机森林和逻辑回归4种机器学习算法训练预测模型,预测潜在的社交链接。结果表明,文章提出的组合特征相对于传统特征表现更好,且融合多种特征能够提高最终的预测精度。 In recent years,social networks have become increasingly hot,and data mining basedon social networks has also arisen.Link prediction(LP)is an important topic of network data mining,which uses the known network structure and other information to predict and estimate the possibilityof linking between two nodes that are not yet linked.Link prediction in social network can be usedto recommend friends,fi lter redundant information,improve user’s satisfaction and loyalty,and builda healthy social networking environment.In previous researches,attentions are focused on structureinformation or node attributes,in order to analyze the global or local properties.Considering the naturesof microblog social network,this paper proposes a link prediction method combining multiple featureswhich includes node features,topological features,social features and voting features.Based on thesefeatures,4machine learning algorithms,SVM,naive Bayes,random forest and logical regression,areapplied on microblog social network data to train predictive models to predict potential social links.The results show that combining multiple features performs better than the traditional features,and thecombination of multiple features can achieve highest accuracy.
作者 徐燕 XU Yan(the Information Science Department, Beijing Language and Culture University, Beijing 100083, China;Institute of Computing Technology, Chinese Academy of Science, Beijing 100190, China)
出处 《信息网络安全》 CSCD 2017年第6期30-34,共5页 Netinfo Security
基金 国家自然科学基金[60873166] 北京市自然科学基金[4122076]
关键词 链接预测 数据挖掘 特征提取 好友推荐 机器学习算法 link prediction data mining features extraction friends recommendation machine learning algorithm
  • 相关文献

参考文献4

二级参考文献49

  • 1李伟,苏璞睿.基于内核驱动的恶意代码动态检测技术[J].中国科学院研究生院学报,2010,27(5):695-703. 被引量:9
  • 2张新宇,卿斯汉,马恒太,张楠,孙淑华,蒋建春.特洛伊木马隐藏技术研究[J].通信学报,2004,25(7):153-159. 被引量:43
  • 3康治平,向宏,傅鹂.基于API HOOK技术的特洛伊木马攻防研究[J].信息安全与通信保密,2007,29(2):145-148. 被引量:4
  • 4胡燕京,张冰,王海义,张丽琼.主流木马技术分析及攻防研究[J].现代电子技术,2007,30(13):96-100. 被引量:2
  • 5PELEG D,MOOtkE A.X-means: Extending K-means with Efficient Estimation of the Number of Clusters [C]//Proceedings of the Seventeenth International Conference on Machine Learning. San Francisco:Morgan Kaufmann Publishers,2000:89-97.
  • 6新浪.新浪微博发布2015第三季度财报[EB/OL].http://tech.sina.com.cn/i/2015- 11 - 19/doc-ifxkwuwxOl83629.shtml, 2015-11-19.
  • 7新浪.新浪微博开放API[EB/OL]. htlp://opeawdbo.com/\viki/%E5%BE%AE)E5%8D%9AAPI, 2015-11-19.
  • 8Binux. pyspider 爬虫教程:AJAX 和 HTTP [EB/OL]. http://blog,binux.me/2015/01/ pyspider-tutorial-level-2-ajax-and-more-http/,2015-11-19.
  • 9CSDN.利用python 实现新浪微博爬虫[EB/OL]. http://blog.csdn.net/monsion/ article/details/7981366, 2015-11 - 19.
  • 10CSDN.PageRank 算法[EB/OL]. http://blog.csdn.net/hguisu/article/details/7996185, 2015-11-19.

共引文献36

同被引文献32

引证文献5

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部