期刊文献+

基于机器学习的网络媒体热点话题预测方法研究与实现 被引量:3

Research and implementation of a forecasting method of hot topics in authoritative network media based on machine learning
在线阅读 下载PDF
导出
摘要 针对目前互联网"富信息化"现象,提出了基于机器学习的网络热点话题预测的思想。该思想通过总结能尽量准确描述热点话题的一组特征,得到每篇新闻各自的特征向量,并针对大量近期已知是否热门的随机新闻样本内容进行聚类处理。基于健壮精准的分类算法,利用支持向量机将向量映射到高维空间达到分类目的。在机器学习过程中,采用大量试验的方法修改并完善特征向量的组成、度量及权重,最终达到准确作出热点话题预测的目的。 Specific to the phenomenon of ″rich informationization″,an idea of Internet hot topic forecasting is proposed in this paper. The core of this idea is to summarize a set of relevant features of the hot topics in order to obtain the feature vectors of the sample news. Based on these features, therandom sample contents of a great deal of latest news are clustered, which means whether the news is a hot topic or not had been known to all. On the basis of theselected robust and accurate classification algorithm , the support vector machine is used to map the vectors into a higher dimensional space for the purpose of data classification. In the process of machine learning, the composition, the measurement and the weight of the feature vectors are modified and improved through trials and errors, thus to realize the accurate forecasting of hot topics.
出处 《微型机与应用》 2014年第15期62-64,共3页 Microcomputer & Its Applications
基金 北京对外文化交流与世界文化研究基地项目(BWSK201303) 北京外国语大学公共外交研究中心 北京市社科联青年社科人才资助项目(2013SKL030) 北京高等学校青年英才计划项目(YETP0847)
关键词 机器学习 网络媒体 热点话题 特征向量 分词 预测 machine learning network media hot topic feature vector classification forecasting
  • 相关文献

参考文献7

  • 1彭菲菲.网络热点话题发现的关键技术研究[D].北京:中国矿业大学(北京),2012.
  • 2王巍,杨武,齐海凤.基于多中心模型的网络热点话题发现算法[J].南京理工大学学报,2009,33(4):422-426. 被引量:29
  • 3赖锦辉,梁松.一种消除孤立点的微博热点话题发现方法[J].计算机应用与软件,2014,31(1):105-107. 被引量:8
  • 4RAHM E, BERNSTEIN P A. A survey of approaches to automatic schema matching[J]. The VLDB Journal, 2001, 10(4) : 334-350.
  • 5马子恩.热点事件新闻语料库的研制及词汇研究[D].南京:南京师范大学.2012.
  • 6LI S, ZHAO J, SONG Z, et al. Study on topic tracking system based on SVM[C]. 2011 Fourth International Sym- posium on Knowledge Acquisition and Modeling (KAM), IEEE, 2011: 83-87.
  • 7ZHENG Y, LU R. An adaptive topic tracking method based on feedback stories[C]. International Symposium on Information Technology in Medicine and Education, 2012 (2) : 1021-1025.

二级参考文献18

共引文献35

同被引文献20

引证文献3

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部