摘要
时序性和波动性直接与话题的热度有关,短时间内某话题出现的相关报道越多,则其热度越高;话题的波动幅度越大,则其热度越高.依据积分理论给出了基于时序性的相关报道密度计算和基于波动性的峰值计算,并采用线性调和的方法将二者融合,给出话题热度计算方法.实验采用TDT4语料作为测试集合,验证了该方法的有效性与合理性.
Temporal and volatility are directly related to the hot degree of topic,namely,the more related stories the topic has in a certain time distance,and the larger fluctuation the topic is,the higher hot degree of this topic.Applying integration theory,methods of computing the density of related stories and the peak value are put forward based on temporal and volatility respectively.Finally,linear meditated method is used to merge them to compute the hot degree of a topic.Experiments are carried out on TDT4 corpus to testify the validity and rationality of our new method.
作者
李汉才
徐建民
吴树芳
LI Hancai;XU Jianmin;WU Shufang(Personnel Department,Hebei University,Baoding 071002,China;College of Computer Science and Technology,Hebei University,Baoding 071002,China;College of Management and Economics,Tianjin University,Tianjin 300000,China;College of Management,Hebei University,Baoding 071002,China)
出处
《河北大学学报(自然科学版)》
CAS
北大核心
2018年第4期416-422,共7页
Journal of Hebei University(Natural Science Edition)
基金
国家社科基金资助项目(17BTQ068)
河北省教育厅青年基金资助项目(QN2015099)
河北大学中西部提升综合实力专项资金项目
河北省自然科学基金资助项目(F2015201142)
中国博士后基金资助项目(2017M621078)