期刊文献+
共找到3篇文章
< 1 >
每页显示 20 50 100
New event detection based on sorted subtopic matching algorithm
1
作者 翟东海 CUI Jing-jing +1 位作者 NIE Hong-yu DU Jia 《Journal of Chongqing University》 CAS 2013年第4期179-186,共8页
How to quickly and accurately detect new topics from massive data online becomes a main problem of public opinion monitoring in cyberspace. This paperpresents a new event detection method for the current new event det... How to quickly and accurately detect new topics from massive data online becomes a main problem of public opinion monitoring in cyberspace. This paperpresents a new event detection method for the current new event detection system, based on sorted subtopic matching algorithm and constructs the entire design framework. In this p^per, the subtopics contained in old topics (or news stories) are sorted in descending order according to their importance to the topic(or news stories), and form a sorted subtopic sequence. In the process of subtopic matching, subtopic scoring matrix is used to determine whether a new story is reporting a new event. Experimental results show that the sorted subtopic matching model improved the accuracy and effectiveness ofthenew event detection system in cyberspace. 展开更多
关键词 new event detection topic detection scoring matrix sorted subtopic matching model subtopic sequence
在线阅读 下载PDF
Hierarchical Subtopic Segmentation of Web Document
2
作者 ZHANG Yun-tao GONG Ling WANG Yong-cheng 《Wuhan University Journal of Natural Sciences》 EI CAS 2006年第1期47-50,共4页
The paper proposes a novel method for subtopics segmentation of Web document. An effective retrieval results may be obtained by using subtopics segmentation. The proposed method can segment hierarchically subtopics an... The paper proposes a novel method for subtopics segmentation of Web document. An effective retrieval results may be obtained by using subtopics segmentation. The proposed method can segment hierarchically subtopics and identify the boundary of each subtopic. Based on the term frequency matrix, the method measures the similarity between adjacent blocks, such as paragraphs, passages. In the real-world sample experiment, the macro-averaged precision and recall reach 73.4 % and 82.5 %, and the micro-averaged precision and recall reach 72.9% and 83. 1%. Moreover, this method is equally efficient to other Asian languages such as Japanese and Korean, as well as other western languages. 展开更多
关键词 subtopic segmentation Web document passage retrieval DISCOURSE
在线阅读 下载PDF
Semantic composition of distributed representations for query subtopic mining
3
作者 Wei SONG Ying LIU +1 位作者 Li-zhen LIU Han-shi WANG 《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2018年第11期1409-1419,共11页
Inferring query intent is significant in information retrieval tasks. Query subtopic mining aims to find possible subtopics for a given query to represent potential intents. Subtopic mining is challenging due to the n... Inferring query intent is significant in information retrieval tasks. Query subtopic mining aims to find possible subtopics for a given query to represent potential intents. Subtopic mining is challenging due to the nature of short queries. Learning distributed representations or sequences of words has been developed recently and quickly, making great impacts on many fields. It is still not clear whether distributed representations are effective in alleviating the challenges of query subtopic mining. In this paper, we exploit and compare the main semantic composition of distributed representations for query subtopic mining. Specifically, we focus on two types of distributed representations: paragraph vector which represents word sequences with an arbitrary length directly, and word vector composition. We thoroughly investigate the impacts of semantic composition strategies and the types of data for learning distributed representations. Experiments were conducted on a public dataset offered by the National Institute of Informatics Testbeds and Community for Information Access Research. The empirical results show that distributed semantic representations can achieve outstanding performance for query subtopic mining, compared with traditional semantic representations. More insights are reported as well. 展开更多
关键词 subtopic mining QUERY INTENT DISTRIBUTED representation SEMANTIC COMPOSITION
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部