期刊文献+

基于FIFA的主题相似性计算模型 被引量:3

FIFA-based Topic Similarity Computation Modal
在线阅读 下载PDF
导出
摘要 针对主题检测和追踪的第五个技术任务连接分析,提出了一种事件主题相似性分析技术·通过引入领域知识库,将基于词汇的分析技术提升到领域知识计算层面·当输入不同两个文档时,采用该分析技术进行识别文档内容所涉及到的事件主题是否一致·首先采用FIFA模型进行内容主题识别,然后采用LDM模型进行事件主题相似性计算分析·实验结果显示主题相似性计算正确率为64%,召回率为69%· An event topic similarity computation technique for the fifth TDT techniquelinking detection was put forward. Domain knowledge base is used to upgrade topic similarity computation technique from lexical analysis to domain knowledge analysis. The linking detection technique is used to decide whether two documents (or stories) are on the same event topic. There are two steps in the analysis procedure which includes using FIFA modal to identification content topic, and using LDM modal to compute event topic similarity. Experiment results show that precision and recall percents of the model are 64% and 69%.
出处 《东北大学学报(自然科学版)》 EI CAS CSCD 北大核心 2003年第11期1041-1044,共4页 Journal of Northeastern University(Natural Science)
基金 国家自然科学基金 微软亚洲研究院资助项目(60203019).
关键词 内容主题识别 事件主题分析 主题检测和追踪 领域知识 连接分析 content topic identification event topic analysis topic detection and tracking domain knowledge linking parsing
  • 相关文献

参考文献9

  • 1朱靖波,姚天顺.文本内容主题的识别方法[J].东北大学学报(自然科学版),2002,23(5):425-427. 被引量:8
  • 2Boykin S, Merlino A. Machine learning of event segmentation for news on demand[J ]. Communications of the ACM, 2000,43(2) :35 - 41.
  • 3Luhn H P. A statistical approach to mechanized encoding and searching of literary information[J ]. IBM Journal, 1957, l0(1):309-317.
  • 4Edmundson H. New methods in automatic extracting [J]. Journal of the ACM, 1969,16 (2) : 264 - 285.
  • 5Salton G, James A, Buckley C. Automatic analysis, theme generation, and summarization of nmchine-readable texts[J]. Science, 1994,264 (3) : 1421 - 1426.
  • 6Lehnert W, Loiselle C. An introduction to plot unit[A]. Semantic Structures-advances in Natural Language Processing [C]. Hillsdale: Lawrence Erlbaum Associates, 1989.88 - 111.
  • 7Hearst A. Context and structure in autmnated full-text information accesx [ D]. Berkeley: University of California,1994.
  • 8Peter W F. Latent semantic analysis for text-based research, behavior research methods[ J ]. lnstruments and Computers,1996,28(2) : 197 - 202.
  • 9Chin Y L. Robust autmnated topic identification [D]. California: University of Southern California, 1997.

二级参考文献2

  • 1朱靖波.面向英汉机器翻译的统计消岐技术研究[M].沈阳:东北大学,1999..
  • 2林鸿飞,高天,姚天顺.中文文本的可视化表示[J].东北大学学报(自然科学版),2000,21(5):501-504. 被引量:7

共引文献7

同被引文献46

引证文献3

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部