期刊文献+

文本分割综述 被引量:5

Overview of Text Segmentation
在线阅读 下载PDF
导出
摘要 文本分割在信息提取、文摘生成、语篇解析及其他多个领域有着极为重要的应用。文本分割的对象包括静态书面文本、语音文本以及动态文本等;分割的粒度因分割的目的不同而有所区别;分割的准确性不仅需要直接评测,更需要间接评测。在大量文献的基础上,对目前常用的分割方法及评测手段进行了全面的归纳和总结,分析了文本分割技术的研究现状,指出尚存在的问题并展望研究前景。 Text segmentation is very important in information retrieval,automatic summarization,discourse analysis,and many other fields.Static written text,speech text and dynamic text can be segmented.The granularity of segmentation is varied for different purpose.Direct and indirect evaluations are applied to assess algorithms.The current work on segmentation approaches and direct evaluation methods are generalized on the basis of lots of literatures.The paper presents the status of text segmentation,points out the problems and future research.
作者 石晶
出处 《计算机工程与应用》 CSCD 北大核心 2006年第35期155-159,171,共6页 Computer Engineering and Applications
基金 国家973重点基础研究发展规划资助项目(2002CB312103) 国家自然科学基金资助项目(60503054) 中国科学院软件所创新工程重大项目资助。
关键词 文本分割 主题分割 粗分割 细分割 text segmentation topic segmentation coarse-grained segmentation fine grained segmentation
  • 相关文献

参考文献37

  • 1GROSZ B J,SIDNER C L.Attention,intentions,and the structure of discourse[J].Computational Linguistics,1986,12(3):175-204.
  • 2MANN W C,THOMPSON S A.Rhetorical structure theory:toward a functional theory of text organization[J].Text,1988,8(3):243-281.
  • 3HOBBS J R.Coherence and coreference[J].Cognitive Science,1979,3 (1):67-90.
  • 4REYNAR J.Topic segmentation:algorithms and applications[D].Computer and Information Science,University of Pennsylvania,1998.
  • 5PASSONEAU R,LITMAN D.Intention based segmentation:human reliability and correlation with linguistic cues[C]//proceedings of Association of Computational Linguistics,(ACL-93),1993:148-155.
  • 6MARCU D.The rhetorical parsing of unrestricted texts:a surfacebased approach[J].Computational Linguistics,2000,26(3):395-448.
  • 7MAARCU D.The theory and practice of discourse parsing and summarization[M].Cambridge,Massachusetts,London,England:MIT Press,2000.
  • 8MARCU D.The rhetorical parsing,summarization,and generation of natural language texts[D].Department of Computer Science,University of Toronto,1997.
  • 9MORRIS J,HIRST G.Lexical cohesion computed by thesaural relations as an indicator of the structure of text[J].Computational Linguistics,1991,17(1):21-48.
  • 10MOCHIZUKI H,HONDA T,OKUMURA M.Text segmentation with multiple surface linguistic cues[C]//COLING-ACL'98,1998:881-885

同被引文献39

引证文献5

二级引证文献15

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部