
An Improved MO Algorithm for Sentence Ordering in Multi-Document Summarization (cited by: 2)

Improved Majority Ordering Algorithm of Multi-Document Summarization Sentence
Abstract: To address the shortcomings of the Chronological Ordering (CO) and Majority Ordering (MO) algorithms for ordering summary sentences, this paper proposes a new method that combines the cohesion between local topics (themes) with the MO algorithm. Based on statistics of the relative positions of each pair of local topics, a directed graph of their relations is built and the cohesion between them is computed. During ordering, each time a vertex is output from the directed graph, the remaining vertices are searched for the one with the greatest cohesion with it; if that cohesion exceeds a threshold, the summary sentences of the two local topics represented by these vertices are placed in adjacent positions in the summary. Experimental results show that summaries ordered by this algorithm are more coherent and readable.
Source: Journal of South China University of Technology (Natural Science Edition) (《华南理工大学学报(自然科学版)》), indexed in EI, CAS, CSCD, and the Peking University Core Journals list; 2008, No. 9, pp. 43-47, 70 (6 pages in total)
Funding: Specialized Research Fund for the Doctoral Program of Higher Education, Ministry of Education of China (20050007023)
Keywords: artificial intelligence; multi-document summarization; local topic; sentence ordering
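
The ordering procedure described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation. The abstract does not define how cohesion is computed, so the sketch assumes a hypothetical measure (the share of source documents in which two themes appear next to each other), assumes each theme occurs at most once per document, and uses a simple greedy out-minus-in score as a stand-in for MO. The names `build_graph`, `order_themes`, and `threshold` are illustrative.

```python
from collections import defaultdict
from itertools import combinations


def build_graph(docs):
    """Collect pairwise precedence counts and a hypothetical cohesion score.

    docs: list of documents, each a list of theme ids in order of appearance.
    """
    precede = defaultdict(int)   # (a, b) -> number of docs where a comes before b
    adjacent = defaultdict(int)  # {a, b} -> number of docs where a and b are neighbours
    together = defaultdict(int)  # {a, b} -> number of docs containing both themes
    for doc in docs:
        for i, j in combinations(range(len(doc)), 2):
            a, b = doc[i], doc[j]
            pair = frozenset((a, b))
            precede[(a, b)] += 1
            together[pair] += 1
            if j == i + 1:
                adjacent[pair] += 1
    # Assumed cohesion: fraction of co-occurrences in which the themes are adjacent.
    cohesion = {pair: adjacent[pair] / n for pair, n in together.items()}
    return precede, cohesion


def order_themes(docs, threshold=0.5):
    """Greedy MO-style ordering with a cohesion-based adjacency step."""
    precede, cohesion = build_graph(docs)
    remaining = sorted({t for doc in docs for t in doc})
    ordered = []

    def score(t):
        # Outgoing minus incoming precedence weight among the remaining themes.
        return sum(precede[(t, u)] - precede[(u, t)] for u in remaining if u != t)

    def coh(a, b):
        return cohesion.get(frozenset((a, b)), 0.0)

    while remaining:
        current = max(remaining, key=score)
        remaining.remove(current)
        ordered.append(current)
        if remaining:
            # Find the remaining theme most cohesive with the one just output.
            partner = max(remaining, key=lambda u: coh(current, u))
            if coh(current, partner) > threshold:
                # Place it directly after the current theme (single pull, no chaining).
                remaining.remove(partner)
                ordered.append(partner)
    return ordered
```

For example, with `docs = [["A", "C", "D", "B"], ["B", "D", "A", "C"], ["A", "B", "D", "C"]]`, a threshold of 1.0 never triggers the adjacency step, so the result is the plain greedy ordering; lowering the threshold to 0.5 pulls theme `C` next to `A` because the two are neighbours in two of the three documents.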


