摘要
提出并实现了一种能有效地融合改进的浅层主题特征分析方法、词汇链方法、话语结构方法的分析结果来生成文摘的多知识源融合的自动摘要系统,并对其进行了评测实验。评测结果表明,该系统在捕获文章特征的同时较好地保持了原文的内容及内在逻辑结构,生成的文摘具有良好的连贯性和流畅性;与采用单一方法的自动摘要系统相比较,生成的文摘质量有明显提高。
An automatic text summarization system based on fusing multiple knowledge sources is proposed. It generates summaries by fusing the results produced by the improved method based on add terms, the improved method based on lexical chains and the improved method based on discourse structure. Evaluation results show that the presented system not only captures the characteristics of texts but also retains the content and inherent logic of sources, and high-quality summaries are produced from arbitrary Chinese texts. The proposed system is compared to systems based on single method and it is shown that it produces either comparable or better summaries overall.
出处
《高技术通讯》
CAS
CSCD
北大核心
2006年第4期337-341,共5页
Chinese High Technology Letters
基金
国家自然科学基金(60435020)和国家863计划(2002AA117010-09)资助项目.
关键词
自动摘要
特征词
词汇链
话语结构
融合
automatic text summarization, add term, lexical chain, discourse structure, fusion