摘要
问答系统 (QuestionAnsweringSystem)能用准确、简洁的答案回答用户用自然语言提出的问题。目前多数问答系统利用大规模文本作为抽取答案的知识库 ,而网络上丰富的资源为问答系统提供了另外一种良好的知识来源 ,对于回答简短、基于事实的问题非常有效。本文对基于网络的问答系统研究现状作了简要的介绍 ,分析了网络信息的特点。我们提出了一种基于语句相似度计算的答案抽取方法 ,在此基础上实现了一个基于网络的中文问答系统。该系统只利用网络搜索引擎返回结果中的摘要部分作为答案抽取的资源 ,从而节省了下载、分析网络源文本的时间。实验结果表明该系统对人名、数量及时间类型的问题效果显著 ,对测试问题集的MRR值达到 0 5 1。
Question Answering System can give users precise answer to the question presented in natural language. Currently, most of question answering systems use large scaled corpus as knowledge base to extract answer. However, the abundant web resource provides another ideal knowledge source for question answering system. The research result shows that using web resource as the information source for question answering system can get good performance for simple and factoid based questions. This paper presents an answer extraction method based on the computation of sentence similarity between the question sentence and the candidate answer sentence. We also developed a web based Chinese QA system. This system only utilizes the 'text snippet' in the feedback of the web search engine as data resource for answer extraction. The experiment result indicates that the system can get relatively good results for the questions of the types of PERSON, TIME and NUMBER; the MRR of all questions is 0 51.
出处
《中文信息学报》
CSCD
北大核心
2004年第3期24-31,共8页
Journal of Chinese Information Processing
关键词
计算机应用
中文信息处理
问答系统
句子相似度
信息抽取
computer application
Chinese information processing
question answering system
sentence similarity
information extraction