期刊文献+

中文社区问答中问题答案质量评价和预测 被引量:28

Quality Evaluation and Prediction for Question and Answer in Chinese Community Question Answering
在线阅读 下载PDF
导出
摘要 知识共享型网站为自动问答系统带来了新的研究契机。但用户提供的问题及其答案质量参差不齐,在提供有用信息的同时可能包含各种无关甚至恶意的信息。对此类信息进行判别和过滤,并选取高质量的问题与答案对,有助于在基于社区的自动问答系统中重用相关问题的答案以提高问答系统的服务质量。首先从中文社区问答网站上抓取大量问题及答案,利用社会网络的方法对提问者和回答者的互动关系及特点进行了统计与分析。然后基于给定的问答质量判定标准,对3000多个问题及其答案进行了人工标注。并通过提取文本和非文本两类特征集,利用机器学习算法设计和实现了基于特征集的问答质量分类器。试验结果表明其精度和召回率均在70%以上。最后分析了影响社区网络中问答质量的主要因素。 The rise of Knowledge-sharing platform on the Internet in China provides a new approach for Automatic Question Answering.However,the quality of User-Generated Content in such social networks may vary significantly,from useless information to malice spam.Identifying and filtering such content are particularly important to improve users' experience and the performance of Question Answering System.We first extracted a set of question answer content from Chinese Community Question Answering site,investigated a series of statistic characteristics on the interaction of participants,and then manually annotated quality of a subset of these questions and answers.By combining text features and non-text features provided by the community extracted from those questions and answers,we established a content quality classification model for evaluation and prediction.We find that this model is able to distinguish high-quality ones from others with considerable accuracy.
出处 《计算机科学》 CSCD 北大核心 2011年第6期230-236,共7页 Computer Science
基金 国家自然科学基金项目(90718017) 教育部高等学校博士学科点专项基金(20070006055)资助
关键词 社区问答 社会网络 机器学习 问题答案质量评价和预测 人工标注 Community question answering Social networks Machine learning Question and answer quality evaluation and prediction Human annotation
  • 相关文献

参考文献25

  • 1Brin S, Page L. The anatomy of a large-scale hypertextual Web search engine [J]. Computer Networks and ISDN Systems, 1998,30(1-7) :107-117.
  • 2Kleinberg J M. Authoritative sources in a hyperlinked environ- ment[J]. Journal of the ACM, 1999,46(5) :604-632.
  • 3Zhou Y,Croft W B. Document quality models for web ad hoe re- trieval[C]//Proceedings of the ACM Fourteenth Conference on Information and Knowledge Management. 2005:331-332.
  • 4Jurezyk P, Agichtein E. Discovering authorities in question-an- swer communities using link analysis[C]//ACM Conference on Information and Knowledge Management(CIKM). 2007.
  • 5J urczyk P,Agiehtein E. Hits on question answer portals:an ex- ploration of link analysis for author ranking[C]//SIGIR (pos- ters). ACM, 2007.
  • 6Zhang M S J. Ackerrnan Lada Adamic. Expertise Networks in Online Communities: Structure and Algorithms [C]//WWW 2007. Banff, Alberta, Canada, May 2007.
  • 7Jeon J,Croft B W, Lee J H,et al. A framework to predict the quality of answers with non-textual features [C]//Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, NY, USA: ACM Press, 2006 : 228-235.
  • 8Scott J P. Social Network Analysis: A Handbook[M]. SAGE Publications, January 2000.
  • 9Zheng Zhi-ping. AnswerBus Question Answering System[C]//Human Language Technology Conference (HLT 2002). San Die- go, CA, March 2002 : 24-27.
  • 10Agichtein E, Liu Y. Modeling Information-seeker Satisfaction in Community Question Answering [ J ].ACM Transactions on Knowledge Discovery from Data, 2009,3 (2).

二级参考文献24

  • 1张仰森,曹元大,徐波.基于统计的纠错建议给出算法及其实现[J].计算机工程,2004,30(11):106-109. 被引量:7
  • 2张磊,周明,黄昌宁,潘海华.中文文本自动校对[J].语言文字应用,2001(1):19-26. 被引量:23
  • 3[8]Ulf Hermjakob. Parsing and Question Classification for Question Answering. Proceeding of the workshop on Open-Domain Question Answering at ACL-2001
  • 4[9]Eugene Agichtein, Steve Lawrence, Luis Gravano. Learning Search Engine Specific Query Transformations for Question Answering. ACM 2001,169- 178
  • 5[10]Soo-Min Kim, ae-Ho Baek, Sang-Beom Kim, Hae-Chang Rim Question Answering Considering Semantic Categories and Co-occurrence Density. Proceedings of the night Text Retrieval Conference (TREC-9)
  • 6[11]Marius Pasca, Sanda Harabagiu. High-Performance Question/Answering. 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( Sigir-01 ). New Orleans, LA. September 9 - 13,2001
  • 7[1]Ittycheriah,M. Franz,W-J Zhu,A. Ratnaparkhi. IBM's Statistical Question Answering System. Proceedings of the night Text Retrieval Conference (TREC-9)
  • 8[2]D. Elworthy. Question Answering Using a Large NLP System. Proceedings of the night Text Retrieval Conference (TREC-9)
  • 9[3]L. Wu,X-j Huang,Y. Guo,B. Liu,Y. Zhang. FDU at TREC-9:CLIR,Filtering and QA Tasks. Proceedings of the night Text Retrieval Conference(TREC-9)
  • 10[4]R.J. Cooper, S. M. Rüger. A Simple Question Answering System. Proceedings of the night Text Retrieval Conference(TREC-9)

共引文献204

同被引文献289

引证文献28

二级引证文献214

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部