摘要
根据对中文问题的分析可知,问题中的疑问词和中心词等关键词对问题所属类型起着决定性的作用。提出利用自学习方法建立疑问词—类别和疑问词+中心词—类别两种规则,并结合改进贝叶斯模型的问题分类方法。该方法充分利用了关键词对分类的贡献。实验结果表明,该分类方法有很大的改进,准确率达到了84%。
According to the Chinese question,this paper presented a question classification method which combined selflearning rules,consisting of question word-category rules and question word + head word-category rules established in advance by the self-learning method,and modified Bayesian model to improve Chinese question classification. At last,combined modified Bayesian model to improve question classification. The method takes advantage of the contribution of key words to Chinese question classification. Experimental results show that this classification method is a considerable improvement,and accuracy rate a chieves 84% .
出处
《计算机应用研究》
CSCD
北大核心
2010年第8期2869-2871,共3页
Application Research of Computers
基金
国家自然科学基金资助项目(60603068)
关键词
问题分类
问答系统
疑问词
中心词
改进贝叶斯模型
规则
question classification
question answering system
question word
head word
modified Bayesian model
rule