期刊文献+

基于词典属性特征的粗粒度词义消歧 被引量:10

Coarse-Grained Word Sense Disambiguation Using Features Described in the Lexicon
在线阅读 下载PDF
导出
摘要 本文依据《现代汉语语法信息词典》中对词语多义的属性特征描述,对《人民日报》语料中155个词语共4996个同形实例进行了粗粒度词义自动消歧实验,同时用贝叶斯算法进行了比较测试。基于词典属性特征的消歧方法在同形层面上准确率达到90%,但召回率偏低。其优点在于两个方面:1)不受词义标注语料库规模的影响;2)对特定词语意义的消歧准确率可达到100%。本文也讨论了适用于不同词类的消歧特征。 This paper presents a simple but effective feature-based approach to Chinese word sense disambiguation using the distributional features available from the Grammatical Knowledge-base of Contemporary Chinese. The test data is the sense-tagged corpus of People's Daily. A Naive Bayes classifier is also tried as a comparable statistical method. The feature-based approach achieves precision of 90%, which is comparable to the NB classifier. The striking advantages of the feature-based approach are 1) It is not influenced by the data size, and 2) It can disambiguate some specific words with precision of 100%. The features appropriate for different parts of speech in Chinese WSD are also discussed. This paper demonstrates that sense features described in the lexicon are worth including in WSD.
出处 《中文信息学报》 CSCD 北大核心 2007年第2期3-8,共6页 Journal of Chinese Information Processing
基金 国家973计划资助项目(2004CB318102)
关键词 人工智能 自然语言处理 特征 词义 词义消歧 贝叶斯分类法 artificial intelligence natural language processing~ feature word sense word sense disambiguation Naive Bayes classifier
  • 相关文献

参考文献8

  • 1刘风成,黄德根,姜鹏.基于AdaBoost.MH算法的汉语多义词消歧[J].中文信息学报,2006,20(3):6-13. 被引量:7
  • 2卢志茂,刘挺,郎君,李生.神经网络和贝叶斯网络在汉语词义消歧上的对比研究[J].高技术通讯,2004,14(8):15-19. 被引量:9
  • 3全昌勤,何婷婷,姬东鸿,刘辉.从搭配知识获取最优种子的词义消歧方法[J].中文信息学报,2005,19(1):30-35. 被引量:13
  • 4Lesk, M.E. Automated sense disambiguation using machine-readable dictionaries: How to tell a pine conefrom an ice cream cone [A]. In. Proceedings of the SIGDOC Conference [C]. 1986.
  • 5Yarowsky, D. Word-sense disambiguation using statistical models of Roget's categories trained on large corpora [A]. In.. Proceedings of COLING 92 [C].1992.
  • 6Niu, ZH. Y., Ji, D. H. and Tan, Ch. L. : Optimizing Feature Set for Chinese Word Sense Disambiguation [A]. In: Third International Workshop On The Evaluation of Systems for the Semantic Analysis of Text [C]. 2004.
  • 7Dang, H. T. and Palmer, M. : The Role of Semantic Roles in Disambiguating Verb Senses [A]. In: Proceedings of the 43th Annual Meeting of the ACL[C].2005.
  • 8Yarowsky, D. and Florian, R. Evaluating Sense Disambiguation Performance Across Diverse Parameter Spaces [J]. Journal of Natural Language Engineering, 2002.

二级参考文献26

  • 1Nancy I de, Jean Veronis. Introduction to the Special Issue on Word Sense Disambiguation:The State of the Art[J].Computational Linguistics. 1998, 1-42.
  • 2Yarowsky D. Umupervised Word Sense Disambiguation Rivaling Supervised Methods[A]. In: Proceedings of 33rd Annual Meeting of ACL[C], Cambridge, Massachusetts, USA, 1995, 181 - 188.
  • 3HAO Trang Dang, Ching - yi Chia. Simple Features for Chinese Word Sense Disambiguation[A]. In: Proceedings of COLING-2002 [ C ].Philadelphia, USA, 2002, 88- 94.
  • 4Lesk, Michael, Automatic Sense Disambiguation: How to tell a Pine Cone from and Ice Cream Cone, Proceeding of the 1986 SIGDOC Conference, Association for Computing Machinery, New York, 1986.
  • 5N.Ide,J.Veronis,Introduction to the special Issue on Word Sense Disambiguation:The State of the Art[J].Computational Linguistics,ACL,1998.24(1).
  • 6D.Yarowsky.Unsupervised Word Sense Disambiguation Rivaling Supervised Methods[A].In:the 33rd Annual Meeting of ACL[C].Massachusetts,1995:181-188.
  • 7H.T.Ng,Exemplar-based Word Sense Disambiguation:Some Recent Improvements[A].In:proceeding of the2nd Conference on Empirical Methods in Natural Language Processing,EMNLP,1997.
  • 8Peter F.Brown,Stephen A.Della Pietra,Vincent J.Della Pietra,and Robert L.Mercer.Word-sense disambiguation using statistical methods[A].In:proceedings of the 29th conference on Association for Computational Linguistics[C].California,June 1991,264-270
  • 9G.Towell,E.M.Voorhees,Disambiguating Highly Ambiguous Words[J].Computational Linguistics,ACL,1998.24(1).
  • 10S.Abney,R.E.Schapire,Y.Singer.Boosting Applied to Tagging and PP-attachment[A].In:proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Proceeding and Very larger Corpora[C].1999.

共引文献25

同被引文献190

引证文献10

二级引证文献40

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部