期刊文献+

SSD模型及其在汉语词性标注中的应用 被引量:4

Symbol-and-Statistics Decoding Model and Its Application in Chinese POS Tagging
在线阅读 下载PDF
导出
摘要 该文提出了一种以符号解码与数值解码并举的SSD(Symbol-and-Statistics Decoding Model)模型,该模型被用于汉语词性标注任务,其标注正确率在封闭测试中达到97.08%,开放测试中达到95.67%,较二阶HMM的95.56%和94.70%都有较为显著提高。SSD模型的正确率虽然不及最大熵模型和CRF模型,但它的训练时间远少于后者,说明SSD模型在处理自然语言中的特定任务时是一种较强的实用模型。 A statistical language model named Symbol-and-Statistics Decoding (SSD) language model is presented in this article. The 2-gram SSD model is applied to the Chinese POS tagging task with a quite good result. The precision is as high as 97. 08% in the closed test and 95.67% in the open test is, which are both significantly higher than the HMM at 95.56% and 94.70%, respectively. Although the performance of SSD model is not as good as the conditional models such as Maximum Entropy Model and CRF model, the training time of SSD is much less than the conditional models, which makes SSD model more applicable to certain tasks in natural language processing.
出处 《中文信息学报》 CSCD 北大核心 2010年第1期20-24,共5页 Journal of Chinese Information Processing
基金 国家自然科学基金资助项目(60572159 60872121)
关键词 计算机应用 中文信息处理 SSD模型 HMM 词性标注 computer application Chinese information processing SSD model HMM POS tagging
  • 相关文献

参考文献9

二级参考文献35

  • 1周强.规则和统计相结合的汉语词类标注方法[J].中文信息学报,1995,9(3):1-10. 被引量:43
  • 2杨尔弘,方莹,刘冬明,乔羽.汉语自动分词和词性标注评测[J].中文信息学报,2006,20(1):44-49. 被引量:16
  • 3赵岩,王晓龙,刘秉权,关毅.融合聚类触发对特征的最大熵词性标注模型[J].计算机研究与发展,2006,43(2):268-274. 被引量:20
  • 4Rosenfeld R. Adaptive statistical language modeling: maximum entropy approch [D]. Pittsburgh:Carnegie Mellon Univ, 1994.
  • 5Brown R F, Della-Pietray V J,de Sousa P V,et al.Class-based N-gram models onatural language [J].Computational Linguistics, 1992,18 (4) : 467 - 479.
  • 6Jelinek F. Self-organizing language models for speech recognition [A]. Reading in Speech Reognition [C]. USA: Morgan Kaufman Publishers, Inc,1990. 450-506.
  • 7Morialdo B. Tagging english text with a problistic model [J]. Computational Linguistics, 1994. 20 (2) :155-171.
  • 8Berger A L,Della P, Pietra S A, et al. A maximum entropy approach to natrual language processing [J].Computational Linguistics, 1996,22 ( 1 ) : 450- 480.
  • 9Kuhn R, Mori R. A cache-based natural language model for speech recognition [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1990,PAMI- 12(6) :570-583.
  • 10Eric Brill.Some Advances in Transformation-Based Part of Speech Tagging[C].In:Proceedings of the Twelfth National Conference on Artificial Intelligence,1994:722~727

共引文献214

同被引文献63

引证文献4

二级引证文献19

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部