All-Words Word Sense Disambiguation Based on Sequence Labeling (基于序列标注的全词消歧方法)

Abstract: All-words word sense disambiguation (WSD) can be regarded as a sequence labeling problem. This paper proposes two all-words WSD methods based on sequence labeling, built on the Hidden Markov Model (HMM) and the Maximum Entropy Markov Model (MEMM), respectively. First, we model all-words WSD with an HMM. Then, because an HMM can only exploit word-form (lexical) observations, we generalize the HMM to an MEMM, which incorporates a large number of non-independent contextual features. All-words WSD is a typical extra-large state problem, so data sparsity and high time complexity seriously hinder the application of both the HMM and the MEMM. We address these problems with a beam-search Viterbi algorithm and a smoothing strategy. Finally, we evaluate our methods on the all-words WSD datasets of Senseval-2 and Senseval-3. The proposed MEMM method achieves an F1 score of 0.654, outperforming the other sequence-labeling-based methods in these evaluations.
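The HMM formulation in the abstract can be illustrated with a small sketch: senses are the hidden states, word forms are the observations, and decoding keeps only the best few partial paths at each position. This is not the authors' implementation. The function names (`train_hmm`, `log_prob`, `beam_viterbi`), the toy corpus, the sense labels such as `bank%finance`, and the choice of add-one smoothing are all assumptions made for illustration; the abstract does not say which smoothing strategy or beam width the paper uses.

```python
import math
from collections import defaultdict

def train_hmm(tagged_sentences):
    """Count sense-to-sense transitions and sense-to-word emissions."""
    trans = defaultdict(lambda: defaultdict(int))
    emit = defaultdict(lambda: defaultdict(int))
    senses, vocab = set(), set()
    for sentence in tagged_sentences:
        prev = "<s>"  # sentence-start pseudo-state
        for word, sense in sentence:
            trans[prev][sense] += 1
            emit[sense][word] += 1
            senses.add(sense)
            vocab.add(word)
            prev = sense
    return trans, emit, senses, vocab

def log_prob(counts, given, event, n_events):
    """Add-one smoothed log P(event | given); an assumed smoothing choice."""
    row = counts.get(given, {})
    return math.log((row.get(event, 0) + 1) / (sum(row.values()) + n_events))

def beam_viterbi(words, model, candidates, beam_width=3):
    """Viterbi decoding that keeps only the top `beam_width` states per word."""
    trans, emit, senses, vocab = model
    n_senses, n_words = len(senses), len(vocab) + 1  # +1 reserves mass for unseen words
    beam = [(0.0, ["<s>"])]  # each hypothesis: (log score, sense path)
    for word in words:
        best = {}  # best hypothesis ending in each candidate sense
        for sense in candidates.get(word, sorted(senses)):
            for score, path in beam:
                s = (score
                     + log_prob(trans, path[-1], sense, n_senses)
                     + log_prob(emit, sense, word, n_words))
                if sense not in best or s > best[sense][0]:
                    best[sense] = (s, path + [sense])
        # Prune: only the highest-scoring states survive to the next position.
        beam = sorted(best.values(), key=lambda h: h[0], reverse=True)[:beam_width]
    return beam[0][1][1:]  # drop the "<s>" marker

# Hypothetical toy corpus of (word, sense) pairs; labels are not real sense keys.
train = [
    [("the", "the%1"), ("bank", "bank%finance"), ("approved", "approve%1"),
     ("the", "the%1"), ("loan", "loan%1")],
    [("the", "the%1"), ("river", "river%1"), ("bank", "bank%shore"),
     ("flooded", "flood%1")],
]
model = train_hmm(train)

# Candidate senses per word, here simply the senses seen in training.
candidates = defaultdict(list)
for sentence in train:
    for word, sense in sentence:
        if sense not in candidates[word]:
            candidates[word].append(sense)

print(beam_viterbi(["the", "river", "bank", "flooded"], model, candidates))
```

Restricting each word to its own candidate senses and pruning to `beam_width` states bounds the work per position, which is the kind of saving that matters when the full sense inventory is very large. An MEMM variant would replace the two smoothed count ratios with a single maximum entropy classifier over contextual features, conditioned on the previous sense.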

Authors: Zhou Yun (周云), Wang Ting (王挺), Yi Mianzhu (易绵竹), Zhang Lupeng (张禄彭), Wang Zhiyuan (王之元)

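Affiliations: School of Computer Science, National University of Defense Technology; Institute of National Defense Language and Culture, PLA University of Foreign Languages; Department of Eurasian Languages, PLA University of Foreign Languages; National Key Laboratory for Parallel and Distributed Processing, National University of Defense Technology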

Source: Journal of Chinese Information Processing (《中文信息学报》), indexed in CSCD and the Peking University Core Journals list, 2012, Issue 2, pp. 28-34 (7 pages)

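Funding: National High Technology Research and Development Program of China (863 Program), grant 2010AA012505; Key Program of the National Natural Science Foundation of China, grant 60933005; National Natural Science Foundation of China, grant 60873097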

Keywords: all-words word sense disambiguation; hidden Markov model; maximum entropy Markov model; very large state problem

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]


