Phrase Parses Reranking Based on Higher-Order Lexical Dependencies


Abstract: Previous work on syntactic parsing has shown that lexical dependency information helps phrase-structure parsing, but existing studies have used only first-order lexical dependencies. This paper proposes a model that reranks phrase-structure trees using higher-order lexical dependencies. The model first generates a constrained search space for the input sentence (for example, an N-best list of parse trees or a parse forest), then extracts higher-order lexical dependency features within that space, uses those features to rerank the candidate phrase-structure trees, and finally selects the best tree. Experiments on the Penn Chinese Treebank show that the model reaches a highest F1 score of 85.74%, exceeding the best results previously reported on that treebank. The accuracy of the dependency trees derived from the phrase-structure trees also improves substantially.
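The reranking pipeline described in the abstract can be illustrated with a minimal sketch over an N-best list. This is not the authors' implementation: it assumes each candidate has already been converted to a dependency structure (a list of head indices), and the feature templates (`first`, `grand`, `sib`), the toy weights, and the names `dependency_features`, `score`, and `rerank` are all hypothetical choices made for illustration.

```python
from collections import Counter


def dependency_features(words, heads):
    """Count first-order and higher-order lexical dependency features.

    words: list of tokens. heads[i] is the 1-based index of the head of
    token i+1, with 0 marking the root (an assumed encoding).
    """
    def word(i):
        return "<ROOT>" if i == 0 else words[i - 1]

    feats = Counter()
    children = {}
    for mod, head in enumerate(heads, start=1):
        # First-order feature: a single head-modifier pair.
        feats[("first", word(head), word(mod))] += 1
        children.setdefault(head, []).append(mod)
        if head != 0:
            # Higher-order feature: grandparent, head, modifier.
            grand = heads[head - 1]
            feats[("grand", word(grand), word(head), word(mod))] += 1
    for head, mods in children.items():
        left = [m for m in mods if m < head]
        right = [m for m in mods if m > head]
        for side in (left, right):
            # Higher-order feature: adjacent siblings on the same side.
            for a, b in zip(side, side[1:]):
                feats[("sib", word(head), word(a), word(b))] += 1
    return feats


def score(feats, weights, base_logprob, base_weight=1.0):
    """Linear score: weighted base parser score plus feature weights."""
    return base_weight * base_logprob + sum(
        weights.get(f, 0.0) * count for f, count in feats.items()
    )


def rerank(words, candidates, weights):
    """Return the highest-scoring candidate from the N-best list."""
    return max(
        candidates,
        key=lambda c: score(
            dependency_features(words, c["heads"]), weights, c["logprob"]
        ),
    )


# Toy example: two candidate analyses that differ in where "slowly" attaches.
words = ["he", "eats", "fish", "slowly"]
candidates = [
    {"heads": [2, 0, 2, 2], "logprob": -10.2},  # "slowly" modifies "eats"
    {"heads": [2, 0, 2, 3], "logprob": -10.0},  # "slowly" modifies "fish"
]
# Hand-set weights; a real model would learn these from a treebank.
weights = {
    ("sib", "eats", "fish", "slowly"): 1.5,
    ("grand", "eats", "fish", "slowly"): -1.0,
}
best = rerank(words, candidates, weights)
```

In this toy case the base parser slightly prefers the second candidate, but the sibling and grandparent features move the first candidate to the top, which is the kind of correction a first-order model cannot express because it scores each head-modifier pair in isolation.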

Authors: WANG Zhiguo (王志国), ZONG Chengqing (宗成庆)

Affiliation: National Laboratory of Pattern Recognition (Institute of Automation, Chinese Academy of Sciences)

Source: Journal of Software (《软件学报》), indexed in EI, CSCD, and the Peking University Core Journals list; 2012, No. 10, pp. 2628-2642 (15 pages)

Funding: National Natural Science Foundation of China (60975053, 61003160); External Cooperation Program of the Chinese Academy of Sciences

Keywords: phrase structure; dependency structure; parse reranking; higher-order lexical dependencies; parse forest

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]



