期刊文献+

基于高阶词汇依存的短语结构树重排序模型 被引量:3

Phrase Parses Reranking Based on Higher-Order Lexical Dependencies
在线阅读 下载PDF
导出
摘要 在句法分析中,已有研究工作表明,词汇依存信息对短语结构句法分析是有帮助的,但是已有的研究工作都仅局限于使用一阶的词汇依存信息.提出了一种使用高阶词汇依存信息对短语结构树进行重排序的模型,该模型首先为输入句子生成有约束的搜索空间(例如,N-best句法分析树列表或者句法分析森林),然后在约束空间内获取高阶词汇依存特征,并利用这些特征对短语结构候选树进行重排序,最终选择出最优短语结构分析树.在宾州中文树库上的实验结果表明,该模型的最高F1值达到了85.74%,超过了目前在宾州中文树库上的最好结果.另外,在短语结构分析树的基础上生成的依存结构树的准确率也有了大幅提升. The existing works on parsing show that lexical dependencies are helpful for phrase tree parsing.However,only first-order lexical dependencies have been employed and investigated in previous research.This paper proposes a novel method for employing higher-order lexical dependencies for phrase tree evaluation.The method is based on a parse reranking framework,which provides a constrained search space(via N-best lists or parse forests) and enables the parser to employ relatively complicated lexical dependency features.The models are evaluated on the UPenn Chinese Treebank.The highest F1 score reaches 85.74% and has outperformed all previously reported state-of-the-art systems.The dependency accuracy of phrase trees generated by the parser has been significantly improved as well.
出处 《软件学报》 EI CSCD 北大核心 2012年第10期2628-2642,共15页 Journal of Software
基金 国家自然科学基金(60975053 61003160) 中国科学院对外合作交流项目
关键词 短语结构 依存结构 句法重排序 高阶词汇依存关系 句法森林 phrase structure dependency structure parse reranking higher-order lexical dependencies parse forest
  • 相关文献

参考文献38

  • 1Zong CQ. Statistical Natural Language Processing. Beijing: Tsinghua University Press, 2008. 147-189.
  • 2Klein D, Manning CD. Accurate unlexicalized parsing. In: Proc. of the ACL 2003. Association for Computational Linguistics, 2003. 423-430. http://aclweb.org/anthology-new/P/P03/[doi: 10.3115/1075096.1075150].
  • 3Jurafsky D, Martin JH. Speech and Language Processing: An Introduction to Natural Language Processing. 2nd ed., Prentice Hall, 2008. http://www.cs.colorado.edu/-martin/slp.html.
  • 4Matsuzaki T, Miyao Y, Tsujii J. Probabilistic CFG with latent annotations. In: Proc. of the ACL 2005. Ann Arbor: Association for Computational Linguistics, 2005.75-82. http://aclweb.org/anthology-new/P/P05/[doi: 10.3115/1219840.1219850].
  • 5Petrov S, Barrett L, Thibaux R, Klein D. Learning accurate, compact, and interpretable tree annotation. In: Proc. of the COLING-ACL 2006. Sydney: Association for Computational Linguistics, 2006. 433-440. http://aclweb.org/anthology-new/P/P06/ [doi: 10.3115/1220175.1220230].
  • 6Petrov S, Klein D. Improved inference for unlexicalized parsing. In: Proc. of the NAACL-HLT 2007. Rochester: Association for Computational Linguistics, 2007.404-411. http://aclweb.org/anthology-new/N/N07/.
  • 7Bikel DM. Intricacies of Collins' parsing model. Computational Linguistics, 2004,30(4):479-511. [doi: 10.1162/08912010 42544929].
  • 8Charniak E. A maximum-entropy-inspired parser. In: Proc. of the NAACL 2000. Association for Computational Linguistics, 2000. 132-139. http://aclweb.org/anthology-new/A/A00/.
  • 9Collins M. Head-Driven statistical models for natural language parsing [Ph.D. Thesis]. Philadelphia: University of Pennsylvania, 1999.
  • 10Charniak parser, http://bllip.cs.brown.edu/download/reranking-parserAug06.tar.gz.

同被引文献28

  • 1王锦,陈群秀.现代汉语语义资源用于短语歧义模式消歧研究[J].中文信息学报,2007,21(5):80-86. 被引量:9
  • 2http://www.cipsc.org.cn/clp2010/task2_en.htm.
  • 3http://crfpp.googlecode.com/svn/trunk/doc/index.html.
  • 4http://nlp.cs.nyu.edu/evalb/.
  • 5S Abney.Parsing by Chunks[J].Principle-Based Parsing,1991:257-278.
  • 6Lance A Ramshaw,Mitchell P Marcus.Text Chunking Using Transformation-Based Learning[C]//Proceedings of the Third ACL Workshop on Very Large Corpora,1995:87-88.
  • 7Adwait Ratnaparkhi.Learning to Parse Natural Language with Maximum Entropy Models[J].Machine Learning,1999,34(1-3):151-175.
  • 8K Sagae,A Lavie.A classifier-based parser with linear run-time complexity[C]//Proceedings of the IWPT'05,2005:125-132.
  • 9Pascale Fung,Grace Ngai,YongSheng Yang,and BenFeng Chen.A Maximum-Entropy Chinese Parser Augmented by Transformation-Based Learning[C]//Proceedings of the ACM Transactions on Asian Language Information Processing,2004:4-8.
  • 10Mengqiu Wang,Kenji Sagae,and Teruko Mitamura.A fast,accurate deterministic parser for Chinese[C]//Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the ACL,2006:425-432.

引证文献3

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部