期刊文献+

结合语境与布朗聚类特征的上下位关系验证 被引量:1

Hyponymy Relation Validation Combined with Context and Brown Clustering Feature
在线阅读 下载PDF
导出
摘要 对海量文本语料进行上下位语义关系自动抽取是自然语言处理的重要内容,利用简单模式匹配方法抽取得到候选上下位关系后,对其进行验证过滤是难点问题。为此,分别通过对词汇语境相似度与布朗聚类相似度计算,提出一种结合语境相似度和布朗聚类相似度特征对候选下位词集合进行聚类的上下位关系验证方法。通过对少量已标注训练语料的语境相似度和布朗聚类相似度进行计算,得到验证模型和2种相似度的结合权重系数。该方法无需借助现有的词汇关系词典和知识库,可对上下位关系抽取结果进行有效过滤。在CCF NLP&2012词汇语义关系评测语料上进行实验,结果表明,与模式匹配和上下文比较等方法相比,该方法可使F值指标得到明显提升。 Hyponymy has many important applications in the field of Natural Language Processing(NLP) and the automatic extraction of hyponym relation from massive text datasets is naturally one of important NLP research tasks.The emphasis and difficult point of the research is how to validate a hyponym which is extracted with simple pattern matching method is really correct.By calculating the context feature similarity(SimCF) and Brown clustering similarity(SimBrown),this paper proposes a novel approach of hyponymy validation.It applies a clustering on hyponym candidates,and the clustering similarity feature is obtained by combining SimCF and SimBrown.The combination coefficient of two kinds of similarity is derived based on the SimCFs and SimBrowns between all labeled training words and their hyponyms.The model can filter roughly extraction results without any existed lexical relation dictionary or knowledge base.Evaluation on CCF NLPCC2012 word semantic relation corpus shows that the proposed approach in this paper significantly improves the F measure value compared with other approaches including pattern matching and simple context comparison.
出处 《计算机工程》 CAS CSCD 北大核心 2015年第2期145-150,共6页 Computer Engineering
基金 国家自然科学基金资助项目(61163039 61163036 61363058) 西北师范大学青年教师科研能力提升计划基金资助项目(NWNU-LKQN-10-2)
关键词 上下位关系 语境相似度 布朗聚类相似度 点互信息 模式匹配 聚类验证 hyponymy relation context similarity Brown clustering similarity Point Mutual Information(PMI) pattern matching clustering validation
  • 相关文献

参考文献15

  • 1Hearst M.Automatic Acquisition of Hyponyms from Large Text Corpora[C]//Proceedings of COLING’92.New York,USA:[s.n.],1992:539-545.
  • 2Kozareva Z,Riloff E,Hovy E.Semantic Class Learning from the Web with Hyponym Pattern Linkage Graphs[C]//Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics:Human Language Technologies.Columbus,USA:[s.n.],2008:1048-1056.
  • 3Kozareva Z,Hovy E.A Semi-supervised Method to Learn and Construct Taxonomies Using the Web[C]//Proceedings of EMNLP’10.Boston,USA:[s.n.],2010:1110-1118.
  • 4Zhang Chunxia,Jiang Peng.Automatic Extraction of Definitions[C]//Proceedings of ICCSIT’09.Beijing,China:[s.n.],2009:364-368.
  • 5Westerhout E.Definition Extraction Using Linguistic and Structural Features[C]//Proceedings of the 1st Workshop on Definition Extraction.Borovets,Bulgaria:[s.n.],2009:61-67.
  • 6Akiba T,Sakai T.Japanese Hyponymy Extraction Based on a Term Similarity Graph[R].Tokyo,Japan:IPSJ SIG,Technical Reprot:2011-IFAT-104,2011.
  • 7Miller G A.Word Net:A Lexical Database for English[J].Communications of the ACM,1995,38(11):39-41.
  • 8Suchanek F M,Kasneci G,Weikum G.Yago:A Large Ontology from Wikipedia and Word Net[J].Web Semantics:Science,Services and Agents on the World Wide Web,2008,6(3):203-217.
  • 9Boella G,di Caro L.Extracting Definitions and Hypernym Relations Relying on Syntactic Dependencies and Support Vector Machines[C]//Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics.Sofia,Bulgaria:[s.n.],2013:532-537.
  • 10Zhang Fan,Shi Shuming,Liu Jing,et al.Nonlinear Evidence Fusion and Propagation for Hyponymy Relation Mining[C]//Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics.Portland,USA:[s.n.],2011,1159-1168.

二级参考文献2

共引文献13

同被引文献24

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部