摘要
为实现篇章连贯语义关系的判定与自动标注,提出一种综合运用关联词多种语法信息的自动标注方法。该方法利用关联词的词性分布规则排除非关联词,标注出潜在关联词,对比关联词库中的模式表,并综合利用搭配距离、搭配强度和句法位置获取合法的篇章连贯模式,在此基础上标注出其语义关系。通过实验验证了该方法的有效性。
This paper provides a method of the automatic annotation by means of the synthetic use of the grammatical information to realize the judgment and automatic annotation of the semantic relationship of textual coherence.The distribution rules of the parts of speech are used to eliminate the non-conjunctions,and the potential conjunctions are tagged.The pattern of the textual coherence is obtained by the synthetic use of the collocation distance,collocation strength and syntactic position after matching the pattern in the corpus of conjunctions.Based on the above data,the semantic relationship is tagged.Experiment shows that the method is effective.
出处
《计算机工程》
CAS
CSCD
2012年第7期131-133,共3页
Computer Engineering
基金
国家自然科学基金资助项目(60773167)
教育部人文社科重点研究基金资助重大项目(10JJD740012)
关键词
篇章连贯
语义关系
搭配距离
搭配强度
句法规则
自动标注
textual coherence
semantic relationship
collocation distance
collocation strength
syntactic rule
automatic annotation