期刊文献+

关联词搭配的自动发现 被引量:12

Automatic discovery of conjunctions' collocation pattern
在线阅读 下载PDF
导出
摘要 提出了关联词搭配模式自动发现的基本方法。建立一个大规模语料库,然后作分词处理,并对关联词进行自动标注和人工校对;评估关联词搭配的三个重要参数(搭配距离、搭配强度MI值、搭配强度Z值),并设定阈值,超过阈值的格式自动作为候选搭配模式。通过实验,标注的准确率为88.75%,表明本方法具有较好效果。运用该方法,发现了以往大量未被注意的句法搭配模式,对研制高质量的关联词知识库起到了积极的促进作用,对复句句法、语义的自动分析具有重要的意义。 This paper provided a method of the automatic discovery of the conjunctions' collocation pattern. Built a large corpus, and it was tagged by a Chinese automatic segmenting system, and tagged and proofed the connects words artificially. Set a threshold, and regard the collocation whose parameters were above of the value as candidates for the collocation pattern. The accuracy of tagging was 88.75% ,which indicated that this method was feasible. Many syntactic patterns are discoved in the research which will promot buliding a top-quality knowledge base of connects words. And it has vital significance in automatic analysis of the syntactic and semantic of compund sentences.
出处 《计算机应用研究》 CSCD 北大核心 2011年第12期4426-4428,4432,共4页 Application Research of Computers
基金 国家自然科学基金资助项目(60703008) 国家重点实验室开放研究基金资助项目(SKLSE04-018) 教育部人文社科重点研究基地重大资助项目(10JJD740012) 湖北省科技攻关资助项目(2007AA101C49)
关键词 语料库 关联词 搭配 自动发现 corpus conjunction collocation pattern automatic discovery
  • 相关文献

参考文献10

  • 1鲁松,白硕,李素建,刘群.汉语多重关系复句的关系层次分析[J].软件学报,2001,12(7):987-995. 被引量:24
  • 2邹嘉彦,连兴隆,高维君,等.中文篇章中的关联词语及其引导的句子关系的自动标注-面向话语分析的中文篇章语料库的开发[C]//中文信息处理国际会议论文集.1998:288-297.
  • 3肖升,胡金柱,姚双云,吴锋文.面向对象有标复句本体建模[J].计算机应用研究,2010,27(2):552-554. 被引量:6
  • 4BURSTEIN J, KUKICH K, WOLFF S, et al. Enriching automated es- say scoring using discourse marking [ C ]//Proc of the 36th Annual Meeting of the Association for Computational Linguistics and 17th In- ternational Conference on Computational Linguistic. 1998: 15-21.
  • 5ANTHONY L, LASHKIA G V. Automatic identification of organiza- tional structure in writing using machine learning [ C]//Proc of the 6th International Conference on Languages for Specific Pupooses. 2001.
  • 6TEUFEL S. The structure of scientific articles [ M ]. Stanford: CSLI Publications, 2010.
  • 7CHAN W K, LAI B Y, GAO J, et al. Mining discourse markers for Chinese textual summarization[ C ] //Proc of the 6th Applied Natural Language Processing Conference and the 1st North American Chapter of the Association for Computational Linguistics. 2000 : 11-20.
  • 8LOUIS A,JOSHI A, NENKOVA A. Discourse indicators for content selection in summarization [ C ]//Proc of the 11 th Annual Meeting of the Special Interest Group on Discoume and Dialogue. 2010 : 147-156.
  • 9邢福义.汉语复句研究[M].北京:商务印书馆.2002.
  • 10MANNING CD,SCHUTZE H.统计自然语言处理基础[M].苑春法,等译.北京:电子工业出版社,2005.

二级参考文献13

  • 1陈凯,何克清,李兵,梁鹏.面向对象的本体建模研究[J].计算机工程与应用,2005,41(2):40-43. 被引量:18
  • 2胡金柱,王琳,肖明,罗旋,姚双云,罗进军.汉语复句本体模型初探[J].华中师范大学学报(自然科学版),2005,39(4):466-469. 被引量:11
  • 3胡金柱,罗旋,肖明,王琳,姚双云,罗进军.本体论在复句领域概念建模中的应用[J].计算机应用研究,2006,23(10):212-213. 被引量:10
  • 4吴竞存 侯学超.现代汉语句法分析[M].北京大学出版社,1988.232-261.
  • 5邢福义.汉语语法学[M].长春:东北师范大学出版社,2000.75-78.
  • 6GUARINO N, MASOLO C, VETERE G. OntoSeek: content-based access to the Web [ J ]. IEEE Intelligent System, 1999, 14 ( 3 ) :70- 80.
  • 7SHUN S B, MOTTA E, DOMINGUE J. ScholOnto: an ontologybased digital library server for research documents and discourse[ J]. Intl J Digital Ubrades, 2000, 3(3) :237-248.
  • 8GUARNO N, GIARETTA P. Ontologies and knowledge bases: towards a terminological clarification [ M]//MARS N. Towards very large knowledge bases:knowledge building and knowledge sharing. Amsterdam :IOS Press, 1995:25- 32.
  • 9GRUBER T. Towards principles for the design of ontologies used for knowledge sharing[ J]. International Journal of Human-Computer Studies, 1995, 43(6) :907-928.
  • 10USCHOLD M, GRUNINGER M. Ontologies: principles, methods and applications[ J]. The Knowledge Engineering Review, 1996, 2(11) :2.

共引文献48

同被引文献116

引证文献12

二级引证文献45

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部