The Rationale of Building the Comprehensive Language Knowledge-base and the Significance of Its Achievements
Abstract: After more than twenty years of sustained effort, the Comprehensive Language Knowledge-base (CLKB), a research achievement of the Institute of Computational Linguistics at Peking University (ICL/PKU), passed the technical appraisal organized by the Ministry of Education in February 2007. The appraisal concluded: "Its scale, depth, quality, and application results are unprecedented in China's language engineering practice. It is the most comprehensive and most important research result in the construction of multilingual knowledge bases centered on Chinese, and on the whole it has reached an internationally leading level." This paper first describes the scale, composition, content, quality, and development history of CLKB, which is built on the Grammatical Knowledge-base of Contemporary Chinese (GKB, 《现代汉语语法信息词典》). It then sets out the rationale behind building CLKB, in the hope of sharing with readers the scholarly insights and development experience gained in the interdisciplinary field of computational linguistics and natural language processing. It also analyzes application examples of CLKB and assesses its application potential, in the hope that it will help pave the way, or at least test the waters, for multilingual information processing centered on Chinese.
Author: YU Shiwen (俞士汶)
Source: Journal of Chinese Information Processing (《中文信息学报》), indexed in CSCD and the Peking University Core Journals list, 2007, No. 6, pp. 3-12 (10 pages)
Funding: National 973 Program of China (grant 2004CB318102)
Keywords: computer application; Chinese information processing; comprehensive language knowledge-base; multilingual information processing; computational linguistics; natural language processing; Grammatical Knowledge-base of Contemporary Chinese; research experience