摘要
积20余年之努力与锤炼,北京大学计算语言学研究所完成的一项科研成果"综合型语言知识库"于2007年2月通过了教育部组织的技术鉴定。鉴定结论认为"其规模、深度、质量和应用效果在我国语言工程实践中是前所未有的。该成果是以汉语为核心的多语言知识库建设中最全面、最重要的研究成果,总体上达到了国际领先水平"。本文在介绍以《现代汉语语法信息词典》为基础的综合型语言知识库的规模、构成、内容、品质和发展历程之后,陈述建设综合型语言知识库的理念,期望与读者分享在计算语言学和自然语言处理这一交叉学科领域内治学的心得与研发的经验。同时也对这项成果的应用实例进行分析,评估它的应用潜力,期望它在以汉语为核心的多语言信息处理事业的发展中起到铺路填坑或者投石问路的作用。
After accumulation and hard work for over two decades, one of the research achievements made by the Institute of Computational Linguistics at Peking University (ICL/PKU), the Comprehensive Language Knowledgebase (CLKB), passed the Technical Appraisal organized by the Ministry of Education in February 2007. The conclusion is: The scale, depth, quality and application result of CLKB are unprecedented in China's language engineering practice. This achievement is the most comprehensive and important research fruit in the building of multilanguage knowledge-base with Chinese as the center and has generally reached world-class level. This paper briefly describes the scale, composition, quality and development of CLKB based on the Grammatical Knowledge-base of Contemporary Chinese (GKB), and then lays an emphasis on illustrating the rationale of the building of CLKB, with an expectation to share the knowledge and experience with readers obtained in the study and research on the crossdisciplines--Computational Linguistics and Natural Language Processing. Meanwhile, the author also explores the application practice of this achievement and assesses its application potential in the hope of paving the path, or sending out a trial balloon, for the development of multi-language information processing techniques with Chinese as the center.
出处
《中文信息学报》
CSCD
北大核心
2007年第6期3-12,共10页
Journal of Chinese Information Processing
基金
国家973课题资助项目(2004CB318102)
关键词
计算机应用
中文信息处理
综合型语言知识库
多语言信息处理
计算语言学
自然语言处理
现代汉语语
法信息词典
治学心得
computer application
Chinese information processing
comprehensive language knowledge-base
multilanguage information processing
computational linguistics
natural language processing
grammatical kvowledgebase of contemporary Chinese
research experience