摘要
双语对齐的平行语料库在机器翻译和自然语言处理领域中扮演着非常重要的角色,它的研究和制作具有重要的理论意义和实用价值,双语语料的建设十分必要,其中双语对齐是最基本的环节。文章首先简要介绍了语料库的建设情况,然后结合主流的句子对齐方法提出并实现了基于词典和语言学信息的英汉双语句子对齐。
Bilingual corpus plays a very important part in machine translation and natural language process field, Building and research on it both have theoretic significance and practical value. Therefore, it's necessary to build a large-scale bilingual corpus, Bilingual Alignment is the basic connection, This paper firstly gives a brief discussion on the construction of the corpus, secondly discuss the main ways of sentence alignment, and finally pose another aligning method base on lexical and language information.
出处
《电脑与信息技术》
2008年第1期5-7,共3页
Computer and Information Technology
基金
福建省自然科学基金项目(2006J0043)
关键词
双语语料库
平行
对齐
机器翻译
bilingual corpus
parallel
alignment
machine translation