摘要
利用中国知网(CNKI)数据,对汉语中介语语料库的应用现状进行统计分析,重点分析了"HSK动态作文语料库"的应用研究领域。分析发现,基于中介语语料库的研究主要集中于词汇偏误层面,其次是句子层面和汉字层面,关于段落篇章的研究较少。从汉语中介语语料库建设上说,资源利用主要集中于个别语料库上,造成了资源的浪费;语料库语料分布也不均衡,缺乏口语语料。语料库标注加工的广度及深度有待提高。
By studying papers from China National Knowledge Infrastructure( CNKI),this paper analyzes the status of the application of Chinese Inter- language Corpus,especially the applications of"HSK dynamic composition corpus". We find that most studies based on Chinese Inter- language Corpus focused on the vocabulary errors,some focused on the sentence errors,only a few studies involve paragraph and chapter errors.From the view of the construction of Chinese Inter- language Corpus,corpus resources focused on individual corpus,resulting in a waste of language resources and the distribution of corpus is also not balanced,speech corpus is insufficient. The annotate processing of Chinese Inter- language Corpus also need to be improved.This paper presents the development measures of Chinese Inter- language Corpus to solve the problems. We hope our study can provide a reference for the construction of the Chinese Inter- language Corpus.
出处
《曲靖师范学院学报》
2016年第2期86-91,共6页
Journal of Qujing Normal University
基金
国家语委"十二五"科研规划项目"语言资源建设规划研究"(YB125-124)
关键词
汉语中介语语料库
建设
应用
Chinese Inter-language Corpus
construction
application