摘要
本文在两部现代汉语词典和一个大规模分词语料库的基础上,对现代汉语的同音词现象进行了调查。基于词典的统计描述了汉语同音词按不同音节数分布的数量和比率,计算了汉语拼音的载词量、同音率和同音度等。根据语料库的调查进一步揭示了不同音节数的词频分布规律。
This article investigates Chinese homophones, based on the statistics of two dictionaries and one large scale tokenized balanced corpus. From the statistical result of two dictionaries, the characteristics of homophones with varied number of syllables are concluded and data including word types per pinyin, homophone ratio and homophone degree are acquired. From the statistical result of NCC corpus, this article analyzes the distribution pattern of word types with varied number of syllables.
出处
《语言文字应用》
CSSCI
北大核心
2009年第4期132-142,共11页
Applied Linguistics
关键词
同音词
拼音载词量
同音率
同音度
homophone
word-types per pinyin
homophone ratio
homophone degree