期刊文献+

基于共振峰模式的汉语普通话中韵母发音水平客观测试方法的研究 被引量:16

Objective evaluation of vowels of standard Chinese pronunciation based on formant pattern
原文传递
导出
摘要 提出了一种以元音的共振峰模式为特征基于支持向量机算法的分类评估方法,用以对汉语普通话中的韵母发音水平进行客观测试。此算法为每个韵母分别训练全分类模型、子分类模型和评估模型,在两级分类的基础上对发音水平进行测试打分。实验结果表明,全分类模型可以达到90%以上的分类正确率,客观测试与专家主观评估的相似度达到82%,在性能上超过了传统的以倒谱系数为特征的隐含马尔科夫模型方法。 A method used for objective evaluation of pronunciation of vowels in standard Chinese is presented. The formant patterns of vowels are selected as the main features and an improved evaluation algorithm based on Support Vector Machine is proposed. In this algorithm, two-level classification strategy is employed. A full-classification model and a sub-classification model are trained for each vowel. The pronunciation quality is evaluated based on the classification results of this two-level strategy with evaluation model of each vowel. The new evaluation method is compared with traditional methods such as HMM posterior probability scoring method and feature of Mel-Frequency Cepstrum Coefficients (MFCC), and the results show that the performance is effectively improved by the proposed method. The correlation of scores between human testers and machine has achieved 82%.
出处 《声学学报》 EI CSCD 北大核心 2007年第2期122-128,共7页 Acta Acustica
基金 国家973项目支持(2004CB318106)
关键词 客观测试方法 汉语普通话 共振峰 发音 韵母 支持向量机算法 分类模型 Algorithms Feature extraction Probability Standards Support vector machines
  • 相关文献

参考文献14

  • 1Kim Y, Franco H, Neumeyer L. Automatic pronunciation scoring of specific phone segments for language instruction.in Proc. of EUROSPEECH 97, Rhodes, Greece: 649-652.
  • 2Neumeyer L, FYanco H, Digalakis V, Weintraub M. Automatic scoring of pronunciation quality. Speech Communication, 2000; 30:83-93.
  • 3Neumeyer L, Franco H, Weintraub M, Price P. Automatic text-independent pronunciation scoring of foreign language student speech. Proc. of ICSLP 96, 1996, Philadelphia,Pennsylvania: 1457-1460.
  • 4Franco H, Abrash V, Precoda K, Bratt H, Rao R,Butzberger J. The SRI EduSpeak^TM System: Recognition and Pronunciation Scoring for Language Learning. Proceedings of INSTIL 2000, 2000, Dundee, Scotland: 121-125.
  • 5Philip Schmid. Explicit N-best formant features for segment-based speech recognition. PhD thesis, Computer Science and Engineering, Oregon Graduate Institute of Science & Technology, 1996:53-60.
  • 6Cortes C, Vapnik V. Support-vector networks. Machine Learning, 1995; 20:273-297.
  • 7Vapnik V, Levin E, Le Cun Y. Measuring the VCdimension of a learning machine. Neural Computation,1994; 6:851-876.
  • 8吕萍,颜永红.基于回归分析的语音识别快速自适应算法[J].声学学报,2005,30(3):222-228. 被引量:4
  • 9DONG Bin, ZHAO Qingwei, YAN Yonghong. Automatic assessment of pronunciation quality. Proceedings of 4th International Symposium on Chinese Spoken Language Processing 2004, Hong Kong, 2004:137-140.
  • 10鲍怀翘.普通话单元音的生理解释[J].中国语文,1984,2:45-45.

二级参考文献39

  • 1陆致极.试论普通话音位的区别特征[J].语文研究,1987(4):10-20. 被引量:17
  • 2张家騄.汉语普通话区别特征系统[J].声学学报,2005,30(6):506-514. 被引量:28
  • 3齐士钤 张家禄.汉语普通话辅音音长分析[J].声学学报,1982,(1):8-13.
  • 4张家禄 齐士钤 宋美珍 等.汉语声调在言语可懂度中的重要作用.声学学报,1981,7:237-237.
  • 5Helmholtz H L F von. On the sensations of tone as a physiological basis for the theory of music, Dover, New York,1954.
  • 6Seebeck A. Uber die Definition des Tones. Ann. Phys.Chem., 1843; 60: 449-481.
  • 7Békésy, G. von, Experiments in Hearing, McGraw-Hill,New York, 1960.
  • 8Schouten J F, Ritsma R J, Cordozo B J. Pitch of the residue. J. Acoust. Soc. Am., 1962; 34: 1418-1424.
  • 9Goldstein J L. An optimum processor theory for the central formation of the pitch of complex tones. J. Acoust. Soc.Am., 1973; 54:1496-1517.
  • 10Meddis R, O'Mard L. A unitary model of pitch perception.J. Acoust. Soc. Am. 1997; 102(3): 1811-1820.

共引文献54

同被引文献131

引证文献16

二级引证文献64

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部