期刊文献+

汉语语音识别中的区分性声调建模方法 被引量:4

Tone modeling based on discriminative training for Mandarin speech recognition
在线阅读 下载PDF
导出
摘要 提出从特征提取参数、模型参数对隐马尔可夫声调模型进行区分型训练,来提高声调识别率;提出模型相关的权重对谱特征模型和声调模型的概率进行加权,并根据最小音子错误区分性目标函数对权重进行训练,来提高声调模型加入连续语音识别时的性能。声调识别实验表明区分性的声调模型训练以及特征提取方法显著提高了声调识别率。区分性模型权重训练能够在声调模型加入之后进一步连续语音识别系统的识别率。 To improve tone recognition accuracy,discriminative training in both feature and model parameters for hidden Markov model based tone modeling is proposed.When incorporating tone models into continuous speech recognition,discriminative model weight training is presented.Acoustic and tone model distributions are scaled by model dependent weights trained by the mini- mum phone error criterion.Experiments show tone recognition and large vocabulary continuous speech recognition accuracy can be considerably improved by the proposed methods.
出处 《计算机工程与应用》 CSCD 北大核心 2009年第11期178-182,共5页 Computer Engineering and Applications
基金 国家自然科学基金No.60865001~~
关键词 区分性训练 声调建模 汉语语音识别 特征提取 discriminative training tone modeling Mandarin speech recognition feature extraction
  • 相关文献

参考文献8

  • 1Huang C H,Side F.Pitch tracking and tone features for mandarin speech recognition[C]//Proceedings of the 25th International Conference on Acoustics,Speech and Signal Processing,Istanbul,Turkey,2000:1523-1526.
  • 2Lei X, Siu M, Hwang M, Ostendorf M,et al.Improved tone modeling for Mandarin broadcast news speech recognition[C]//Proceedings of Interspeech (ICSLP), Pittsburgh, USA, 2006 : 1277-1280.
  • 3Povey D,Woodland P C.Minimum phone error and i-smoothing for improved discriminative training[C]//Proceedings of the 27th International Conference on Acoustics,Speech and Signal Processing, Florida, USA, 2002 : 105-108.
  • 4Povey D.Discriminative training for large vocabulary speech recognition[D].Peterhouse : Cambridge University, 2004.
  • 5Povey D, Kingsbury B, Mangu L, et al.fMPE : discriminatively trained features for speech recognition[C]//Proceedings of International Conference on Acoustics,Speech and Signal Processing,Philadelphia, USA, 2005,1 : 961-964.
  • 6Chang E,Shi Yu,Zhou Jian-lai,et al.Speech lab in a box:a Mandarin speech toolbox to jumpstart speech related research[C]//Proceedings of the 7th European Conference on Speech Communication and Technology,Aalborg,Denmark,2001:2779-2782.
  • 7Gopalakrishnan P S,Kanevsky D,Nadas A,et al.A generalization of the Baum algorithm to rational objective functions[C]//Proceedings of the 25th International Conference on Acoustics,Speech and Signal Processing, Glasgow, Seotland, 1989:631-634.
  • 8Lee T,Lau W,Wong Y W,et al.Using tone information in cantonese continuous speech recognition[J].ACM Transactions on Asian Language Information Processing, 2002, 1 ( 1 ) : 83-102.

同被引文献15

  • 1高新涛,陈乖丽.语音识别技术的发展现状及应用前景[J].甘肃科技纵横,2007,36(4):13-13. 被引量:19
  • 2YAN Long,ZHAO Ren-cai,LIU Gang,et al. Large vocabulary manda-rin Chinese continuous speech recognition system based on tonaltriphone [ C] //Proc of International Symposium on Tonal Aspects ofLanguages. 2004:28 - 31.
  • 3YOUNG S J, WOODLAND P C. State clustering in hidden Markovmodel-based continuous speech recognition [ J]. Computer Speechand Language, 1994,8(4) :369-384.
  • 4WANG Guang-sen, SIM K C. An investigation of tied-mixture GMMbased triphones state clustering [ C] //Proc of IEEE International Con-ference on Acoustics,Speech and Signal Processing.2012 -.4717-4720.
  • 5REICHL W,CHOU W. Robust decision tree state tying for continuousspeech recognition [ J]. IEEE Trans on Speech and Audio Pro-cessing,2000,8(5) ;555-566.
  • 6LIU Chao-jun,WU Xin-tian,YAN Yong-hong. High accuracy acousticmodeling using two-level decision-tree based state-tying[ C] //Proc ofthe 5 th European Conference on Speech Communication and Techno-logy. 1999:1703-1706.
  • 7WONG Y W,CHANG E. The effect of pitch and lexical tone on diffe-rent mandarin speech recognition tasks[ C]//Proc of the 7th EuropeanConference on Speech Communication and Technology. 2001:2741-2744.
  • 8SHINODA K. Speaker adaptation techniques for automatic speechrecognition[ C]//Proc of APSIPA. 2011.
  • 9詹新明,黄南山,杨灿.语音识别技术研究进展[J].现代计算机,2008,14(9):43-45. 被引量:44
  • 10倪崇嘉,刘文举,徐波.汉语大词汇量连续语音识别系统研究进展[J].中文信息学报,2009,23(1):112-123. 被引量:39

引证文献4

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部