期刊文献+

基于MDT特征补偿的噪声鲁棒语音识别算法 被引量:2

Robust noise feature compensation method for speech recognition based on missing data technology
原文传递
导出
摘要 针对噪声环境下语音识别系统性能下降的问题,提出一种基于语音时频相关性的Mel特征矢量聚类补偿算法。该算法首先实现掩码估计,利用纯净语音信号时域和频域的相关性,实现了时频块的有效划分和基于时频块的语音特征聚类。在此基础上,对带噪语音的Mel语谱进行特征补偿。采用HTK工具和TIDIGITS数据库加入不同类别噪声的语音测试结果表明:该算法在不同信噪比条件下,获得了较基于频域相关性聚类特征补偿算法更好的性能。 The performance of automatic speech recognition systems declines dramatically in noisy environments.This paper presents a missing data technology(MDT) feature compensation method based on a time-frequency correlation(TF-CBFC) to improve the speech recognition.This method first estimates the missing data masks and then divides the spectrogram into time-frequency blocks(TFBs).Clusters of the TFBs are used to compensate for missing data in the Mel spectrogram of noisy speech.Tests with the HTK tools on the TIDIGITS database using different classes of noise show that this method outperforms the MDT feature compensation method based on the spectral relations for various signal to noise ratios(SNRs).
出处 《清华大学学报(自然科学版)》 EI CAS CSCD 北大核心 2013年第6期753-756,共4页 Journal of Tsinghua University(Science and Technology)
基金 国家自然科学基金项目(61271309)
关键词 语音识别 缺失数据技术 隐Markov模型(HMM) 特征补偿 speech recognition missing data technology hidden Markov model(HMM) feature compensation
  • 相关文献

参考文献13

  • 1Cooke M, Green P, Crawford M D. Handling missing data in speech recognition[C]//Proc ICSLP. Yokohama, Japan: IEEE Press, 1994:1555 - 1558.
  • 2Hermansky H. Perceptual linear predictive (PLP) analysis of speech [J]. Journal of the Acoustical Society of America, 1990, 87:1738-1752.
  • 3Gales M J F, Young S J. Robust continuous speech recognition using parallel model combination [J]. IEEE Transactions on Speech and Audio Processing,, 1996, 4(5) : 352 -359.
  • 4Moreno P J, Raj B, Stern R M. A vector Taylor series approach for environment independent speech recognition [C]//Proc ICASSP. Atlanta, GA, USA.- IEEE Press, 1996(2) ~ 73:3-736.
  • 5Cooke M, Green P, Josifovski L, et al. Robust automatic speech recognition with missing and unreliable acoustic data [J]. Speech Communication, 2001, 34(3) : 267 - 285.
  • 6Vizinho A, Green P, Cooke M, et al. Missing data theory, spectral subtraction and signal-to-noise estimation for robust ASR: An integrated study [C]// Proc Eurospeech. Budapest, Hungary, 1999:2407- 2410.
  • 7Seltzer M L, Raj B, Stern R M. A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition [J]. Speech Communication, 2004, 43(4): 379-393.
  • 8Raj B, Seltzer M L, Stern R M. Reconstruction of missing features for robust speech recognition [J]. Speech Communication, 2004, 43(4) : 275 - 296.
  • 9Raj B, Stern R M. Missing-feature approaches in speech recognition [J]. Signal Processing Magazine, IEEE , 2005, 22(5) :101 - 116.
  • 10Raj B, Seltzer M L, Stern R M. Robust speech recognition: The case for restoring missing features[C]//Proc Eurospeech: The Workshop on Consistent and Reliable Acoustic Cues. Aalborg, Denmark, 2001.

同被引文献10

引证文献2

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部