期刊文献+

基于隐马尔可夫模型与并行模型组合的特征补偿算法 被引量:4

Feature compensation algorithm based on hidden Markov model and parallel model combination
在线阅读 下载PDF
导出
摘要 提出了一种基于隐马尔可夫模型和并行模型组合的特征补偿算法.首先,利用一个包含较多状态的隐马尔可夫模型来描述全部单词特征向量的分布.然后,根据静音段估计的噪声均值和方差,采用并行模型组合方法调整隐马尔可夫模型的均值向量和协方差矩阵,使之与识别环境相匹配.最后,根据基于状态转移矩阵压缩的前向后向算法计算隐马尔可夫模型的后验概率,并通过最小均方误差准则估计纯净语音特征向量.实验结果表明,该算法能够更加准确地估计纯净语音特征向量,其性能明显优于基于高斯混合模型的特征补偿算法;状态转移矩阵压缩算法可以在不影响补偿精度的前提下,显著减少前向后向算法的计算量. A feature compensation algorithm based on hidden Markov model (HMM) and parallel model combination (PMC) is presented. Firstly, a HMM composed of a number of states is employed to represent the distribution of the speech features of all words. Then, according to the mean and covariance of noise from noise-only frames, the mean vectors and covariance matrices of the HMM are transformed to the testing condition by the PMC method. Finally, the posterior probability of HMM is computed by the forward-backward algorithm based on the compression of the state transition matrix, and the clean speech feature is calculated by the minimum mean squared error method. The experimental results show that the proposed algorithm can restore the clean speech feature more accurately and outperforms the feature compensation algorithm based on Gaussian mixture model (GMM). Besides, the state transition matrix compression method can greatly reduce the computational cost of the forward-backward algorithm without decreasing the compensation performance.
作者 吕勇 吴镇扬
出处 《东南大学学报(自然科学版)》 EI CAS CSCD 北大核心 2009年第5期889-893,共5页 Journal of Southeast University:Natural Science Edition
基金 国家重大基础研究发展计划(973计划)资助项目(2002CB312102) 国家自然科学基金资助项目(60672094)
关键词 语音识别 特征补偿 隐马尔可夫模型 并行模型组合 speech recognition feature compensation hidden Markov model parallel model combination
  • 相关文献

参考文献11

  • 1Nasersharif B, Akbari A. SNR-dependent compression of enhanced Mel sub-band energies for compensation of noise effects on MFCC features [J ]. Pattern Recognition Letters, 2007,28( 11 ) : 1320 - 1326.
  • 2赵蕤,王作英.语音识别中信道和噪音的联合补偿[J].声学学报,2006,31(5):466-470. 被引量:11
  • 3Cui X, Alwan A. Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR [ J ]. IEEE Transactions on Speech and Audio Processing, 2005, 13(6) : 1161 -1172.
  • 4Barreaud V, Illina I, Fohr D. On-line stochastic matching compensation for non-stationary noise [ J ]. Computer Speech and Language, 2008, 22 ( 3 ) : 207 - 229.
  • 5Moreno P J. Speech recognition in noisy environments [ D]. Pittsburgh, Pennsylvania, USA: Carnegie Mellon University, 1996: 79 - 126.
  • 6Kim W, Kwon O, Ko H. PCMM-based feature compensation schemes using model interpolation and mixture sharing [ C ]//IEEE International Conference on Acoustics, Speech, and Signal Processing. Montreal, Canada, 2004:989-992.
  • 7Kim W, Hansen J H L. Feature compensation in the cepstral domain employing model combination [ J ]. Speech Communication, 2009, 51 (2) : 83 - 96.
  • 8Sasou A, Asano F, Nakamura S, et al. HMM-based noise-robust feature compensation [ J]. Speech Communication, 2006, 48 (9) : 1100 - 1111.
  • 9Gales M J F, Young S J. Robust speech recognition in additive and convolutional noise using parallel model combination [ J ]. Computer Speech and Language, 1995, 9(4): 289-307.
  • 10孙暐,吴镇扬.基于独立感知理论的鲁棒语音识别算法[J].东南大学学报(自然科学版),2005,35(4):506-509. 被引量:2

二级参考文献25

  • 1Gales M, Young S. Cepstral parameter compensation for HMM recognition in noise[J]. Computer Speech and Language, 1993, 12(3): 231-239.
  • 2Leggetter C J, Woodland P C. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models[J]. Computer Speech and Language, 1995, 9(2): 171-185.
  • 3Gales M J F, Woodland P C. Mean and variance adaptation within the MLLR framework[J]. Computer Speech and Language, 1996, 10(4): 249-264.
  • 4Allen J B. How do humans process and recognize speech[J]. IEEE Transactions on Speech and Audio Processing, 1994, 2(4): 567-577.
  • 5Sharma S R. Multi stream approach to robust speech recognition[D]. Portland, USA: Oregon Graduate Institute of Science and Technology, 1999.
  • 6Tibrewala S, Hermansky H. Sub-band based recognition of noisy speech[A]. In: Proc ICASSP'97[C]. Munich, Germany, 1997. 1255-1258.
  • 7Hermansky H, Tibrewala S, Pavel M.Towards ASR on partially corrupted speech[A]. In: Proc ICSLP'96[C]. Philadelphia, USA, 1996. 462-465.
  • 8Ji M, Smith F J. A probabilistic union model for subband based robust speech recognition[A]. In: Proc ICASSP'00[C]. Istanbul, Turkey, 2000. 1787-1790.
  • 9Ris C, Dupont S. Assessing local noise level estimation methods: application to noise robust ASR[J]. Speech Communication, 2001, 34: 141-158.
  • 10Hirsh H G. Estimation of noise spectrum and its application to SNR estimation and speech enhancement (TR-93-012)[R]. Berkeley, USA: International Computer Science Institute, 1993.

共引文献11

同被引文献54

引证文献4

二级引证文献24

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部