基于隐马尔可夫模型与并行模型组合的特征补偿算法被引量：4

Feature compensation algorithm based on hidden Markov model and parallel model combination

下载PDF

导出

摘要提出了一种基于隐马尔可夫模型和并行模型组合的特征补偿算法.首先,利用一个包含较多状态的隐马尔可夫模型来描述全部单词特征向量的分布.然后,根据静音段估计的噪声均值和方差,采用并行模型组合方法调整隐马尔可夫模型的均值向量和协方差矩阵,使之与识别环境相匹配.最后,根据基于状态转移矩阵压缩的前向后向算法计算隐马尔可夫模型的后验概率,并通过最小均方误差准则估计纯净语音特征向量.实验结果表明,该算法能够更加准确地估计纯净语音特征向量,其性能明显优于基于高斯混合模型的特征补偿算法;状态转移矩阵压缩算法可以在不影响补偿精度的前提下,显著减少前向后向算法的计算量. A feature compensation algorithm based on hidden Markov model （HMM） and parallel model combination （PMC） is presented. Firstly, a HMM composed of a number of states is employed to represent the distribution of the speech features of all words. Then, according to the mean and covariance of noise from noise-only frames, the mean vectors and covariance matrices of the HMM are transformed to the testing condition by the PMC method. Finally, the posterior probability of HMM is computed by the forward-backward algorithm based on the compression of the state transition matrix, and the clean speech feature is calculated by the minimum mean squared error method. The experimental results show that the proposed algorithm can restore the clean speech feature more accurately and outperforms the feature compensation algorithm based on Gaussian mixture model （GMM）. Besides, the state transition matrix compression method can greatly reduce the computational cost of the forward-backward algorithm without decreasing the compensation performance.

作者吕勇吴镇扬

机构地区东南大学信息科学与工程学院

出处《东南大学学报（自然科学版）》 EI CAS CSCD 北大核心 2009年第5期889-893,共5页 Journal of Southeast University：Natural Science Edition

基金国家重大基础研究发展计划(973计划)资助项目(2002CB312102) 国家自然科学基金资助项目(60672094)

关键词语音识别特征补偿隐马尔可夫模型并行模型组合 speech recognition feature compensation hidden Markov model parallel model combination

分类号 TN912.34 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献11

1Nasersharif B, Akbari A. SNR-dependent compression of enhanced Mel sub-band energies for compensation of noise effects on MFCC features [J ]. Pattern Recognition Letters, 2007,28( 11 ) : 1320 - 1326.
2赵蕤,王作英.语音识别中信道和噪音的联合补偿[J].声学学报,2006,31(5):466-470. 被引量：11
3Cui X, Alwan A. Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR [ J ]. IEEE Transactions on Speech and Audio Processing, 2005, 13(6) : 1161 -1172.
4Barreaud V, Illina I, Fohr D. On-line stochastic matching compensation for non-stationary noise [ J ]. Computer Speech and Language, 2008, 22 ( 3 ) : 207 - 229.
5Moreno P J. Speech recognition in noisy environments [ D]. Pittsburgh, Pennsylvania, USA: Carnegie Mellon University, 1996: 79 - 126.
6Kim W, Kwon O, Ko H. PCMM-based feature compensation schemes using model interpolation and mixture sharing [ C ]//IEEE International Conference on Acoustics, Speech, and Signal Processing. Montreal, Canada, 2004:989-992.
7Kim W, Hansen J H L. Feature compensation in the cepstral domain employing model combination [ J ]. Speech Communication, 2009, 51 (2) : 83 - 96.
8Sasou A, Asano F, Nakamura S, et al. HMM-based noise-robust feature compensation [ J]. Speech Communication, 2006, 48 (9) : 1100 - 1111.
9Gales M J F, Young S J. Robust speech recognition in additive and convolutional noise using parallel model combination [ J ]. Computer Speech and Language, 1995, 9(4): 289-307.
10孙暐,吴镇扬.基于独立感知理论的鲁棒语音识别算法[J].东南大学学报（自然科学版）,2005,35(4):506-509. 被引量：2

二级参考文献25

1Gales M, Young S. Cepstral parameter compensation for HMM recognition in noise[J]. Computer Speech and Language, 1993, 12(3): 231-239.
2Leggetter C J, Woodland P C. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models[J]. Computer Speech and Language, 1995, 9(2): 171-185.
3Gales M J F, Woodland P C. Mean and variance adaptation within the MLLR framework[J]. Computer Speech and Language, 1996, 10(4): 249-264.
4Allen J B. How do humans process and recognize speech[J]. IEEE Transactions on Speech and Audio Processing, 1994, 2(4): 567-577.
5Sharma S R. Multi stream approach to robust speech recognition[D]. Portland, USA: Oregon Graduate Institute of Science and Technology, 1999.
6Tibrewala S, Hermansky H. Sub-band based recognition of noisy speech[A]. In: Proc ICASSP'97[C]. Munich, Germany, 1997. 1255-1258.
7Hermansky H, Tibrewala S, Pavel M.Towards ASR on partially corrupted speech[A]. In: Proc ICSLP'96[C]. Philadelphia, USA, 1996. 462-465.
8Ji M, Smith F J. A probabilistic union model for subband based robust speech recognition[A]. In: Proc ICASSP'00[C]. Istanbul, Turkey, 2000. 1787-1790.
9Ris C, Dupont S. Assessing local noise level estimation methods: application to noise robust ASR[J]. Speech Communication, 2001, 34: 141-158.
10Hirsh H G. Estimation of noise spectrum and its application to SNR estimation and speech enhancement (TR-93-012)[R]. Berkeley, USA: International Computer Science Institute, 1993.

共引文献11

1孙暐,吴镇扬.多带同步模型用于噪声环境下语音识别[J].中国工程科学,2006,8(3):31-34.
2王欢良,钱瑶,F.K.Soong,韩纪庆.基于声调建模的带噪汉语数字串语音识别[J].声学学报,2007,32(5):454-460. 被引量：2
3马会丽,唐红,赵国锋.电话外呼系统的研究与实现[J].计算机应用,2007,27(9):2343-2345. 被引量：5
4张军,韦岗,余华.基于特征分量输出概率加权的多数据流鲁棒语音识别方法[J].声学学报,2008,33(2):102-108. 被引量：2
5王智国,吴及,戴礼荣,王仁华.一种对加性噪声和信道函数联合补偿的模型估计方法[J].声学学报,2008,33(3):238-243. 被引量：5
6曾毓敏,吴镇扬.基于浊音语音谐波谱子带加权重建的抗噪声说话人识别[J].东南大学学报（自然科学版）,2008,38(6):935-941. 被引量：5
7张岩,李风华,李整林,张仁和.爆炸信号中气泡脉动去除方法及其应用[J].声学学报,2009,34(2):124-130. 被引量：5
8ZHANG Jun WEI Gang YU Hua NING Genxin.Robust multi-stream speech recognition based on weighting the output probabilities of feature components[J].Chinese Journal of Acoustics,2009,28(3):269-279. 被引量：4
9吕勇,吴镇扬.基于最大似然多项式回归的鲁棒语音识别[J].声学学报,2010,35(1):88-96. 被引量：3
10LU Yong WU Zhenyang.Maximum likelihood polynomial regression for robust speech recognition[J].Chinese Journal of Acoustics,2011,30(3):358-370.

同被引文献54

1李珀瀚,何震瀛,向河林.一种基于链接聚类的查询扩展算法[J].计算机研究与发展,2011,48(S3):197-204. 被引量：2
2刘群,张华平,俞鸿魁,程学旗.基于层叠隐马模型的汉语词法分析[J].计算机研究与发展,2004,41(8):1421-1429. 被引量：202
3Gai-TaiHuang,Hsiu-HsenYao.Chinese Question-Answering System[J].Journal of Computer Science & Technology,2004,19(4):479-488. 被引量：2
4林亚平,刘云中,周顺先,陈治平,蔡立军.基于最大熵的隐马尔可夫模型文本信息抽取[J].电子学报,2005,33(2):236-240. 被引量：49
5张宇,刘挺,文勖.基于改进贝叶斯模型的问题分类[J].中文信息学报,2005,19(2):100-105. 被引量：47
6文勖,张宇,刘挺,马金山.基于句法结构分析的中文问题分类[J].中文信息学报,2006,20(2):33-39. 被引量：84
7卢志茂,刘挺,李生.统计词义消歧的研究进展[J].电子学报,2006,34(2):333-343. 被引量：28
8余正涛,樊孝忠,郭剑毅,耿增民.基于潜在语义分析的汉语问答系统答案提取[J].计算机学报,2006,29(10):1889-1893. 被引量：46
9孙景广,蔡东风,吕德新,董燕举.基于知网的中文问题自动分类[J].中文信息学报,2007,21(1):90-95. 被引量：41
10刘挺,车万翔,李生.基于最大熵分类器的语义角色标注[J].软件学报,2007,18(3):565-573. 被引量：73

引证文献4

1吕勇,吴镇扬.基于矢量泰勒级数的鲁棒语音识别[J].天津大学学报,2011,44(3):261-265. 被引量：4
2牛铜,李弼程,张连杰.基于缺失数据补偿的鲁棒语音识别[J].信息工程大学学报,2012,13(4):411-415.
3张宁,朱礼军.中文问答系统问句分析研究综述[J].情报工程,2016,2(1):32-42. 被引量：14
4李聪,葛洪伟.自适应并行模型组合的鲁棒语音身份识别算法[J].信号处理,2018,34(7):867-875. 被引量：6

二级引证文献24

1尹全海.江泽民新安全观初探[J].信阳师范学院学报（哲学社会科学版）,2000,20(1):7-12.
2沈崇德,童思木.医院智能语音客户服务系统的创新研究与应用示范[J].中国医学装备,2013,10(1):71-73. 被引量：7
3ZHANG Yi,HE Chun-jiang,LUO Yuan,CHEN Kai,XING Wu-chao.Improved perceptually non-uniform spectral compression for robust speech recognition[J].The Journal of China Universities of Posts and Telecommunications,2013,20(4):122-126. 被引量：1
4张毅,何春江,罗元,徐晓东,童开国.基于改进感知非均匀谱压缩的鲁棒语音识别算法[J].信息与控制,2013,42(5):565-569. 被引量：1
5万义龙,张天骐,王志朝,金静.基于多频带谱减法的抗噪声语音识别研究[J].电视技术,2013,37(23):183-187. 被引量：5
6乔霈,王素格,陈鑫,谭红叶,陈千,王元龙.基于词语关联的散文阅读理解问题答案获取方法[J].中文信息学报,2018,32(3):135-142. 被引量：5
7贺佳,杜建强,聂斌,熊旺平,罗计根.智能问答系统在医学领域的应用研究[J].医学信息,2018,31(14):16-19. 被引量：4
8孙泽健,司光亚,刘洋.面向兵棋演习的问答系统问句分类模型研究[J].计算机与数字工程,2019,47(2):308-313. 被引量：4
9李聪,葛洪伟.非线性幂变换Gammachirp滤波器的鲁棒语音特征提取[J].计算机科学与探索,2019,13(8):1351-1359. 被引量：3
10张靖,俞一彪.具有环境自学习机制的鲁棒说话人识别算法[J].通信技术,2020,53(3):618-624. 被引量：2

1曹晖,王瑾,柏鹏,林治国.基于DSP的LDPC码通用快速编码器设计[J].电视技术,2012,36(23):54-56. 被引量：1
2金连斌,丁庆海,陈显治.PMC在噪声环境下的语音识别中的应用[J].解放军理工大学学报（自然科学版）,2001,2(2):42-45. 被引量：1
3邱作春.麦克风阵列语音增强用于抗噪说话人识别[J].大众科技,2008,10(12):35-37.
4胡旭琰,邹月娴,王文敏.基于MDT特征补偿的噪声鲁棒语音识别算法[J].清华大学学报（自然科学版）,2013,53(6):753-756. 被引量：2
5何勇军,付茂国,孙广路.语音特征增强方法综述[J].哈尔滨理工大学学报,2014,19(2):19-25. 被引量：3
6黄宏伟,谢正光,蒋小燕,蔡旭.基于压缩感知的双向阈值匹配追踪算法[J].电视技术,2015,39(10):5-10.
7吕勇,吴镇扬.基于矢量泰勒级数的鲁棒语音识别[J].天津大学学报,2011,44(3):261-265. 被引量：4
8熊伟,水仲飞.论嵌入式语音识别系统的研究与实现[J].现代商贸工业,2010,22(2):291-292. 被引量：2
9雷建军,杨震,刘刚,郭军.噪声鲁棒语音识别研究综述[J].计算机应用研究,2009,26(4):1210-1216. 被引量：16
10李曙,毛承敏,杨喜,尹里,梁俊.基于子空间追踪的稀疏信号重构算法[J].吉首大学学报（自然科学版）,2015,36(4):23-25. 被引量：1

东南大学学报（自然科学版）

2009年第5期

浏览历史

内容加载中请稍等...

基于隐马尔可夫模型与并行模型组合的特征补偿算法被引量：4

参考文献11

二级参考文献25

共引文献11

同被引文献54

引证文献4

二级引证文献24

相关作者

相关机构

相关主题

浏览历史

基于隐马尔可夫模型与并行模型组合的特征补偿算法 被引量：4

参考文献11

二级参考文献25

共引文献11

同被引文献54

引证文献4

二级引证文献24

相关作者

相关机构

相关主题

浏览历史

基于隐马尔可夫模型与并行模型组合的特征补偿算法被引量：4