汉语语音识别的抗噪性前端算法及性能分析被引量：1

A Noise Robust Front End Algorithm for Mandarin Speech Recognition and Performance Analysis

下载PDF

导出

摘要讨论了欧洲电信标准委员会ETSI提出的分布式语音识别系统的抗噪前端特征提取算法,该算法融合多种抗噪技术。结合汉语语音的特点,进行了汉语语音识别整体框架下的算法实现,并进行了实验和分析,典型噪声环境下的识别结果证明,相对于基线MFCC特征提取算法,稳健性有较大提高。 Noise robustness of automatic speech recognition system is the hot topic during recent years.In this paper,the noise robust Front End algorithm proposed by ETSI for Distributed Speech Recognition System(DSR)which is a combination of several separate noise-robust techniques is discussed.Considering Mandarin speech char-acters,a feature extraction system based on this algorithm is realized and analyzed its performance through recog-nition experiments is analyzed.In some typical noise environments,we got much higher recognition rate compared with classical MFCC(Melscale Frequency Cepstrum Coeffcient )feature is obtained.[

作者林建臻孙甲松王作英

机构地区清华大学电子工程系

出处《电声技术》北大核心 2004年第3期45-48,52,共5页 Audio Engineering

基金犦国家863高技术项目(863-306-ZD03-02-1) 985重大项目(985校-22-攻关-06) 国家自然科学基金项目.

关键词汉语语音识别抗噪性性能分析抗噪前端特征提取算法 noise robustness DSR front end mandarin speech recognition

分类号 TN912.34 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献11

1王作英.基于段长分布的HMM语音识别模型[A]..第二届全国汉字语音识别会议[C].庐山,1989..
2Satoru Tsuge, Toshiaki Fukada, Harald Singer. Speaker Normalized Spectral Subband Parameters For Noise Robust Recognition. ICASSP 1999.
3Steven F. Boll. Suppression of Acoustic Noise in Speech Using Spectral Subtraction. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1979.
4Agarwal A., Cheng Y.M.. Two-stage Mel-Warped Wiener Filter for Robust Speech Recognition. Proc.ASRU'99,1999.
5Hynek Hermansky. Exploring Temporal Domain for Robustness in Speech Recognition. Invited paper, the 15th International Congress on Acoustics, 1995.
6Agarwal A., Cheng Y. M.. Two-stage Mel-Warped Wiener Filter for Robust Speech Recognition. Proceeding of IEEE workshop on Automatic Speech Recognition and Understanding, Keystone, 1999.
7Li Deng, Jasha Droppo, Alex Acero. A Bayesian Approach to Speech Feature Enhancement Using the Dynamic Cepstral Prior. ICASSP 2002.
8Qi Li,Frank K.Soong, Olivier Siohan. An Auditory system-based feature for robust speech recognition. Eurospeech 2001.
9Xiaodong Cui, Markus Iseli, Qifeng Zhu,Abeer Alwan. Evaluation of Noise Robust Features on the Aurora Databases. ICSLP 2002.
10Ramalingam Hariharan, Imre Kiss, Olli Viikki. Noise Robust Speech Parameterization Using Multi-resolution Feature Extraction.IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001-09.

共引文献3

1任纪生,王作英.一种新的潜在语义分析语言模型[J].高技术通讯,2005,15(8):1-5. 被引量：3
2李春,王作英.汉语连续语音识别中一种新的音节间相关识别单元[J].声学学报,2003,28(2):187-191. 被引量：3
3肖熙,王侠,王作英.基于Dialogic语音卡实时数据采集的电话语音识别系统[J].计算机工程与应用,2003,39(17):110-114. 被引量：3

同被引文献5

1孙甲松,王作英,吴及.汉语语音识别中的有调拼音多候选问题[C]//第六届全国人际语音通讯学术会议论文集.深圳:中国计算机学会人工智能与模式识别专业委员会等6专业委员会,2001:243-247.
2JELINEK F.Self-organized language modeling for speech recognition[M]. San Mateo: Readings in Speech Recognition, Morgan-Kaufmann, 1990 : 450-506.
3BELLMAN R E. Dynamic programming[M].Princeton : Princeton University Press, 1957.
4GUODONG Z, KIMTENG L. Interpolation of n-gram and mutual-information based trigger pair language models for Mandarin speech recognition[J]. Computer Speech and Language, 1999,13(2) : 125-141.
5王作英,肖熙.基于段长分布的HMM语音识别模型[J].电子学报,2004,32(1):46-49. 被引量：42

引证文献1

1金玮,孙甲松.汉语语音识别中语言模型的并行优化[J].电声技术,2010,34(8):49-52. 被引量：1

二级引证文献1

1万济萍,刘子菡,王玥,刘婉姬,张清涛,辛杰.基于语音识别技术口语自动评测的专利分析[J].电声技术,2012,36(S1):53-56. 被引量：1

1王文为,蒋保臣.语音识别系统的抗噪前端处理及性能分析[J].计算机应用,2007,27(B06):138-139.
2鲁五一,吴德华,谢志明,刘建.基于听觉掩蔽效应的改进MFCC特征提取算法[J].信息化研究,2009,35(9):16-18. 被引量：3
3梁钊.分布式语音识别系统及其相关技术[J].计算机工程与应用,2002,38(12):57-58. 被引量：1
4董俊,杨震.基于局域网上的分布式语音识别系统[J].南京邮电学院学报（自然科学版）,2003,23(1):18-22.
5普次仁,顿珠次仁.基于LDA-MFCC的藏语语音特征提取技术研究[J].西藏大学学报（社会科学版）,2014,29(2):44-47. 被引量：1
6叶蕾,方鹏.分布式语音识别参数提取的改进算法及实现[J].福建电脑,2007,23(5):91-91.
7张宇,刘坚强.基于SFA的改进MFCC特征提取算法[J].电声技术,2015,39(5):66-70. 被引量：1
8孟建庭,吴及,王作英.分布式语音识别系统的架构分析和具体实现[J].电声技术,2004,28(8):51-53.

电声技术

2004年第3期

浏览历史

内容加载中请稍等...

汉语语音识别的抗噪性前端算法及性能分析被引量：1

参考文献11

共引文献3

同被引文献5

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

汉语语音识别的抗噪性前端算法及性能分析 被引量：1

参考文献11

共引文献3

同被引文献5

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

汉语语音识别的抗噪性前端算法及性能分析被引量：1