期刊文献+

基于新型小波滤波器的语音识别特征提取方法

Feature Extraction Method Based on New Wavelet Filter in Speech Recognition
在线阅读 下载PDF
导出
摘要 介绍一种基于新型小波听觉滤波器组的语音识别特征提取方法。按照人耳听觉临界频带带宽设计一组新型小波带通滤波器组,并详细计算给出构建新型小波滤波器所需要的尺度参数。采用SDA9000串行信号分析仪进行频谱分析,使用型号为MIC3000 Compact PCI Industrial Computer的LSP设备进行FPGA硬件仿真,使用协同神经网络进行模式识别,建立基于Matlab GUI的仿真界面,与高斯小波滤波器组模型所得仿真结果进行对比,从功率谱图和识别结果上进行分析,证明新型小波滤波器组具有更优的识别率和抗噪性。 This paper introduces a feature extraction method based on a new wavelet filter. At first, the new wavelet' s theory is introduced. Then, the new wavelet filter is designed according to the concept of human critical frequency band, and the scale parameter which the new wavelet filter need is given. The SDA9000 is used for spectral analysis, the LSP is applied for FPGA hardware simulation. The SNN (Synergetic Neural Networks) is used in train and recognition, and the Gauss wavelet filter is used to compare with the new wavelet filter. The characteristics of numerical and application for the methods are illustrated by using PC simulation of Maflab GUI. After the analysis of the spectrogram and the recognition result, it is found that the new wavelet filter has higher recognition rate and better robustness than traditional feature.
出处 《计算机与现代化》 2010年第3期111-114,117,共5页 Computer and Modernization
基金 成都信息工程学院科研基金资助项目(CRF200826)
关键词 语音识别 听觉模型 听觉滤波器 临界频带 小波滤波器 speech recognition auditory model auditory filter critical bands wavelet filter
  • 相关文献

参考文献10

  • 1Alfredo Mantilla Caeiros, Mariko Nakano Miyatake, Hector Perez Meana. A new wavelet function for audio and speech processing[ C ]//Proceeding of 50th IEEE INT'L Midwest Symposium on Circuits & Systems. Canada,Agosto,2007: 101-104.
  • 2Zwicker E. Subdivision of the audible frequency range into critical bands [ J ]. Journal of Acoustical Society of America, 1961,33 (2) :248.
  • 3Zhang X, Heinz M G, Bruce I C, et al. A phenomenological model for the responses of auditory-nerve fibers:Ⅰ. Nonlinear tuning with compression and suppression[ J]. Journal of Acoustical Society of America,2001,109(2) :648-4570.
  • 4De Boer E, De Jough H R. On cochlear encoding:Potentialities and limitations of the reverse-correlation technique[ J]. Journal of Acoustical Society of America, 1978,63 ( 1 ) : 115- 135.
  • 5陈世雄,宫琴.常见的听觉滤波器[J].北京生物医学工程,2008,27(1):94-99. 被引量:7
  • 6孙颖,张雪英.基于高斯小波滤波器的语音识别特征提取方法[J].太原理工大学学报,2007,38(2):146-149. 被引量:2
  • 7陈小平,胡泽.听觉临界频带及其在声频信号处理中的应用[J].北京广播学院学报(自然科学版),2004,11(2):28-35. 被引量:6
  • 8Yang Gui, Kwan H K. Adaptive subband wiener filtering for speech enhancement using critlcal-band gammatone filterbank[ J]. Circuits and Systems ,2005 ( 1 ) :732-735.
  • 9Ambikairajah E,Epps J,Lin L. Wideband speech and audio coding using gammatone filter banks [ C ]//Proc. ICASSP' 01. Salt Lake City,USA,2001:773-776.
  • 10Martin T.Hagam.神经网络设计[M].戴葵,译.北京:机械工业出版社,2002.

二级参考文献28

  • 1焦志平,张雪英,赵姝彦.一种基于听觉模型的抗噪语音识别特征提取方法[J].太原理工大学学报,2005,36(1):13-15. 被引量:8
  • 2[1]Zwicker E and Fastl H. Psychoacoustics-Facts and Models Second edition [ M ] . SpringerVerlag, 1990.
  • 3[2]Brian C J Moore. An Introduction to the Psychology of Hearing, Fifth edition [M].Academic Press, 2003.
  • 4[3]ISO 532B. Method for calculating the loudness of complex sound that has been analysed in terms of one-third octave bands[M]. 1975.
  • 5[4]Zwicker E. Subdivision of the audible frequency range into critical bands[J]. Journal of the Acoustical Society of America, Vol 33,pp248, 1961.
  • 6[5]Schroeder M R, Atal B S and Hall J L, Optimizing digital speech coders by exploiting masking properties of the human ear. [ J ]Journal of the Acoustical Society of America,vol. 66, pp1647-1652, Dec 1979.
  • 7[6]ISO/IEC 11172-3. Coding of moving pictures and associated audio for digital storage media at up to about 1. 5Mbit/s-Part 3:Audio. ISO/IEC JTC 1/SC 29, May 1993.
  • 8[7]Johnston J D, Transform coding of audio signals using perceptual noise criteria, IEEE J. on Sel. Areas in Com., vol. 6, pp314-323, Feb. 1988.
  • 9Doh-suk Kim, Soo-Young Lee, Rhee M Kil. Auditory Processing of Speech Signal for Robust Speech Recognition in Real-Word Noisy Environments[J]. IEEE Transactions On Speech And Audio Processing, 1999, 1 (7) : 55-68.
  • 10Oded Ghitza. Auditory Models and Human Performance in Tasks Related to Speech Coding and Speech Recognition[J].IEEE Transactions On Speech And Audio Processing, 1994, 1(2):113-131.

共引文献40

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部