基于新型小波滤波器的语音识别特征提取方法

Feature Extraction Method Based on New Wavelet Filter in Speech Recognition

下载PDF

导出

摘要介绍一种基于新型小波听觉滤波器组的语音识别特征提取方法。按照人耳听觉临界频带带宽设计一组新型小波带通滤波器组,并详细计算给出构建新型小波滤波器所需要的尺度参数。采用SDA9000串行信号分析仪进行频谱分析,使用型号为MIC3000 Compact PCI Industrial Computer的LSP设备进行FPGA硬件仿真,使用协同神经网络进行模式识别,建立基于Matlab GUI的仿真界面,与高斯小波滤波器组模型所得仿真结果进行对比,从功率谱图和识别结果上进行分析,证明新型小波滤波器组具有更优的识别率和抗噪性。 This paper introduces a feature extraction method based on a new wavelet filter. At first, the new wavelet＇ s theory is introduced. Then, the new wavelet filter is designed according to the concept of human critical frequency band, and the scale parameter which the new wavelet filter need is given. The SDA9000 is used for spectral analysis, the LSP is applied for FPGA hardware simulation. The SNN （Synergetic Neural Networks） is used in train and recognition, and the Gauss wavelet filter is used to compare with the new wavelet filter. The characteristics of numerical and application for the methods are illustrated by using PC simulation of Maflab GUI. After the analysis of the spectrogram and the recognition result, it is found that the new wavelet filter has higher recognition rate and better robustness than traditional feature.

作者柏懋睿郑郁正张杰

机构地区成都信息工程学院

出处《计算机与现代化》 2010年第3期111-114,117,共5页 Computer and Modernization

基金成都信息工程学院科研基金资助项目(CRF200826)

关键词语音识别听觉模型听觉滤波器临界频带小波滤波器 speech recognition auditory model auditory filter critical bands wavelet filter

分类号 TP391.42 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献10

1Alfredo Mantilla Caeiros, Mariko Nakano Miyatake, Hector Perez Meana. A new wavelet function for audio and speech processing[ C ]//Proceeding of 50th IEEE INT'L Midwest Symposium on Circuits & Systems. Canada,Agosto,2007: 101-104.
2Zwicker E. Subdivision of the audible frequency range into critical bands [ J ]. Journal of Acoustical Society of America, 1961,33 (2) :248.
3Zhang X, Heinz M G, Bruce I C, et al. A phenomenological model for the responses of auditory-nerve fibers:Ⅰ. Nonlinear tuning with compression and suppression[ J]. Journal of Acoustical Society of America,2001,109(2) :648-4570.
4De Boer E, De Jough H R. On cochlear encoding:Potentialities and limitations of the reverse-correlation technique[ J]. Journal of Acoustical Society of America, 1978,63 ( 1 ) : 115- 135.
5陈世雄,宫琴.常见的听觉滤波器[J].北京生物医学工程,2008,27(1):94-99. 被引量：7
6孙颖,张雪英.基于高斯小波滤波器的语音识别特征提取方法[J].太原理工大学学报,2007,38(2):146-149. 被引量：2
7陈小平,胡泽.听觉临界频带及其在声频信号处理中的应用[J].北京广播学院学报（自然科学版）,2004,11(2):28-35. 被引量：6
8Yang Gui, Kwan H K. Adaptive subband wiener filtering for speech enhancement using critlcal-band gammatone filterbank[ J]. Circuits and Systems ,2005 ( 1 ) :732-735.
9Ambikairajah E,Epps J,Lin L. Wideband speech and audio coding using gammatone filter banks [ C ]//Proc. ICASSP' 01. Salt Lake City,USA,2001:773-776.
10Martin T.Hagam.神经网络设计[M].戴葵,译.北京:机械工业出版社,2002.

二级参考文献28

1焦志平,张雪英,赵姝彦.一种基于听觉模型的抗噪语音识别特征提取方法[J].太原理工大学学报,2005,36(1):13-15. 被引量：8
2[1]Zwicker E and Fastl H. Psychoacoustics-Facts and Models Second edition [ M ] . SpringerVerlag, 1990.
3[2]Brian C J Moore. An Introduction to the Psychology of Hearing, Fifth edition [M].Academic Press, 2003.
4[3]ISO 532B. Method for calculating the loudness of complex sound that has been analysed in terms of one-third octave bands[M]. 1975.
5[4]Zwicker E. Subdivision of the audible frequency range into critical bands[J]. Journal of the Acoustical Society of America, Vol 33,pp248, 1961.
6[5]Schroeder M R, Atal B S and Hall J L, Optimizing digital speech coders by exploiting masking properties of the human ear. [ J ]Journal of the Acoustical Society of America,vol. 66, pp1647-1652, Dec 1979.
7[6]ISO/IEC 11172-3. Coding of moving pictures and associated audio for digital storage media at up to about 1. 5Mbit/s-Part 3:Audio. ISO/IEC JTC 1/SC 29, May 1993.
8[7]Johnston J D, Transform coding of audio signals using perceptual noise criteria, IEEE J. on Sel. Areas in Com., vol. 6, pp314-323, Feb. 1988.
9Doh-suk Kim, Soo-Young Lee, Rhee M Kil. Auditory Processing of Speech Signal for Robust Speech Recognition in Real-Word Noisy Environments[J]. IEEE Transactions On Speech And Audio Processing, 1999, 1 (7) : 55-68.
10Oded Ghitza. Auditory Models and Human Performance in Tasks Related to Speech Coding and Speech Recognition[J].IEEE Transactions On Speech And Audio Processing, 1994, 1(2):113-131.

共引文献40

1汪洋,孙进才,陈克安,付莉莉.基于心理声学参数的水下目标识别特征提取方法[J].数据采集与处理,2006,21(3):313-317. 被引量：4
2沈静,阮若林.音频信号的感知编码技术研究[J].咸宁学院学报,2007,27(3):75-77. 被引量：3
3杨宏韬,张德江,李秀兰,王秀英.遗传神经网络能耗预测模型在钢铁企业中的应用[J].长春工业大学学报,2007,28(B07):186-189. 被引量：14
4熊光华,夏庆观.基于自组织特征映射神经网络的零件图像的识别[J].中国制造业信息化（学术版）,2007,36(12):62-65.
5王旭楠,陈圣波,宁亚灵,荣冲,汪自军.基于ASTER数据的石头口门水库水质参数定量遥感反演[J].世界地质,2008,27(1):105-109. 被引量：7
6许增福,吴贵生,王宏伟,柳明旺.基于径向基神经网络的经济预测方法[J].经济师,2008(5):65-66. 被引量：3
7王祥.基于神经网络的余度舵机系统故障诊断[J].机电一体化,2008,14(5):90-92.
8余士品,刘续兴,王吉成,刘端红,金友林.BP神经网络在电液振动台控制中的应用[J].液压与气动,2008,32(7):53-55. 被引量：3
9赖明勇,罗卓娃,吴敬兵.国家社保基金收入预测的PCA&BP模型[J].湖南大学学报（社会科学版）,2008,22(4):52-56. 被引量：3
10华祖林,钱蔚,顾莉.改进型LM-BP神经网络在水质评价中的应用[J].水资源保护,2008,24(4):22-25. 被引量：13

1张婷,何凌,黄华,刘肖珩.基于临界频带及能量熵的语音端点检测[J].计算机应用,2013,33(1):175-178. 被引量：9
2黄力.基于听觉模型的自适应水印算法设计[J].广西民族大学学报（自然科学版）,2012,18(2):41-44.
3陈乃塘.PCI Express市场趋势眺望[J].电子测试（新电子）,2006(5):60-62.
4李鉴,李杰.基于临界小波参数和新序列核支持向量机的说话人识别[J].信阳师范学院学报（自然科学版）,2012,25(3):398-401. 被引量：1
5AIC机架解决方案内含LSI的SATA300—8X适配器[J].国际电子变压器,2005(6):112-112.
6朱坚民,张雷,翟东婷,雷静桃.基于声音多特征贝叶斯网络融合的话者识别研究[J].仪器仪表学报,2013,34(9):2058-2067. 被引量：14
7陶智,赵鹤鸣,顾济华,吴迪.基于心理声学模型和临界频带子波变换的数字声频水印[J].声学学报,2006,31(2):114-119. 被引量：15
8贾宝军,周巍,何进.桌面云系统中的带宽及网络需求研究[J].互联网天地,2013(3):16-19. 被引量：7
9李峰,陈志坚,蔡碧野.一种基于三进制小波变换的纹理分割方法[J].计算机应用研究,2004,21(2):248-249. 被引量：4
10汤云剑,朱尚明,金毅.校园网建设和多出口带宽设计[J].化工高等教育,2009,26(1):80-83. 被引量：5

计算机与现代化

2010年第3期

浏览历史

内容加载中请稍等...

基于新型小波滤波器的语音识别特征提取方法

参考文献10

二级参考文献28

共引文献40

相关作者

相关机构

相关主题

浏览历史