期刊文献+

基于伽马通滤波器组的听觉特征提取算法研究 被引量:30

An Auditory Feature Extraction Algorithm Based on γ-Tone Filter-Banks
在线阅读 下载PDF
导出
摘要 本文从模拟人类听觉角度出发,给出了基于人耳耳蜗听觉模型的伽马通滤波器组模型,测试语音通过该滤波器组输出得到了高维听觉特征向量.经过主成分分析和离散余弦变换,分别得到了可用于表征说话人的伽马通系数和伽马通滤波器倒谱系数及其衍生特征.实验证明,与传统梅尔倒谱特征相比,采用本文提出特征的说话人识别系统在识别率及鲁棒性上均有明显提高. By means of emulating human auditory,gamma-Tone filter-banks models based on the auditory system in human cochlea are presented.The speech to be detected goes through the gamma-Tone filter-banks,thereby multi-dimension eigenvectors are obtained.By PCA(principal component analysis)and DCT(discrete cosine transform),it is yielded to represent a speaker's gamma-Tone coefficients,gamma-Tone filter-banks cepstral coefficients respectively and their derivative features as well.Compared to the ordinary Mel-frequency cepstral coefficients,the speaker recognition system presented turns out to have better recognition rate and robustness characteristics.
出处 《电子学报》 EI CAS CSCD 北大核心 2010年第3期525-528,共4页 Acta Electronica Sinica
关键词 语音信号处理 伽马通滤波器 听觉特征提取 倒谱系数 speech signal processing gammatone filter auditory feature extraction cepstral coefficients
  • 相关文献

参考文献11

  • 1S Furui. Digital Speech Processing, Synthesis, and Recognition [ M]. New York: Marcel Dekker, 2001.
  • 2H Gish, M Schmidt. Text-independent speaker identification [ J]. IEEE Signal Proc, 1994,11 (4): 18 - 32.
  • 3D A Reynolds, et al. The SuperSID project: Exploiting high- level information for high-accuracy speaker recognition [ A ]. International Conference on Acoustics, Speech, and Signal Processing[ C]. Hong Kong, China: IEEE, 2003.4:784 - 787.
  • 4A Drygajlo,M El-Maliki. Speaker verification in noisy environments with combined spectral subtraction and missing feature theory [ A ]. IEEE International Conference on Acoustics, Speech, and Signal Processing[ C]. Seattle, USA: IEEE, 1998. 1 : 121 - 124.
  • 5SHAO Y, WANG D L. Robust speaker recognition using binary time-frequency masks [ A ]. IEEE International Conference on Acoustics,Speech,and Signal Processing[ C]. Toulouse: IEEE, 2006.1:645-648.
  • 6WNG L,KITAOKA N,NAKAGAWA S. Analysis of effect of compensation parameter estimation for CMN on speech/speaker recognition[ A]. 9th International Symposium on Signal Processing and Its Applications[ C]. Sharjah: IEEE, 2007.1 - 4.
  • 7陈雪勤,赵鹤鸣.基于听觉模型的汉语耳语音声调检测[J].电子学报,2009,37(4):864-867. 被引量:5
  • 8Z Wanfeng, Y Yingchun, W Zhaohui, S Lifeng. Experimental evaluation of a new speaker identification framework using PCA[ A]. IEEE. International Conference on Systems, Man and Cybernetics[C]. Washington, DC: IEEE., 2003.4147 - 4152.
  • 9WU Xihong. A Chinese Speech Database for Speaker Recognition[ EB/OL]. http://nlpr-web. ia. ac. cn/englisb_/irds/chinese / sinobiometrics- pdf/wuxihong.pdf, 2002.
  • 10D A Reynolds, R C Rose. Robust text-independent speaker identification using Gaussian mixture speaker models[ J].Proc IEEE. Trans Speech Audio Process, 1995,3 ( 1 ) : 72 - 83.

二级参考文献15

  • 1LIXueli,XUBoling.Tone features in whispered Chinese[J].Progress in Natural Science:Materials International,2005,15(3):285-288. 被引量:5
  • 2黄海,潘家强.基于Hilbert-Huang变换的基音周期提取方法[J].声学学报,2006,31(1):35-41. 被引量:11
  • 3罗亚飞,鲍长春.基于DCT分带谱熵与信号分解的高精度基音检测算法[J].电子学报,2007,35(1):13-22. 被引量:5
  • 4Morris R W. Enhancement and recognition of whispered speech [ D]. USA: Georgia Institute of Technology ,2002.
  • 5Ito T, Takeda K. Analysis and recognition of whispered speech [ J] .Speech Communication, 2005,45(2) : 139 - 152.
  • 6Meyer-eppler W. Realization of prosodic features in whispered speech [J]. Journal of Acoustical Society of America, 1957,29 (1) :104- 106.
  • 7Martin Kloster Jenson. Recognition of word tones in whispered speech[ J]. Word, 1958,14:187 - 196.
  • 8Man-gao. Tones in whispered Chinese: articulatory features and perceptual cues[ D ]. Thesis of Master, University of Victoria, Canada, 2002.
  • 9Sachs M B, et al. Rate-place and temporal-place representations of vowels in the auditory nerve and anterovenlral cochlear nucleus[ J]. Journal of Phonetics, 1988,16:37 - 53.
  • 10Patterson R. An efficient auditory filterbank based on the gammatone functions[R] .Annex B of the Svos Final Report: The auditory filter bank,APU Report No.2341,1988.

共引文献4

同被引文献187

引证文献30

二级引证文献192

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部