
Research on a Speaker Recognition Method Combining Principal Component Analysis and the Fisher Criterion (Cited by: 3)

Research of Speaker Recognition Based on PCA
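Abstract: This paper proposes a new Mel-frequency-domain feature parameter based on principal component analysis and the Fisher criterion. It is obtained by performing principal component analysis on the Mel-domain spectrum and then selecting feature components by the size of their Fisher ratio, following the Fisher criterion. It makes full use of the statistical correlation among frequency bands and distinguishes speakers more compactly and effectively. Compared with the traditional approach of selecting features by their corresponding eigenvalues, the resulting feature vector has the greatest class separability at the same dimensionality. Finally, we implemented a text-independent automatic speaker recognition system whose back end uses vector quantization for cluster analysis. Experiments on a speech corpus show that the proposed feature vector achieves a higher recognition rate than conventional feature vectors of the same dimension, confirming that it is compact, discriminative, and low in redundancy.
Abstract (published English version): A new feature vector for speaker recognition, the Mel Frequency Principal Coefficient (MFPC), is proposed. It is derived by applying Principal Component Analysis to the Mel-scale spectrum vector. MFPC efficiently exploits the correlation among different frequency channels, which is mainly caused by vocal tract resonance. This correlation has been found to vary consistently from one speaker to another. Feature coefficients are chosen according to their Fisher ratio. Compared with conventional frequency cepstrum coefficients, the proposed feature vector gives a greater distance between classes at the same dimensionality. A text-independent speaker recognition system has been implemented, using Vector Quantization to design the codebook of each reference speaker. Experimental results demonstrate that the proposed feature vector is compact, discriminative, and low in redundancy.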
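Source: Journal of Circuits and Systems (《电路与系统学报》), CSCD, 2002, No. 1, pp. 116-119 (4 pages)
Funding: National Natural Science Foundation of China (Grant No. 39870194)
Keywords: principal component analysis; Fisher criterion; speaker recognition; speech recognition; Mel Frequency Principal Coefficient (MFPC); Principal Component Analysis (PCA); Vector Quantization (VQ); Fisher ratio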
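
The feature selection described in the abstract (PCA on Mel-spectrum vectors, then keeping the components with the largest Fisher ratio rather than the largest eigenvalue) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names are hypothetical, the input is assumed to be a matrix of per-frame Mel-spectrum vectors with one speaker label per frame, and the Fisher ratio is assumed to take the common per-dimension form (variance of the speaker means divided by the average within-speaker variance), which the abstract does not spell out.

```python
import numpy as np


def pca_basis(X):
    """PCA of frame vectors X (rows = frames, columns = Mel channels).

    Returns the mean vector, the eigenvectors as columns, and the
    eigenvalues, both in ascending eigenvalue order (np.linalg.eigh).
    """
    mean = X.mean(axis=0)
    cov = np.cov(X - mean, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)
    return mean, eigvecs, eigvals


def fisher_ratio(Y, labels, eps=1e-12):
    """Per-dimension Fisher ratio (assumed definition, see lead-in).

    Numerator: variance of the per-speaker means.
    Denominator: within-speaker variance averaged over speakers.
    """
    labels = np.asarray(labels)
    speakers = np.unique(labels)
    means = np.array([Y[labels == s].mean(axis=0) for s in speakers])
    within = np.array([Y[labels == s].var(axis=0) for s in speakers])
    return means.var(axis=0) / (within.mean(axis=0) + eps)


def select_by_fisher(X, labels, k):
    """Project X onto all principal axes, keep the k axes with the
    largest Fisher ratio. Returns (mean, projection matrix, ratios)."""
    mean, eigvecs, _ = pca_basis(X)
    Y = (X - mean) @ eigvecs               # principal coefficients
    ratios = fisher_ratio(Y, labels)
    keep = np.argsort(ratios)[::-1][:k]    # indices of the k best axes
    return mean, eigvecs[:, keep], ratios[keep]
```

A new frame `x` would then be mapped to its reduced feature vector with `(x - mean) @ W`, where `mean` and `W` are the first two return values of `select_by_fisher`. Ranking by Fisher ratio instead of eigenvalue matters when a high-variance direction carries little speaker information, which is the contrast the abstract draws with eigenvalue-based selection.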

References (1)

  • 1 Bian Zhaoqi. Pattern Recognition [M]. Tsinghua University Press, 1999.

