期刊文献+

基于WCCN和余弦评分的话者确认研究

Within-class covariance normalization and cos-score for speaker verification
在线阅读 下载PDF
导出
摘要 本征音话者识别方法能够在一定程度上补偿因文本无关造成的语音类失配,但它并没有涉及另一个重要的失配因素——信道失配.本文提出了一种在本征音方法基础上补偿信道失配的方案.首先用本征音方法进行语音类失配补偿,然后采用WCCN(类内方差规整)进行信道失配补偿,从而得到经过语音类失配补偿和信道失配补偿的话者因子并将其作为话者模型,最后采用余弦评分方法进行性能评测.实验表明,本文方法在等误识率和最小检测代价函数上具有较好表现,同时本文方法对话者建模所需要空间较小. The eigenvoice-based speaker verification method can compensate for voice mismatch in text- independent speaker verification applications, but it does not compensate channel mismatch, which also exerts a negative impact on the verification. Therefore channel mismatch compensation based on eigenvoice method was proposed. First, eigenvoice was adopted to compensate voice mismatch, then WCCN was applied to compensate channel mismatch. After these compensations, the speaker factor was computed and acted as speaker model. Based on the speaker factor model, Cos-score calculation was conveniently used to test verification operation. The experiment results show better performance, with an improvement by 22.85% at EER and 31.22% at MinDCF, while compared with GMM-UBM-SVM, an improvement was achieved by 9. 14% at MinDCF. Meanwhile, the new method needs less storage space, which benefits practical applications.
出处 《中国科学技术大学学报》 CAS CSCD 北大核心 2012年第10期813-819,共7页 JUSTC
关键词 话者确认 失配补偿 话者因子模型 类内方差规整 余弦评分 speaker verification mismatch compensation speaker-vector model WCCN cos-score
  • 相关文献

参考文献12

  • 1Reynolds D A,Quatieri T F,Dunn R B. Speakerverification using adapted Gaussian mixture models[J].Digital Signal Processing, 2000,10(1-3) : 19-41.
  • 2潘镭,郭武,李轶杰,戴礼荣.基于本征音因子分析的短时说话人识别[J].数据采集与处理,2009,24(4):449-452. 被引量:3
  • 3鲍焕军,郑方.GMM-UBM和SVM说话人辨认系统及融合的分析[J].清华大学学报(自然科学版),2008,48(S1):693-698. 被引量:10
  • 4Kenny P? Mihoubi M,Dumouchel P. New MapEstimators for Speaker Recognition [ C ]//ProcEurospeech-2003, 2003 : 2 961-2 964.
  • 5Kenny P. Joint Factor analysis of speaker and sessionvariability: Theory and algorithms [EB/OL].http://www. crim. ca/perso/Patrick, kenny,2006.
  • 6Kenny P,Ouellet P,Dehak N. A study of inter-speaker variability in speaker verification [J].IEEETransaction on Audio Speech and LanguageProcessing, 2008,16(5) : 980-988.
  • 7Stefanos Z,Anastasioa T, Ioannis P. Minimum classvariance support vector machines [J].IEEE Transactionson Image Processing,2007,16(10) : 2 551-2 564.
  • 8Gauvain J L,Lee C H. Maximum a posteriorestimation for multivariate Gaussian mixtureobservations of Markov chains [ J ].IEEE Trans.Speech and audio processing, 1994,2(2) : 291-298.
  • 9Jeff A B. A Gentle Tutorial of the EM algorithm andits application to parameter estimation for Gaussianmixture and hidden Markov models [EB/OL].http://www. icsi. berkeley. edu. /ftp/global/pub/techreports/1997/tr-07-021. pdf. 1998.
  • 10Andrew O H. Kernel optimization for support vectormachines : Application to speaker verification [ EB/OL].Technical Report No. UCB/EEC&*2006-187.http : / / www. eecs. berkerley. edu/Pubs/TechRpts/2006-187. pdf, 2006.

二级参考文献18

  • 1Kenny P,Dumouchel P.Experiments in speaker verificationusing factor analysis likelihood ratios[].Proc Odyssey.2004
  • 2Campbell W M,Sturimv D E,Reynolds D A.SVM basedspeaker verification suing a GMM supervector kernel andNAP variability compensation[].Signal ProcessingLetters.2006
  • 3Cristianini N,Shawe-Taylor J.Support Vector Machines[]..2000
  • 4Solomonoff A,Campbell W M,Boardman I.Advances inchannel compensation for SVM speaker recognition[].Proc ICASSP.2005
  • 5XIONG Zhengyu,ZHENG Fang,SONG Zhanjiang,et al.Combining selection tree with observation reordering pruningfor efficient speaker identification using GMM-UBM[].Proc ICASSP.2005
  • 6Wan V,Renals S.Support Vector machine speakerverification methodology[].AcousticsSpeech and SignalProcessing.2003
  • 7DENG Jing,ZHENG Fang,WU Wenhu.Session variabilitysubspace projection based model compensation for speakerverification[].AcousticsSpeech and Signal Processing.2007
  • 8Furui S.Cepstral analysis technique for automatic speakerverification[].IEEE Trans Acoust Speech SignalProcessing.1981
  • 9Viikki O,Laurila K.Noise robust HMM-based speechrecognition using segmental cepstral feature vectornormalization[].ESCA NATO Workshop on RobustSpeech Recognition for Unknown Communication Channels.1997
  • 10D. A. Reynolds,T. Quatieri,and R. Dunn.Speaker verification using adapted Gaussian mixture models[].Digital Signal Processing.2000

共引文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部