Within-class covariance normalization and cos-score for speaker verification
(Original Chinese title: 基于WCCN和余弦评分的话者确认研究)

Abstract: The eigenvoice-based speaker verification method can partly compensate for the voice mismatch that arises in text-independent speaker verification, but it does not address channel mismatch, another important factor that degrades verification performance. This paper proposes a scheme that adds channel mismatch compensation on top of the eigenvoice method. First, the eigenvoice method is used to compensate for voice mismatch; then WCCN (within-class covariance normalization) is applied to compensate for channel mismatch. The resulting speaker factor, compensated for both voice and channel mismatch, serves as the speaker model, and cosine scoring (cos-score) is used for verification. Experiments show improvements of 22.85% in equal error rate (EER) and 31.22% in minimum detection cost function (MinDCF); compared with GMM-UBM-SVM, MinDCF improves by 9.14%. The method also needs less storage space for speaker modeling, which benefits practical applications.
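The last two stages of the pipeline described in the abstract (WCCN followed by cosine scoring) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no formulas, so the code assumes the commonly used WCCN definition, in which the within-class covariance W is the average over training speakers of the covariance of each speaker's session vectors, and scores are cosines under the inner product x^T W^{-1} y. Speaker factors are assumed to have already been extracted by the eigenvoice stage, which is not shown. All names (`within_class_covariance`, `wccn_project`, the toy 2-D data) are hypothetical. The projection uses W = L L^T and y = L^{-1} x; this differs from the usual B^T x with W^{-1} = B B^T only by a rotation, so cosine scores are the same.

```python
import math

def within_class_covariance(factors_by_speaker):
    """Average over speakers of the covariance of each speaker's session vectors."""
    dim = len(next(iter(factors_by_speaker.values()))[0])
    W = [[0.0] * dim for _ in range(dim)]
    for sessions in factors_by_speaker.values():
        n = len(sessions)
        mean = [sum(v[i] for v in sessions) / n for i in range(dim)]
        for v in sessions:
            d = [v[i] - mean[i] for i in range(dim)]
            for i in range(dim):
                for j in range(dim):
                    W[i][j] += d[i] * d[j] / n
    count = len(factors_by_speaker)
    return [[W[i][j] / count for j in range(dim)] for i in range(dim)]

def cholesky_lower(A):
    """Lower-triangular L with A = L L^T (A symmetric positive definite)."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = A[i][j] - sum(L[i][k] * L[j][k] for k in range(j))
            L[i][j] = math.sqrt(s) if i == j else s / L[j][j]
    return L

def wccn_project(L, x):
    """Solve L y = x by forward substitution, so that y . y' = x^T W^{-1} x'."""
    y = []
    for i in range(len(x)):
        y.append((x[i] - sum(L[i][k] * y[k] for k in range(i))) / L[i][i])
    return y

def cosine_score(a, b):
    """Cosine of the angle between two vectors."""
    dot = sum(p * q for p, q in zip(a, b))
    norm_a = math.sqrt(sum(p * p for p in a))
    norm_b = math.sqrt(sum(q * q for q in b))
    return dot / (norm_a * norm_b)

# Toy 2-D "speaker factors" (invented data): dimension 0 varies with the
# channel, dimension 1 carries the speaker identity.
train = {
    "spk_a": [[-2.0, 1.0], [0.0, 1.1], [2.0, 0.9]],
    "spk_b": [[-2.0, -1.0], [0.0, -0.9], [2.0, -1.1]],
}
W = within_class_covariance(train)
L = cholesky_lower(W)

enrolled = [2.0, 1.0]       # speaker A, channel 1
target_test = [-2.0, 1.0]   # speaker A, channel 2
impostor_test = [2.0, -1.0] # speaker B, channel 1

raw_target = cosine_score(enrolled, target_test)
raw_impostor = cosine_score(enrolled, impostor_test)
wccn_target = cosine_score(wccn_project(L, enrolled), wccn_project(L, target_test))
wccn_impostor = cosine_score(wccn_project(L, enrolled), wccn_project(L, impostor_test))

print("raw cosine:  target", raw_target, "impostor", raw_impostor)
print("WCCN cosine: target", wccn_target, "impostor", wccn_impostor)
```

On this toy data the raw cosine score ranks the impostor trial above the target trial, because the channel dimension dominates; after WCCN the ordering is reversed. The speaker model stored per speaker is just the low-dimensional factor vector, which is consistent with the abstract's remark about small storage requirements.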

Authors: DING Congmin (丁聪敏), TANG Jian (唐建), GUO Li (郭立)

Affiliation: Department of Electronic Science and Technology, University of Science and Technology of China

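Source: Journal of University of Science and Technology of China (JUSTC), 2012, No. 10, pp. 813-819 (7 pages). Indexed in CAS, CSCD, and the Peking University Core Journals list.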

Keywords: speaker verification; mismatch compensation; speaker-vector model; WCCN; cos-score

Classification: TN912.3 [Electronics and Telecommunications: Communication and Information Systems]


