期刊文献+

基于聚类优化GMM提高说话人识别性能的研究 被引量:3

A Study on GMM Optimization with Clustering for Improving Speaker Recognition
在线阅读 下载PDF
导出
摘要 高斯混合模型(GMM)已广泛地应用于文本无关的说话人识别系统,该方法具有简单高效的特点。但如果GMM模型的高斯混合分量的数目比较多时,整个模型运算的复杂度会比较大。针对这个问题,提出将聚类算法和传统的高斯混合建模结合起来从而优化高斯混合模型,能够有效地提高说话人识别的速度。实验结果验证了这种算法的高效性。 Ganssian mixture model (GMM) has been widely used for text- independent speaker recognition. This method has simple and efficient character. However,if it has a large number of Gaussians in GMM, it leads to a large complexity of computation. To solve this problem, proposes a new method which combines classical GMM with clustering algorithm to optimize the GMM for reducing the complexity of computation. Experimental results demonstrated that our approach was quite efficient to reduce the complexity of computation.
出处 《计算机技术与发展》 2009年第4期35-37,40,共4页 Computer Technology and Development
基金 "985工程"二期"信息技术"创新平台资助项目(0000-X07204)
关键词 说话人识别 高斯混合模型 聚类算法 speaker recognition Gaussian mixture model clustering algorithm
  • 相关文献

参考文献5

  • 1Reynolds D A,Rose R C. Robust text- independent speaker identification using Gaussian mixture speaker models[J ]. IEEE Trans. Speech Audio Processing, 1995,3 (1) : 72 - 83.
  • 2Reynolds D A. Speaker identification and verification using Gaussian mixture speaker models[ J ]. Speech Communication, 1995,17:91 - 108.
  • 3Doddington G R, Przybocki M A,Martin A F,et al. The NIST speaker recognition evaluation - overview, methodology,systems, results, perspective [ J ]. Speech Communication, 2000,31:225 - 254.
  • 4Yared G F G, Violaro F, Sousa L C. Gaussian elimination algorithm for HMM complexity reduction in continuous speech recognition systems [ C ]//Ninth European Conference on Speech Communication and Technology. Brazil: ISCA, 2005: 377 - 380.
  • 5Fisher W, Zue V, Bernstein J, et al. An acoustic-phonetic database[C]//JASA, suppl. A. [s. l. ] : Is. n. ] ,1986.

同被引文献37

  • 1何晓乾,陈雷霆,沈彬斌,房春兰.医学图像三维分割技术[J].计算机应用研究,2007,24(2):13-16. 被引量:16
  • 2Kenny P,Boulianne G,Ouellet P,et alJoint factor analysis versus eigenchannels in speaker recognition[J].IEEE Transactions on Audin,Speech and Language Processing,2007,vol.15,no.4:1435-1447.
  • 3Kuo-Hwei You,WANG Tai-wei.Combination of Autocorrelation-Based Features and Projection Measure Technique for Speaker IdentiGcation[J].IEEE Transactions on Speech and Audio Processing,2005,13(4):565-574.
  • 4W.M.Campbell,D.A.Reynolds.Support Vector Machines Using GMM Supervecters for Speaekr Verification[J].IEEE,2006,13(5):308-311.
  • 5C.Arun Kumar,B.Bharathi,T.Nagaranjan.A Discriminative GMM Technique using Product of Likelihood Gaussians[J].IEEE,2009,Page(s):1-6.
  • 6Chi-Sang Jung,Mo Young Kim,Hong-Goo kang.Selecting Feature Frames for Automatic Speaker Recognition Using Mutual Information[J].IEEE Transactions on Audio,Speech and Language Processing,August 2010,18(6):1332-1340.
  • 7Nagarajan T,Douglas 05haughnessy.Dicriminative MLE training using a product of Gaussian likelihood[J].in INTERSPEECH,2006.Pittsburgh,Pensylvania,USA,2006:601-604.
  • 8H.C.Peng,F.Long,and C.Ding,'Feature selection based on mutual information:Criteria of max-dependency,max-relevance,and min-redundancy,lEEEtrans.Pattern Analysis and Machine Intel] igen,2005(27):1226-1238.
  • 9Kim S,Yoon S W,Eriksson T,a1.A noise-robust pitch synchronous feature extraction algorithm for speaker recognition systems,in Proc.Eurospeech'05,2005.
  • 10Wu T,Compernolle D,Duchateau J,et al.Maximum likelihood based temporal frame selection[J].in Proc.Int.ConfAcoust.Speech Signal Process.,2006,pp.349-352.

引证文献3

二级引证文献10

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部