期刊文献+

基于GMM模型的自适应说话人识别研究 被引量:2

Research on Adaptive Speaker Recognition Based on GMM
在线阅读 下载PDF
导出
摘要 为了提高说话人识别的性能,提出一种基于GMM模型自适应说话人识别方法。该方法能自动根据不同的说话人选取不同时长的语音进行识别,从提取语音特征和计算识别概率两方面减少识别时间,在不降低识别率的前提下,比传统识别方法识别速度有大幅度提高。实验仿真表明,在保持正确识别率97%以上的情况下,总识别速度可提高4倍左右。该方法特别适合基于GMM的大集合说话人识别。 With the purpose of improving the performance of speaker recognition,an adaptive speaker recognition method based on GMM is proposed.It can automatically select different length of speech for different speakers so as to reduce the recognition time through two aspects: speaker acoustic features calculation and recognition probability estimation.So it can remarkably improve the recognition speed than customary methods while keeping the correct recognition ratio.Experiments show that the recognition speed is increased about 4 times while keeping the recognition ratio at the level of 97%.This novel method is very fit for large muster of speaker recognition based on GMM.
出处 《计算机与现代化》 2013年第7期91-93,共3页 Computer and Modernization
基金 江苏省自然科学基金资助项目(BK2009059) 解放军理工大学预研基金资助项目(2009TX08)
关键词 说话人识别 高斯混合模型 线性预测系数 自适应 speaker recognition Gaussian mixture model(GMM) linear prediction coefficient(LPC) adaptation
  • 相关文献

参考文献10

  • 1周翠梅,陈喆. 基于高斯混合模型的说话人识别技术[C]// 2010年通信理论与信号处理学术年会论文集. 中国,大连, 2010:469-474..
  • 2赵力.语音信号处理[M].北京:机械工业出版社,2002.
  • 3刘幺和,宋庭新.语音识别与控制应用技术[M].北京:科学出版社,2008.
  • 4Zeljkovic Ilija, Haffner Patrick, Amento Brian, et al. GMM/SVM N-best speaker identification under mismatch channel conditions[C]// IEEE International Conference on Acoustics, Speech and Signal Processings, 2008. Las Vegas, NV, 2008:4129-4132..
  • 5王金明,张雄伟.话者识别系统中语音特征参数的研究与仿真[J].系统仿真学报,2003,15(9):1276-1278. 被引量:17
  • 6杨大利,徐明星,吴文虎.语音识别特征参数选择方法研究[J].计算机研究与发展,2003,40(7):963-969. 被引量:21
  • 7Ahmad Al Marashli, Oumayma Al Dakkak. Automatic, text-independent, speaker identification and verification system using Mel cepstrum and GMM[C]// The 3rd International Conference on Information and Communication Technologies: From Theory to Applications, 2008. 2008:801-806..
  • 8Pelecanos J, Povey D, Ramaswamy G. Secondary classification for GMM based speaker recognition[C]// 2006 IEEE International Conference on Acoustics, Speech, and Signal Processings. 2006:109-112..
  • 9Yih-Ru Wang, Chen-Yu Chiang. A new common component GMM-based speaker recognition method[C]// 2005 IEEE International Conference on Acoustics, Speech, and Signal Processings. 2005:645-648..
  • 10赵恒,李冬梅,张玉宏.MATLAB环境下的基于GMM模型的说话人识别系统[J].微计算机信息,2007,23(31):261-263. 被引量:6

二级参考文献17

  • 1赵云鹏.MATLAB串口通信在数据采集中的应用[J].微计算机信息,2006,22(01S):111-112. 被引量:25
  • 2陈魁.实验设计与分析[M].北京:清华大学出版社,1996,8.94.
  • 3O Viikki, K Laurila. Cepstral domain segmental feature vector normalization for noise robust speech recognition. Speech Communication, 1998, 25(1): 133--147.
  • 4Yang Dali, Xu Mingxing, Wu Wenhu. A novel feature selection method in speech recognition. Int' 1 Conf on Chinese Computing,Singapore, 2001.
  • 5K Paliwal. Study of line spectrum pair frequencies for vowel recognition. Speech Communication, 1989, 8(1): 27--33.
  • 6Hermansky, Hykek, Morgan Nelson. RASTA processing of speech. IEEE Trans on Speech and Audio Processing, 1994, 2(4) : 578--589.
  • 7C Emmanouilidis, A Hunter. Multiobjective evolutionary setting for feature selection and a commonality-based crossover operator.In: Proc of the IEEE Conf on Evolutionary Computation.Piscataway: Institute of Electrical and Electronic Engineers Inc,2000. 309--316.
  • 8Sambur M R. Selection of Acoustic Features for Speaker Identification [C]. IEEE Trans On ASSP, 1975: 176-182.
  • 9Rabineer L R, Juang B H. Fundamentals of Speech Processing and Recognition[M]. Prentice-HalL 1993.
  • 10Junqua J C, Wakital H, Hermansky H. Evaluation and Optimization of perceptualyy-based ASR front-end[j]. IEEE Tran. ASSP-1, 1993, (3):39-48.

共引文献73

同被引文献31

  • 1刘敬伟,徐美芝,郑忠国,程乾生.基于DTW的语音识别和说话人识别的特征选择[J].模式识别与人工智能,2005,18(1):50-54. 被引量:13
  • 2于明,袁玉倩,董浩,王哲.一种基于MFCC和LPCC的文本相关说话人识别方法[J].计算机应用,2006,26(4):883-885. 被引量:14
  • 3Reynolds D A, Quatier T F, Dram R B. Speaker verifica- tion using adapted Gaussian mixture models [ J ]. Digital Singal Processing , 2000,10 : 19-24.
  • 4Reynolds D A, Campbell W, Gleason T T. The 2004 MIT Lincoln laboratory speaker recognition system [ A ]. In Pro- cessdings of ICASSP. Philadel Pbia. USA: [ s. n. ] ,2008.
  • 5Reynolds D A, Rose R. Robust text-independent speaker i- dentification using Gaussian mixture speaker models [ J ]. IEEE Trans on Speech and Audio Processing, 1995, 3 ( 1 ) : 72-83.
  • 6Frey B, Dueck D. Clustering by passing messages between data points[J]. Science, 2007, 315(5184) :972-976.
  • 7Zhong Y C, Hua X. Study on speech control of turning movements of the multifunctional nursing bed [ J ]. Ad- vances in Intelligent and Soft Computing, 2012 ( 1 ) : 67- 72.
  • 8Agrawal U K, Chandra M, Badgaiyan C. Fractional fou- rier transform combination with MFCC based speaker iden- tification in clean environment[ J]. International Journal of Advanced Science, Engineering and Technology, 2012, 1 ( 1 ) :26-28.
  • 9Yuan Y J, Zhao P H, Zhou Q. Research of speaker rec- ognition based on combination of LPCC and MFCC [ C ]// Proc of IEEE International Conference on IntelLigent Com- puting and Intelligent Systems. [ S. 1. ] : IEEE Press, 2010 : 765-767.
  • 10蒋晔,唐振民.GMM文本无关的说话人识别系统研究[J].计算机工程与应用,2010,46(11):179-182. 被引量:27

引证文献2

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部