基于GMM模型的自适应说话人识别研究被引量：2

Research on Adaptive Speaker Recognition Based on GMM

下载PDF

导出

摘要为了提高说话人识别的性能,提出一种基于GMM模型自适应说话人识别方法。该方法能自动根据不同的说话人选取不同时长的语音进行识别,从提取语音特征和计算识别概率两方面减少识别时间,在不降低识别率的前提下,比传统识别方法识别速度有大幅度提高。实验仿真表明,在保持正确识别率97%以上的情况下,总识别速度可提高4倍左右。该方法特别适合基于GMM的大集合说话人识别。 With the purpose of improving the performance of speaker recognition,an adaptive speaker recognition method based on GMM is proposed.It can automatically select different length of speech for different speakers so as to reduce the recognition time through two aspects： speaker acoustic features calculation and recognition probability estimation.So it can remarkably improve the recognition speed than customary methods while keeping the correct recognition ratio.Experiments show that the recognition speed is increased about 4 times while keeping the recognition ratio at the level of 97%.This novel method is very fit for large muster of speaker recognition based on GMM.

作者陈觉之张贵荣周宇欢

机构地区海军指挥学院信息系中国人民解放军解放军理工大学指挥信息系统学院

出处《计算机与现代化》 2013年第7期91-93,共3页 Computer and Modernization

基金江苏省自然科学基金资助项目(BK2009059) 解放军理工大学预研基金资助项目(2009TX08)

关键词说话人识别高斯混合模型线性预测系数自适应 speaker recognition Gaussian mixture model（GMM） linear prediction coefficient（LPC） adaptation

分类号 TP391.42 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献10

1周翠梅,陈喆. 基于高斯混合模型的说话人识别技术[C]// 2010年通信理论与信号处理学术年会论文集. 中国,大连, 2010:469-474..
2赵力.语音信号处理[M].北京:机械工业出版社,2002.
3刘幺和,宋庭新.语音识别与控制应用技术[M].北京:科学出版社,2008.
4Zeljkovic Ilija, Haffner Patrick, Amento Brian, et al. GMM/SVM N-best speaker identification under mismatch channel conditions[C]// IEEE International Conference on Acoustics, Speech and Signal Processings, 2008. Las Vegas, NV, 2008:4129-4132..
5王金明,张雄伟.话者识别系统中语音特征参数的研究与仿真[J].系统仿真学报,2003,15(9):1276-1278. 被引量：17
6杨大利,徐明星,吴文虎.语音识别特征参数选择方法研究[J].计算机研究与发展,2003,40(7):963-969. 被引量：21
7Ahmad Al Marashli, Oumayma Al Dakkak. Automatic, text-independent, speaker identification and verification system using Mel cepstrum and GMM[C]// The 3rd International Conference on Information and Communication Technologies: From Theory to Applications, 2008. 2008:801-806..
8Pelecanos J, Povey D, Ramaswamy G. Secondary classification for GMM based speaker recognition[C]// 2006 IEEE International Conference on Acoustics, Speech, and Signal Processings. 2006:109-112..
9Yih-Ru Wang, Chen-Yu Chiang. A new common component GMM-based speaker recognition method[C]// 2005 IEEE International Conference on Acoustics, Speech, and Signal Processings. 2005:645-648..
10赵恒,李冬梅,张玉宏.MATLAB环境下的基于GMM模型的说话人识别系统[J].微计算机信息,2007,23(31):261-263. 被引量：6

二级参考文献17

1赵云鹏.MATLAB串口通信在数据采集中的应用[J].微计算机信息,2006,22(01S):111-112. 被引量：25
2陈魁.实验设计与分析[M].北京:清华大学出版社,1996,8.94.
3O Viikki, K Laurila. Cepstral domain segmental feature vector normalization for noise robust speech recognition. Speech Communication, 1998, 25(1): 133--147.
4Yang Dali, Xu Mingxing, Wu Wenhu. A novel feature selection method in speech recognition. Int' 1 Conf on Chinese Computing,Singapore, 2001.
5K Paliwal. Study of line spectrum pair frequencies for vowel recognition. Speech Communication, 1989, 8(1): 27--33.
6Hermansky, Hykek, Morgan Nelson. RASTA processing of speech. IEEE Trans on Speech and Audio Processing, 1994, 2(4) : 578--589.
7C Emmanouilidis, A Hunter. Multiobjective evolutionary setting for feature selection and a commonality-based crossover operator.In: Proc of the IEEE Conf on Evolutionary Computation.Piscataway: Institute of Electrical and Electronic Engineers Inc,2000. 309--316.
8Sambur M R. Selection of Acoustic Features for Speaker Identification [C]. IEEE Trans On ASSP, 1975: 176-182.
9Rabineer L R, Juang B H. Fundamentals of Speech Processing and Recognition[M]. Prentice-HalL 1993.
10Junqua J C, Wakital H, Hermansky H. Evaluation and Optimization of perceptualyy-based ASR front-end[j]. IEEE Tran. ASSP-1, 1993, (3):39-48.

共引文献73

1王光艳,赵晓群,王霞.基于MATLAB GUI的语音信号特征提取系统设计[J].河北工业大学学报,2010,39(4):14-18. 被引量：11
2李凡,吴军,黄刚.基于BPNN/HMM神经网络的声学模型研究[J].华中科技大学学报（自然科学版）,2004,32(9):9-11. 被引量：2
3于哲舟,杨佳东,蒲东兵,周春光,王纲巧.多门限声纹识别方法[J].吉林大学学报（信息科学版）,2005,23(2):216-220. 被引量：1
4刘雅琴,周炜.基于小波变换的说话人语音特征参数提取[J].河南科技大学学报（自然科学版）,2005,26(4):44-46. 被引量：10
5李仰祝.高校教师人力资源管理应注重“三个转变”[J].人才资源开发,2005(12):30-31.
6徐小华.建立有可能重新使用的软件图书馆[J].淮南师范学院学报,2006,8(3):14-15.
7郝征科,魏明果.基于小波包变换的说话人语音特征参数的提取[J].三峡大学学报（自然科学版）,2006,28(4):374-376. 被引量：2
8彭策,熊屹,陈文西,万柏坤.病态嗓声识别特征参数的优化选择[J].中国生物医学工程学报,2007,26(5):675-679. 被引量：1
9张东阳,张国杰.说话人识别系统研究[J].通信技术,2007,40(11):356-358. 被引量：5
10卢昌荆,王红雨,廖逢钗,张诚一.基于模糊矢量量化(FVQ)普通话等级测试模型研究[J].海南师范学院学报（自然科学版）,2007,20(4):316-320.

同被引文献31

1刘敬伟,徐美芝,郑忠国,程乾生.基于DTW的语音识别和说话人识别的特征选择[J].模式识别与人工智能,2005,18(1):50-54. 被引量：13
2于明,袁玉倩,董浩,王哲.一种基于MFCC和LPCC的文本相关说话人识别方法[J].计算机应用,2006,26(4):883-885. 被引量：14
3Reynolds D A, Quatier T F, Dram R B. Speaker verifica- tion using adapted Gaussian mixture models [ J ]. Digital Singal Processing , 2000,10 : 19-24.
4Reynolds D A, Campbell W, Gleason T T. The 2004 MIT Lincoln laboratory speaker recognition system [ A ]. In Pro- cessdings of ICASSP. Philadel Pbia. USA: [ s. n. ] ,2008.
5Reynolds D A, Rose R. Robust text-independent speaker i- dentification using Gaussian mixture speaker models [ J ]. IEEE Trans on Speech and Audio Processing, 1995, 3 ( 1 ) : 72-83.
6Frey B, Dueck D. Clustering by passing messages between data points[J]. Science, 2007, 315(5184) :972-976.
7Zhong Y C, Hua X. Study on speech control of turning movements of the multifunctional nursing bed [ J ]. Ad- vances in Intelligent and Soft Computing, 2012 ( 1 ) : 67- 72.
8Agrawal U K, Chandra M, Badgaiyan C. Fractional fou- rier transform combination with MFCC based speaker iden- tification in clean environment[ J]. International Journal of Advanced Science, Engineering and Technology, 2012, 1 ( 1 ) :26-28.
9Yuan Y J, Zhao P H, Zhou Q. Research of speaker rec- ognition based on combination of LPCC and MFCC [ C ]// Proc of IEEE International Conference on IntelLigent Com- puting and Intelligent Systems. [ S. 1. ] : IEEE Press, 2010 : 765-767.
10蒋晔,唐振民.GMM文本无关的说话人识别系统研究[J].计算机工程与应用,2010,46(11):179-182. 被引量：27

引证文献2

1王波,钟映春,陈俊彬.融合AP和GMM的说话人识别方法研究[J].广东工业大学学报,2015,32(4):145-149. 被引量：1
2甄倩倩,张庭亮.说话人识别综述[J].科技资讯,2017,15(25):241-243. 被引量：1

二级引证文献2

1薛雷,张弛,张程浩,章依文.汉语儿童言语发育水平自动评估关键技术的研究[J].工业控制计算机,2019,32(7):74-75.
2姜珊,张二华,张晗.基于Bi-GRU+BFE模型的短语音说话人识别[J].计算机与数字工程,2022,50(10):2233-2239. 被引量：3

1成新民,张迎,蒋云良.基于FVQMM的说话人识别[J].辽宁工程技术大学学报（自然科学版）,2007,26(5):719-722.
2李红,余娟芬.手写体数字识别的研究：传统识别方法＋神经元网络[J].微型计算机,1992,12(2):1-4.
3郭荣艳,胡雪惠.BP神经网络在车牌字符识别中的应用研究[J].计算机仿真,2010,27(9):299-301. 被引量：37
4李少坤,江先志,王增怀.康复机器人的语音控制及实现[J].信息通信,2012,25(1):260-261. 被引量：2
5陈功,张雄伟.基于ICA和HMM的战场混叠声目标识别[J].弹道学报,2007,19(1):92-96. 被引量：3
6陈军霞,刘紫玉.基于Baum-Welch算法HMM模型的孤词算法研究[J].河北科技大学学报,2015,36(1):52-57. 被引量：10
7荣蓉.基于神经网络的与文本相关说话人辨认系统[J].山东科学,2008,21(4):62-65.
8王志明,蔡莲红,艾海舟.基于支持向量回归的唇动参数预测[J].计算机研究与发展,2003,40(11):1561-1565. 被引量：7
9李津涛.语音特征参数提取的仿真研究[J].中国新通信,2009,11(9):52-54. 被引量：2
10荣蓉.声纹识别在校园网身份认证中的作用[J].枣庄学院学报,2008,25(5):80-83.

计算机与现代化

2013年第7期

浏览历史

内容加载中请稍等...

基于GMM模型的自适应说话人识别研究被引量：2

参考文献10

二级参考文献17

共引文献73

同被引文献31

引证文献2

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

基于GMM模型的自适应说话人识别研究 被引量：2

参考文献10

二级参考文献17

共引文献73

同被引文献31

引证文献2

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

基于GMM模型的自适应说话人识别研究被引量：2