一种层次化空间分析方法在语种识别系统中的应用

Language recognition method based on hierarchical space analysis

下载PDF

导出

摘要在针对电话语音的自动语种识别系统中,训练和测试语料之间存在不同说话人、信道等因素差异带来的不匹配,是影响识别性能提高的关键因素。为了消除此类影响,提出一种层次化空间分析方法,首先对前端部分MFCC+SDC特征进行HLDA(异方差线性判别分析),增大了语种各个类的类间差异;然后对经自适应得到含有冗余信息的GSV进行PCA特征选择,有效地去除了信道等冗余信息的干扰。实验结果表明,此方法能有效消除信道等噪声影响,从而提升了原有系统的识别性能。 In automatic spoken language recognition system on telephone conversation speech,differences between train and test utterances on channels,gender and speakers are the key factor of improving the performance of the system.This paper proposed a hierarchical space analysis method.Firstly,it mapped the front-end cepstral features of SDC into the HDLA space,aiming at increasing the discriminability between different languages.Secondly,it selected the characters of adaptive GMM super vector by the method of PCA,which eliminated the influences of different channels,speakers and so on.Experiment results indicate that this method is better for improving the system＇s performance than the original baseline system.

作者常振超刘斌石远超张兴明杨镇西张丽

机构地区解放军信息工程大学信息工程学院

出处《计算机应用研究》 CSCD 北大核心 2012年第10期3651-3654,共4页 Application Research of Computers

基金国家"863"计划重点资助项目(2008AA011002)

关键词语种识别层次化空间分析冗余信息 language recognition hierarchical space analysis redundant information

分类号 TN912.34 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献11

1CAMPBELL W M, CAMPBELL J P, REYNOLDS D A, et al, Support vector machines for speaker and language recognition [ J]. Com- puter Speech & Language,2006,20(2-3): 210-229. .
2MATEJKA P. Phonotactic and acoustic language recognition [ D]. Brno: Brno University of Technology, 2008.
3宋彦,戴礼荣,王仁华.基于超向量子空间分析的自动语种识别方法[J].模式识别与人工智能,2010,23(2):165-170. 被引量：4
4DEHAK N, McCREE A, REYNOLDS D, et al, MITLL 2011 language recognition evaluation system description [ R]. Cambridge: Information Systems Technology Group, MIT Lincoln Laboratory, 2011.
5MARTINEZ G D, PLCHOT 0, BURGET L, et al. Language recognition in iVectors space [C J/ /Proc of the 12th Annual Conference on International Speecb Communication Association. 2011 : 861- 864.
6KENNY P. Joint factor analysis of speaker and session variability: theory and algorithms[EB/OL]. (2006-01-13). http://www.crim. calperso/patrick. kenny/FAtheory. polf.
7KUMAR N. Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition [ D] . Baltimore: John Hopkins University, 1997.
8GALES M J. Semi-tied covariance matrices for hidden Markov models [ J ]. IEEE Trans on Speech and Audio Processing, 1999 , 7 (3) :272-281.
9BISHOP C M. Pattern recognition and machine learning [ M]. New York: Springer ,2006.
10ROWlES S. EM algorithm for PCA and SPCA [ C]/ /Proc of Conference on Advances in Neural Information Processing System. Cambridge: MIT Press, 1997 :626-632.

二级参考文献13

1Torres-Carrasquillo P A,Singer E,Kchler M A,et.al.Approaches to Language Identification Using Gaussian Mixture Models and Shifted Delta Cepstral Features// Proc of the International Conference on Spoken Language Processing.Denver,USA,2002:89 -92.
2Pelecanos J,Sridharan S.Feature Warping for Robust Speaker Verification// Proc of the Speaker and Language Recognition Workshop.Crete,Greece,2001:213 -218.
3Burget L,Matejka P,Cernocky J.Discriminative Training Techniques for Acoustic Language Identification//Proc of the IEEE International Conference on Acoustics,Speech and Signal Processing.Toulouse,France,2006:209-212.
4Qu Dan,Wen Bingxi.Discriminative Training of GMM for Language Identification//Proc of the ICSA and IEEE Workshop on Spontaneous Speech Processing and Recognition.Tokyo,Japan,2003:108 -110.
5Campbell W M,Sturim D E,Reynolds D A.SVM Based Speaker Verification Using a GMM Supervector Kernel and NAP Variability //Proc of the IEEE International Conference on Acoustics,Speech and Signal Processing.Toulouse,France,2006:97-100.
6Smith N,Gales M.Data-Dependent Kernels in SVM Classification of Speech Patterns// Proc of the 6th International Conference on Spoken Language Processing.Beijing,China,2000,Ⅰ:297 -300.
7Campbell M,Campbell J P,Reynolds D A,et al.Support Vector Machines for Speaker and Language Recognition.Computer Speech and Language,2006,20(2/3):210-229.
8Bishop C M.Pattern Recognition and Machine Learning.New York,USA:Springer,2006.
9Chang C C,Lin C J.LIBSVM:A Library for Support Vector Machines[DB/OL].[2008-10-20].http://www.csie.ntu.edu.tw/_cjlin/libsvm.
10Hatch A O,Stolcke A.Generalized Linear Kernels for One Versus All Classification:Application to Speaker Recognition//Proc of the IEEE International Conference on Acoustics,Speech and Signal Processing.Toulouse,France,2006,Ⅴ:585-588.

共引文献3

1张丽,杨镇西,吉立新.语种识别算法中GSV计算的定点仿真与实现[J].计算机工程与设计,2012,33(2):679-683. 被引量：1
2聂智良,张兴明,杨镇西,张丽.区分性锚模型应用于语种识别的研究[J].计算机工程,2012,38(3):172-175. 被引量：1
3常振超,张兴明,杨镇西,张丽.一种结合支持向量机训练的锚模型语种识别方法[J].小型微型计算机系统,2013,34(4):837-842. 被引量：1

1陈思宝,胡郁,王仁华.一种结构受限的异方差线性判别分析[J].中文信息学报,2008,22(4):94-99.
2徐红,黄朝耿,宋洪波,周志光,李刚.并行计算的全通数字滤波器结构[J].电子学报,2015,43(10):2034-2039. 被引量：1
3黄鹏,武毅,唐福年.GSVF—1—Ⅱ到1kW分米波彩色电视发射机故障分析[J].内蒙古广播与电视技术,1997,14(4):46-48.
4王宪亮,吴志刚,杨金超,周若华,颜永红.基于SVM一对一分类的语种识别方法[J].清华大学学报（自然科学版）,2013,53(6):808-812. 被引量：10
5陈瑶玲,李奎.基于多分类器融合的语言识别研究[J].电子世界,2014(18):200-201.
6关一夫,张国毅,王晓峰.一种基于动态规划的雷达合批新算法[J].电讯技术,2013,53(11):1446-1451. 被引量：3
7陈瑶玲,杨鉴.基于多特征和多分类器融合的语种识别[J].微计算机信息,2010,26(25):195-197. 被引量：2
8宋彦,戴礼荣,王仁华.基于超向量子空间分析的自动语种识别方法[J].模式识别与人工智能,2010,23(2):165-170. 被引量：4
9孟斌,尹卫红,张景秋,张文忠.北京宜居城市满意度空间特征[J].地理研究,2009,28(5):1318-1326. 被引量：49
10徐向华,朱杰,郭强.决策树结构对说话人自适应影响的研究[J].声学学报,2006,31(1):42-47. 被引量：3

计算机应用研究

2012年第10期

浏览历史

内容加载中请稍等...

一种层次化空间分析方法在语种识别系统中的应用

参考文献11

二级参考文献13

共引文献3

相关作者

相关机构

相关主题

浏览历史