期刊文献+

一种层次化空间分析方法在语种识别系统中的应用

Language recognition method based on hierarchical space analysis
在线阅读 下载PDF
导出
摘要 在针对电话语音的自动语种识别系统中,训练和测试语料之间存在不同说话人、信道等因素差异带来的不匹配,是影响识别性能提高的关键因素。为了消除此类影响,提出一种层次化空间分析方法,首先对前端部分MFCC+SDC特征进行HLDA(异方差线性判别分析),增大了语种各个类的类间差异;然后对经自适应得到含有冗余信息的GSV进行PCA特征选择,有效地去除了信道等冗余信息的干扰。实验结果表明,此方法能有效消除信道等噪声影响,从而提升了原有系统的识别性能。 In automatic spoken language recognition system on telephone conversation speech,differences between train and test utterances on channels,gender and speakers are the key factor of improving the performance of the system.This paper proposed a hierarchical space analysis method.Firstly,it mapped the front-end cepstral features of SDC into the HDLA space,aiming at increasing the discriminability between different languages.Secondly,it selected the characters of adaptive GMM super vector by the method of PCA,which eliminated the influences of different channels,speakers and so on.Experiment results indicate that this method is better for improving the system's performance than the original baseline system.
出处 《计算机应用研究》 CSCD 北大核心 2012年第10期3651-3654,共4页 Application Research of Computers
基金 国家"863"计划重点资助项目(2008AA011002)
关键词 语种识别 层次化 空间分析 冗余信息 language recognition hierarchical space analysis redundant information
  • 相关文献

参考文献11

  • 1CAMPBELL W M, CAMPBELL J P, REYNOLDS D A, et al, Support vector machines for speaker and language recognition [ J]. Com- puter Speech & Language,2006,20(2-3): 210-229. .
  • 2MATEJKA P. Phonotactic and acoustic language recognition [ D]. Brno: Brno University of Technology, 2008.
  • 3宋彦,戴礼荣,王仁华.基于超向量子空间分析的自动语种识别方法[J].模式识别与人工智能,2010,23(2):165-170. 被引量:4
  • 4DEHAK N, McCREE A, REYNOLDS D, et al, MITLL 2011 language recognition evaluation system description [ R]. Cambridge: Information Systems Technology Group, MIT Lincoln Laboratory, 2011.
  • 5MARTINEZ G D, PLCHOT 0, BURGET L, et al. Language recognition in iVectors space [C J/ /Proc of the 12th Annual Conference on International Speecb Communication Association. 2011 : 861- 864.
  • 6KENNY P. Joint factor analysis of speaker and session variability: theory and algorithms[EB/OL]. (2006-01-13). http://www.crim. calperso/patrick. kenny/FAtheory. polf.
  • 7KUMAR N. Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition [ D] . Baltimore: John Hopkins University, 1997.
  • 8GALES M J. Semi-tied covariance matrices for hidden Markov models [ J ]. IEEE Trans on Speech and Audio Processing, 1999 , 7 (3) :272-281.
  • 9BISHOP C M. Pattern recognition and machine learning [ M]. New York: Springer ,2006.
  • 10ROWlES S. EM algorithm for PCA and SPCA [ C]/ /Proc of Conference on Advances in Neural Information Processing System. Cambridge: MIT Press, 1997 :626-632.

二级参考文献13

  • 1Torres-Carrasquillo P A,Singer E,Kchler M A,et.al.Approaches to Language Identification Using Gaussian Mixture Models and Shifted Delta Cepstral Features// Proc of the International Conference on Spoken Language Processing.Denver,USA,2002:89 -92.
  • 2Pelecanos J,Sridharan S.Feature Warping for Robust Speaker Verification// Proc of the Speaker and Language Recognition Workshop.Crete,Greece,2001:213 -218.
  • 3Burget L,Matejka P,Cernocky J.Discriminative Training Techniques for Acoustic Language Identification//Proc of the IEEE International Conference on Acoustics,Speech and Signal Processing.Toulouse,France,2006:209-212.
  • 4Qu Dan,Wen Bingxi.Discriminative Training of GMM for Language Identification//Proc of the ICSA and IEEE Workshop on Spontaneous Speech Processing and Recognition.Tokyo,Japan,2003:108 -110.
  • 5Campbell W M,Sturim D E,Reynolds D A.SVM Based Speaker Verification Using a GMM Supervector Kernel and NAP Variability //Proc of the IEEE International Conference on Acoustics,Speech and Signal Processing.Toulouse,France,2006:97-100.
  • 6Smith N,Gales M.Data-Dependent Kernels in SVM Classification of Speech Patterns// Proc of the 6th International Conference on Spoken Language Processing.Beijing,China,2000,Ⅰ:297 -300.
  • 7Campbell M,Campbell J P,Reynolds D A,et al.Support Vector Machines for Speaker and Language Recognition.Computer Speech and Language,2006,20(2/3):210-229.
  • 8Bishop C M.Pattern Recognition and Machine Learning.New York,USA:Springer,2006.
  • 9Chang C C,Lin C J.LIBSVM:A Library for Support Vector Machines[DB/OL].[2008-10-20].http://www.csie.ntu.edu.tw/_cjlin/libsvm.
  • 10Hatch A O,Stolcke A.Generalized Linear Kernels for One Versus All Classification:Application to Speaker Recognition//Proc of the IEEE International Conference on Acoustics,Speech and Signal Processing.Toulouse,France,2006,Ⅴ:585-588.

共引文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部