期刊文献+

基于GMM-UBM和GLDS-SVM的英文发音错误检测方法 被引量:3

Automatic Mispronunciation Detection for English Learners by GMM-UBM and GLDS-SVM Methods
在线阅读 下载PDF
导出
摘要 将语种和说话人识别的方法应用到英语发音错误检测系统,提出一种基于广义线性区分序列支持向量机(Generalized linear dis-criminant sequence based SVM,GLDS-SVM)的发音错误检测方法.主要创新点为:1)提出一种基于状态拼接的特征规整方案,增强SVM对发音特征的建模能力;2)提出一种基于多模型融合的模型训练策略,该策略可以更加充分地利用训练数据,并在一定程度上解决了由于真实发音错误数据缺乏造成的正负样本不均衡的问题;3)将GLDS-SVM与基于通用背景模型GMM(Universal background modelsbased GMM,GMM-UBM)的方法进行融合,以进一步提高发音检错性能.GLDS-SVM和GMM-UBM的融合系统在仿真测试集和真实测试集上的等错误率(Equal error rate,EER)分别达到9.92%和16.35%.同时,GLDS-SVM在模型占用空间和运算速度方面均比传统径向基函数(Radial basic function,RBF)核方法具有明显优势. The paper proposes an efficient generalized linear discriminant sequence based SVM (GLDS-SVM) based mispronunciation detection method. Firstly, in order to enhance the ability of describing pronunciation characteristics, we introduce an improved SVM feature normalization scheme based on state-concatenated operation. Then, we propose a novel multi-model strategy for model training to make full use of samples and solve the problem of data unbalance caused by lack of the actual mispronunciation corpus. Finally, we combine GLDS-SVM with universal background models based GMM (GMM-UBM) to further improve the performance. The fused system by these two methods achieves 9.92% and 16.35% in equal error rate (EER) for simulation set and real set, respectively. Meanwhile, GLDS-SVM processes a higher computation speed and smaller model size than traditional radial basic function (RBF) kernel.
出处 《自动化学报》 EI CSCD 北大核心 2010年第2期332-336,共5页 Acta Automatica Sinica
基金 国家高技术研究发展计划(863计划)(2006AA010103)资助~~
关键词 计算机辅助语言学习 自动发音错误检测 支持向量机特征规整 多模型融合策略 Computer assisted language learning (CALL) automatic mispronunciation detection support vector machine (SVM) feature normalization multi-model fusion strategy
  • 相关文献

参考文献9

  • 1Pan F P, Zhao Q W, Yan Y H. Mandarin vowel pronunciation quality evaluation by a novel formant classification method and its combination with traditional algorithms. In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing. Las Vegas, USA: IEEE, 2008. 5061-5064.
  • 2董滨,赵庆卫,颜永红.基于共振峰模式的汉语普通话中韵母发音水平客观测试方法的研究[J].声学学报,2007,32(2):122-128. 被引量:16
  • 3Jiang J, Xu B. Exploring the automatic mispronunciation detection of confusable phones for Mandarin. In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing. Taipei, China: IEEE, 2009. 4833-4836.
  • 4Reynolds D A, Quatieri T F, Dunn R B. Speaker verification using adapted Gaussian mixture models. Digital Signal Processing, 2000, 10(1-3): 19-41.
  • 5Ganapathiraju A, Hamaker J E, Picone J. Applications of support vector machines to speech recognition. IEEE Transactions on Signal Processing, 2004, 52(8): 2348-2355.
  • 6Lin H T, Lin C J, Weng R C. A note on Platt's probabilistic outputs for support vector machines. Machine Learning, 2007, 68(3): 267-276.
  • 7Campbell W M, Campbell J P, Reynolds D A, Singer E, Torres-Carrasquillo P A. Support vector machines for speaker and language recognition. Computer Speech and Language, 2006, 20(2-3): 210-229.
  • 8Kittler J, Hatef M, Robert P W, Jiri M. On combining classifters. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1998, 20(3): 226-239.
  • 9Chang C C, Lin C J. LIBSVM: a library for support vector machines [Online], available: http://www.csie.ntu.edu.tw/ -cjlin/libsvm, September 11, 2007.

二级参考文献14

共引文献15

同被引文献41

引证文献3

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部