汉语语音识别中的区分性声调建模方法被引量：4

Tone modeling based on discriminative training for Mandarin speech recognition

下载PDF

导出

摘要提出从特征提取参数、模型参数对隐马尔可夫声调模型进行区分型训练,来提高声调识别率;提出模型相关的权重对谱特征模型和声调模型的概率进行加权,并根据最小音子错误区分性目标函数对权重进行训练,来提高声调模型加入连续语音识别时的性能。声调识别实验表明区分性的声调模型训练以及特征提取方法显著提高了声调识别率。区分性模型权重训练能够在声调模型加入之后进一步连续语音识别系统的识别率。 To improve tone recognition accuracy,discriminative training in both feature and model parameters for hidden Markov model based tone modeling is proposed.When incorporating tone models into continuous speech recognition,discriminative model weight training is presented.Acoustic and tone model distributions are scaled by model dependent weights trained by the mini- mum phone error criterion.Experiments show tone recognition and large vocabulary continuous speech recognition accuracy can be considerably improved by the proposed methods.

作者黄浩朱杰哈力旦

机构地区上海交通大学电子工程系新疆大学信息科学与工程学院新疆大学电气工程学院

出处《计算机工程与应用》 CSCD 北大核心 2009年第11期178-182,共5页 Computer Engineering and Applications

基金国家自然科学基金No.60865001~~

关键词区分性训练声调建模汉语语音识别特征提取 discriminative training tone modeling Mandarin speech recognition feature extraction

分类号 TN912.34 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献8

1Huang C H,Side F.Pitch tracking and tone features for mandarin speech recognition[C]//Proceedings of the 25th International Conference on Acoustics,Speech and Signal Processing,Istanbul,Turkey,2000:1523-1526.
2Lei X, Siu M, Hwang M, Ostendorf M,et al.Improved tone modeling for Mandarin broadcast news speech recognition[C]//Proceedings of Interspeech (ICSLP), Pittsburgh, USA, 2006 : 1277-1280.
3Povey D,Woodland P C.Minimum phone error and i-smoothing for improved discriminative training[C]//Proceedings of the 27th International Conference on Acoustics,Speech and Signal Processing, Florida, USA, 2002 : 105-108.
4Povey D.Discriminative training for large vocabulary speech recognition[D].Peterhouse : Cambridge University, 2004.
5Povey D, Kingsbury B, Mangu L, et al.fMPE : discriminatively trained features for speech recognition[C]//Proceedings of International Conference on Acoustics,Speech and Signal Processing,Philadelphia, USA, 2005,1 : 961-964.
6Chang E,Shi Yu,Zhou Jian-lai,et al.Speech lab in a box:a Mandarin speech toolbox to jumpstart speech related research[C]//Proceedings of the 7th European Conference on Speech Communication and Technology,Aalborg,Denmark,2001:2779-2782.
7Gopalakrishnan P S,Kanevsky D,Nadas A,et al.A generalization of the Baum algorithm to rational objective functions[C]//Proceedings of the 25th International Conference on Acoustics,Speech and Signal Processing, Glasgow, Seotland, 1989:631-634.
8Lee T,Lau W,Wong Y W,et al.Using tone information in cantonese continuous speech recognition[J].ACM Transactions on Asian Language Information Processing, 2002, 1 ( 1 ) : 83-102.

同被引文献15

1高新涛,陈乖丽.语音识别技术的发展现状及应用前景[J].甘肃科技纵横,2007,36(4):13-13. 被引量：19
2YAN Long,ZHAO Ren-cai,LIU Gang,et al. Large vocabulary manda-rin Chinese continuous speech recognition system based on tonaltriphone [ C] //Proc of International Symposium on Tonal Aspects ofLanguages. 2004:28 - 31.
3YOUNG S J, WOODLAND P C. State clustering in hidden Markovmodel-based continuous speech recognition [ J]. Computer Speechand Language, 1994,8(4) :369-384.
4WANG Guang-sen, SIM K C. An investigation of tied-mixture GMMbased triphones state clustering [ C] //Proc of IEEE International Con-ference on Acoustics,Speech and Signal Processing.2012 -.4717-4720.
5REICHL W,CHOU W. Robust decision tree state tying for continuousspeech recognition [ J]. IEEE Trans on Speech and Audio Pro-cessing,2000,8(5) ;555-566.
6LIU Chao-jun,WU Xin-tian,YAN Yong-hong. High accuracy acousticmodeling using two-level decision-tree based state-tying[ C] //Proc ofthe 5 th European Conference on Speech Communication and Techno-logy. 1999:1703-1706.
7WONG Y W,CHANG E. The effect of pitch and lexical tone on diffe-rent mandarin speech recognition tasks[ C]//Proc of the 7th EuropeanConference on Speech Communication and Technology. 2001:2741-2744.
8SHINODA K. Speaker adaptation techniques for automatic speechrecognition[ C]//Proc of APSIPA. 2011.
9詹新明,黄南山,杨灿.语音识别技术研究进展[J].现代计算机,2008,14(9):43-45. 被引量：44
10倪崇嘉,刘文举,徐波.汉语大词汇量连续语音识别系统研究进展[J].中文信息学报,2009,23(1):112-123. 被引量：39

引证文献4

1齐耀辉,潘复平,葛凤培,颜永红.汉语连续语音识别系统中三音子模型的优化[J].计算机应用研究,2013,30(10):2920-2922. 被引量：4
2王坤,郭起云,郭光.大数据时代下档案信息采集新思路[J].数字与缩微影像,2014(2):7-8. 被引量：2
3刘豫军,夏聪.连续语音识别技术及其应用前景分析[J].网络安全技术与应用,2014(8):15-16. 被引量：5
4陈拥权,李建中,郑荣稳,鲁加旺.连续语音识别技术及其应用前景分析[J].数码世界,2016,0(1):29-31.

二级引证文献11

1许金普,诸叶平.基于语音识别的农产品价格信息采集方法[J].中国农业科学,2015,48(3):449-459. 被引量：8
2许金普.基于MMSE谱减算法的农产品市场信息语音识别技术[J].河南农业科学,2015,44(5):156-160. 被引量：2
3许金普,许丰娟,诸叶平,刘升平,岳慧丽,刘丹.农产品市场信息采集的语音识别鲁棒性方法[J].中国农业科技导报,2015,17(4):100-106.
4刘润东.云计算平台下的语音信号处理[J].现代电子技术,2016,39(2):15-17. 被引量：1
5丁磊,蒋东国,王志韬.语音识别技术在电子货架标签系统中的应用[J].计算机测量与控制,2016,24(10):186-189. 被引量：1
6惠益龙,张太红,吕莲花,王蓓蓓.语音识别中的统计语言模型研究[J].信息技术,2017,41(1):44-46. 被引量：2
7张志强.论网络对当代大学生的负面影响[J].中小企业管理与科技,2017,1(5):161-162.
8赵宇环.云计算平台下的语音信号处理探析[J].山西科技,2017,32(6):74-75.
9易明,冯翠翠,莫富传.大数据时代的信息资源管理创新研究[J].图书馆学研究,2019,0(6):56-61. 被引量：14
10翟永杰,杨旭,彭雅妮,王新颖.基于计算机听觉技术的电力设备状态监测研究综述[J].广东电力,2019,32(9):24-32. 被引量：19

1黄浩,朱杰.TONE MODELING BASED ON HIDDEN CONDITIONAL RANDOM FIELDS AND DISCRIMINATIVE MODEL WEIGHT TRAINING[J].Transactions of Nanjing University of Aeronautics and Astronautics,2008,25(1):43-50. 被引量：1
2黄浩,朱杰.Discriminative tone model training and optimal integration for Mandarin speech recognition[J].Journal of Southeast University(English Edition),2007,23(2):174-178.
3黄浩,朱杰.汉语语音识别中基于区分性权重训练的声调集成方法[J].声学学报,2008,33(1):1-8. 被引量：2
4张琰彬,呼月宁,初敏,黄超,梁满贵.汉语普通话声调发音错误检测[J].清华大学学报（自然科学版）,2008,48(S1):683-687. 被引量：1
5黄浩,李兵虎,吾守尔.斯拉木.汉语语音识别声调模型集成中基于决策树的上下文相关权重参数聚类方法[J].新疆大学学报（自然科学版）,2011,28(3):260-266.
6Wang,IY,管长发.与计算机网络模型相关的非派生队列网络的稳态概率...[J].数理译丛,1991(1):23-29.
7吴强,符涛,周娜.移动网络网元分布式部署研究[J].邮电设计技术,2010(5):34-38. 被引量：2
8吴志勇,蔡莲红,蔡锐.语音合成中基于听辨指导的权重训练算法[J].清华大学学报（自然科学版）,2005,45(1):52-56. 被引量：1
9赵力,邹采荣,吴镇扬.基于3维空间Viterbi算法的音素模型和声调模型识别概率统合方法的研究[J].声学学报,2001,26(3):259-263. 被引量：3
10刘明辉,戴蓓蒨,解焱陆.基于GMM多维概率输出的SVM话者确认[J].模式识别与人工智能,2008,21(1):28-33. 被引量：2

计算机工程与应用

2009年第11期

浏览历史

内容加载中请稍等...

汉语语音识别中的区分性声调建模方法被引量：4

参考文献8

同被引文献15

引证文献4

二级引证文献11

相关作者

相关机构

相关主题

浏览历史

汉语语音识别中的区分性声调建模方法 被引量：4

参考文献8

同被引文献15

引证文献4

二级引证文献11

相关作者

相关机构

相关主题

浏览历史

汉语语音识别中的区分性声调建模方法被引量：4