期刊文献+

汉语连续语音识别中的分级聚类算法的研究和应用 被引量:2

A Hierarchical Clustering Algorithm in Continuous Mandarin Speech Recognition
在线阅读 下载PDF
导出
摘要 针对汉语语音单音节结构的特点,考虑音节间协同发音的现象,本文提出了一种对三音子模型进行分级聚类的方法。与传统的基于决策树的状态聚类算法相比,该方法通过对稀少三音子模型聚类,更充分地利用训练数据,减少稀少三音子对状态聚类的影响,从而提高声学模型的鲁棒性。实验结果表明:大词汇量连续语音识别器采用这种分级聚类方法,不仅可以大大减少模型及其参数的数量,还可使系统识别率有所提高,其中误识率相对于传统的决策树状态聚类系统降低了4.93%。 Based on the single syllable characteristics of Mandarin and considering the inter-syllable coarticulatory phenomena, a new hierarchical clustering algorithm is proposed. Compared with the traditional decision-tree based state-tying, the algorithm can take better use of training data and lessen the impact of rare triphones to state-tying. Experiments on large vocabulary continuous Mandarin speech recognition system show that the method can get better performance (about 4.93% word error rate reduction) with even fewer parameters.
出处 《信号处理》 CSCD 2004年第5期497-500,共4页 Journal of Signal Processing
基金 上海市科委重点基金项目资助(01JC14033)
关键词 状态聚类 决策树 训练数据 聚类算法 三音子 鲁棒性 聚类方法 汉语连续语音识别 协同发音 误识率 continuous speech recognition decision tree model-based clustering state-tying
  • 相关文献

参考文献11

  • 1K. E Ice, H. W. Hon, and R. Reddy. An overview of the SPHINX speech recognition system. IEEE Trans. Acoustics, Speech, Signal Processing, 1990, 38: 35-45.
  • 2C. H. Lee, E. Giachin, L. Rabiner, and A. Rosenberg.Improved acoustic modeling for large vocabulary continuous speech recognition. Computer Speech and Language, 1992, 6: 103-127.
  • 3M. Y. Hwang, X.D. Huang. Subphonetic modeling with Markov states-senone. Proc. IEEE Int. Conf. Acoustics, Speech, Signal processing, 1992, 1: 33-36.
  • 4S..L Young, J.J. Odell, and E C. Woodland. Tree-based state tying for high accuracy acoustic modeling. In Proceedings ARPA Workshop on Human Language Technology, 1994: 307-312.
  • 5M. Y. Hwang, X. D. Huang and E Alleva. Predicting unseen tdphones with senones. Proc. IEEE Int. Conf.Acoustics, Speech, Signal processing, 1993, 2:311-314.
  • 6W. Rcichl, W. Chou. Robust decision tree state tying for continuous speech reognition. IEEE Trans. Acoustics,Speech, Signal Processing, 2000, 8: 555-566.
  • 7C. J. Liu, X.T. Wu, and Y. H. Yah. High accuracy acoustic modeling using two-level decision-tree based state-tying.Proc. 6^th Eur. Conf. Speech Communication Technology,1999, 4: 1703-1706.
  • 8E. Chang, J. L. Zhou, S. Di, C. Huang, and K. E Lee.Large vocabulary Mandarin speech recognition with different approaches in modeling tones. Proc. IEF.E Int. Conf. Spoken Language Processing, 2000, 983-986.
  • 9J. T. Chien, C. H. Huang and S. J. Chen. Compact decision trees with cluster validity for speech recognition.Proc. IEEE Int. Conf. Acoustics, Speech, Signal processsing, 2000, 2: 873-876.
  • 10E. Chang, Y. Shi, J. L. Zhou, and C. Huang. Speech lab in a box: a Mandarin speech toolbox to jumpstart speech related research. Proc. 7^th Eur. Conf. Speech Communication Technology, 2001.

同被引文献33

  • 1曹剑芬.普通话双音子和三音子结构系统代表语料集[J].语言文字应用,1997(1):62-70. 被引量:7
  • 2王志明,蔡莲红,艾海舟.基于数据驱动方法的汉语文本-可视语音合成(英文)[J].软件学报,2005,16(6):1054-1063. 被引量:16
  • 3张翠丽,张申生,李磊.基于统一受理的农业呼叫中心解决方案[J].计算机应用与软件,2006,23(10):31-32. 被引量:9
  • 4赵春江,申长军,邢振,郑文刚,鲍锋,吴文彪.农产品信息采集器及采集方法[P].中国:CNl02122430A,2011.
  • 5Singh G. Multi utility e-controlled cum voice operated farm.International Journal of Computer Applications, 2010, 1(13): 109-113.
  • 6Mantena G V, Rajendran S, Rambabu B, Gangashetty S V, Yegnanarayana B, Prahallad K. A speech-based conversation system for accessing agriculture commodity prices in Indian languages. Hands-free Speech Communication and Microphone Arrays (HSCMA) 2011 Joint Workshop on, 2011: 153-154.
  • 7Plauche M, Nallasamy U, Pal J, Wooters C, Ramachandran D. Speech recognition for illiterate access to information and technology. //Proceedings of the First International Conference on Information and Communication Technologies and Development (ICTD '06). Berkeley, CA, 2006: 83-92.
  • 8Ou W H, Gao W L, Li Z, Zhang S L, Wang Q. Application of keywords speech recognition in agricultural voice information system. //Computational Intelligence and Natural Computing Proceedings ( CINC), 2010 Second International Conference. Wuhan, Hubei, 2010: 197-200.
  • 9Chedad A, Moshou D, Aerts J M, Van Hirtum A, Ramon H, Berckmans D. Recognition system for pig cough based on probabilistic neural networks. Journal of Agricultural Engineering Research, 2001, 79(4): 449-457.
  • 10Guarino M, Jans P, Costa A, Aerts J M, Berckmans D. Field test of algorithm for automatic cough detection in pig houses. Computers and Electronics in Agriculture, 2008, 62(1): 22-28.

引证文献2

二级引证文献20

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部