期刊文献+

汉语语音识别中的一种音节分割方法 被引量:5

A Method for Syllable Segmentation in Mandarin Speech Recognition
在线阅读 下载PDF
导出
摘要 汉语语音识别研究中,识别单元的选取是很重要的。随着大词汇量连续语音识别研究的发展,越来越多汉语语音识别研究中选取次音节单位作为识别单元。结合汉语发音声学特性,提出了音节的重叠音素分割策略,并利用小波方法实现了音节的分割,实验证明该方法分割准确可靠。 It's very important to choose speech recognition units in Mandarin speech recognition researches.As the development of large vocabulary continuous speech recognition,sub--syllable is used as recognition units in many Mandarin speech recognition researches.In this paper,based on Mandarin acoustic characteristics,an overlapped segmenting scheme is presented and wavelets are used to segment phoneme.Experiments show that the method is available.
机构地区 南京理工大学
出处 《火力与指挥控制》 CSCD 北大核心 2004年第6期94-96,99,共4页 Fire Control & Command Control
关键词 音素 语音识别 小波变换 突变检测 phoneme,speech recognition,wavelets transform,jump-detection
  • 相关文献

参考文献5

  • 1Raviner L R,et al. A Segmental K-Means Training Procedure for Connected Word Recognition Based on Whole Word Reference Pattern [J]. AT&T Tech. J. 1986,165:21-31.
  • 2Mallat S,Zhong S. Characterization of Signals from Multiscale Edges[J]. IEEE Trans,on PAMI. 1992,PAMI-14 (7) : 710-732.
  • 3Mallat S, Hwang M L. Singularity Detection and Processing with Wavelets [J]. IEEE Trans. OnIT.1992,32 (2): 617-643.
  • 4Mallat S. Zero-Crossing of a Wavelet Transform[J]. IEEE Trans ,on IT. 1991,37 (4): 1019-1033.
  • 5Hao Yu,Zhou Xiaoyan. A New Feature in Speech Recognition Based on Wavelet Transform [J ].Proceedings of ICSP2000. 2000:1526-1529.

同被引文献33

  • 1张静亚.基于CHMM的高性能连续数字语音识别算法[J].常熟理工学院学报,2005,19(2):93-96. 被引量:4
  • 2杜利民,侯自强.汉语语音识别研究面临的一些科学问题[J].电子学报,1995,23(10):110-116. 被引量:21
  • 3O' Shaughnessy D.Invited paper: automatic speech recognition: History,methods and challenges[J].Pattern Recognition,2008,41: 2965-2979.
  • 4闫润强,祖漪清,朱贻盛.递归趋势分析在汉语语音声韵母切分中的应用研究[J].信号处理,2007,23(4):521-525. 被引量:5
  • 5SARKAR M, YEGNANARAYANA B. Rough-fuzzy set theoretic approach to vvaluate the importance of input features in classification[C]// Proceedings of 1997 Int. Conf. on Neural Networks. Texas:IEEE Press, 1997,3: 1590-1595.
  • 6SARKAR M. Fuzzy-rough nearest neighbors algorithm[C]// Proceedings of 2000 IEEE Int. Conf. on Systems, Man, and Cybernetics. Nashville : IEEE Press, 2000,5 : 3556-3561.
  • 7Dusan S, Rabiner L. On the relation between maximum spec- tral transition positions and phone boundaries [C]. Pittsburgh, PA, USA: Ninth International Conference on Spoken Lan guage Processing, 2006: 17-21.
  • 8Jurado R S, Gomez-Gil P, Garda C- Speech text-independent seg- mentation using an improv-nent method for identification of phoneme boundaries [C]. Cholula, Puebla: International Conference on Elec- trical, Communications, and Computers, 2009: 20-24.
  • 9Thcera-Umpon N, Chansareewittaya S, Auephanwiriyakul S. Thai phoneme soft segmentation and recognition using hidden Markov models [C]. Taipei, Taiwan : 33rd Annual Conference of the IEEE Industrial Electronics Society, 2007. 251-255.
  • 10Kanda H, Ogata T, Komatani K, et al. Segmenting acoustic signal with articulatory movement using recurrent neural net-work for phoneme acquisition [C]. Nice, France: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008. 1712-1717.

引证文献5

二级引证文献15

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部