期刊文献+

汉语孤立字全音节实时识别系统 被引量:4

A real-time speaker-dependent syllable recognition system of the complete vocabulary of Chinese
原文传递
导出
摘要 本文在大量语音实验的基础上,对汉语语音识别方法进行了较为深入的探讨,并以IBMPC/AT配以自行研制开发的TMS320C25-E型高速信号处理板为硬件基础,建立了一个特定人汉语普通话全音节实时识别系统.该系统针对汉语普通话的语音特点,采用了分层识别策略.整个系统响应时间小于0.2秒,用4遍1240个全音节语音对系统进行的严格测试表明:系统四声识别的平均正确率为99%左右,音节识别前5个候选的正确识别率分别为82%,91%,94%,96%,97%;同时,本文根据这一测试结果建立了相应的声韵母混淆矩阵和基于Shepard方法的相似度集群分析树图,并对照汉语语音合成清晰度测试结果及汉语语音知觉结构的集群分析结果,对本系统各部分进行了较为深入的分析,提出了相应的改进措施. Based on a large number of speech experiments, Mandarin speech recognition approaches have been thoroughly studied, and a real-time speaker-dependent all-syllable recognition system of Mandarin has been developed on an IBM PC/AT microcomputer with a high-speed digital signal processing board TMS320C25-E. In accordance with the phonetic characteristics of Mandarin, the three-stage recognition strategy is adopted in this system. Experiments for the speech datas of 4 times 1240 syllables show that, average correct rate of four tone recognition is about 99%, correct rates of the first 5 candidates of syllable recognition are 82%, 91%, 94%, 96%, and 97% respectively, and the whole system response time is less than 0.2 second. In addition, the Mandarin initials and finals confusion matrices, and the corresponding hierarchical clustering diagram of the similarity are obtained from the experiment results, and they are analyzed in comparision with the references [1,2] so as to further improve the system performance.
出处 《声学学报》 EI CSCD 北大核心 1993年第3期161-171,共11页 Acta Acustica
基金 国家自然科学基金资助项目
  • 相关文献

参考文献5

  • 1陈韬,1990年
  • 2陈永彬,语言信号处理,1990年
  • 3吴宗济,实验语音学概要,1989年
  • 4张家禄,J Chin Lingustics,1982年,10卷,190页
  • 5张家禄,心理学报,1981年,1卷,76页

同被引文献16

  • 1栗学丽,丁慧,徐柏龄.基于熵函数的耳语音声韵分割法[J].声学学报,2005,30(1):69-75. 被引量:34
  • 2潘凌云,孙达传,吴美朝.语音识别中基于语谱图的语音音素分割方法[J].杭州大学学报(自然科学版),1995,22(1):42-46. 被引量:7
  • 3齐士钤 张家禄.汉语普通话辅音音长分析[J].声学学报,1982,(1):8-13.
  • 4曹剑芬.现代语音基础知识[M].北京:人民教育出版社,1990..
  • 5王成友,汤叔祺,梁甸农,陈辉煌,唐朝京.语音识别中多种特征信息综合利用的方法[J].声学学报,1997,22(2):111-115. 被引量:6
  • 6Taisuke Itoh, Kazuya Takeda and Fumitada Itakura.Acoustic analysis and recognition of whispered speech. In:Proc. ICASSP, Orlando, Florida, USA, 2002:389-392.
  • 7Robert W. Morris, Mark A. Clements. Reconstruction of speech from whispers. Medical Engineering ~ Physics,2002; 24(8): 515-520.
  • 8Higashikawa M, Nakai K, Sakakura A, Takahashi H. Perceived pitch of whispered vowels-relationship with formant frequencies: a preliminary study. Journal of Voice,1996; 10(2): 155-158.
  • 9Izmirli O. Using a spectral flatness based feature for audio segmentation and retrieval. In: Proc. International Symposium on Music Information Retrieval, Plymouth, USA,2000:100-101.
  • 10Fakotakis N, Sirigos J, Kokkinakis G. High performance text-independent speaker recognition system based on voiced/unvoiced segmentation and multiple neural nets.In: Proc. EUROSPEECH, Budapest, Hungary, 1999:979~982.

引证文献4

二级引证文献41

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部