期刊文献+

非特定人孤立词语音识别系统的片上实现 被引量:10

On chip realization of speaker-independent isolated word speech recognizer
在线阅读 下载PDF
导出
摘要 在SEED-DEC5502DSP嵌入式系统开发平台上实现了一个面向非特定人的孤立词语音识别系统,和传统的基于特定人的语音识别系统相比,该系统无需用户训练,易于使用。系统采用改进的基于语音对数域能量变化率的实时端点检测算法,只对检测的有声段语音进行特征提取,从而减少了要处理的语音帧数;提出了改进的共享声学单元状态发射概率共享的解码策略,进一步降低了计算负担。实验表明系统在100词条的情况下识别率达到98.1%,识别时间为1.03倍实时。 An embedded speaker-independent isolated word speech recognition system is designed and realized in the SEED- DEC5502 EVM platform.Compared with the speaker-dependent system,the speaker-independent recognition technique cannot requires training by the users and easy to use.With the help of a modified real time Voice Activity Detection algorithm(VAD) based on the log-energy acceleration associated with voice onset,we only perform feature extraction to the active voice and decrease the frames of processing.To further decrease the computational loads,A modified decoding strategy based on the share the uint states emission probabilities (SUSEP) is also presented.Test on 100 words vocabulary shows that system provides a recognition accuracy rate of 98.1% using only 1.03 times of real time.
出处 《计算机工程与应用》 CSCD 北大核心 2007年第13期194-196,共3页 Computer Engineering and Applications
基金 河北省教育厅自然科学指令计划(No.2005340) 河北省科技厅科技发展规划指导计划(No.052135147)。
关键词 语音识别 嵌入式系统 端点检测 发射概率 speech recognition embedded system speech endpoint detect state emission probabilities
  • 相关文献

参考文献6

  • 1Gong Y F,Kao Y H,Implementing a high accuracy speaker-independent continuous speech recognizer on a fixed-point DSP[C]//Proc ICASSP' 00,2000 : 3686-3689.
  • 2Kao Y H.Rajasekaran P K.A low cost dynamic vocabulary speech recognizer on a gpp-dsp system[C]//Proc ICASSP'00,2000:3215-3218.
  • 3杜利民,谢凌云,刘斌.HMM非特定人连续语音识别的嵌入式实现[J].电子与信息学报,2005,27(1):60-63. 被引量:6
  • 4王志强.孤立词语音识别系统关键问题的研究[D].北京:北京邮电大学.2004.
  • 5ETSI standard,ES 202 212 v 1.1.1,2003.11 Distributed speech recognition;speech processing,transmission and quality aspect[S].
  • 6Rogina F J.The bucket box intersection(BBI) algorithm for fast appl,oximative evaluation of diagonal mixture Gaussians[C]//Proc ICASSP,1996 : 837-840.

二级参考文献4

  • 1Du Limin, Feng Junlan, Song Yi, Sun Jinchen. A Chinese-English speech translation prototype system: CEST-CAS1.0.ICSPAT'99, Orlando, USA, 1999.
  • 2Du Limin, Feng Junlan, Song Yi, Wang Heng. Speech translation on internet CEST-CAS2.0. Proc. of ISIMP2001, Hong Kong,2001: 189- 192.
  • 3Rabiner L, Juang B H. Fundamentals of Speech Recognition.New Jersey, USA, Prentice Hall, 1993:350 - 352.
  • 4Ney H, Ortmanns S. Dynamic programming search for continuous speech recognition. IEEE Signal Processing Magazine, 1999, 16(5): 64 - 83.

共引文献5

同被引文献56

引证文献10

二级引证文献78

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部