期刊文献+

特定人孤立词的语音识别系统研究 被引量:17

Research on Speaker-Depended Isolated-Word Speech Recognition System
原文传递
导出
摘要 语音识别是让机器自动识别和理解语音信号,并把语音信号转变为相应的文本或命令的技术。通过对特定人孤立词语音特点的研究,在对语音信号进行预处理的过程中,选择过零率与短时平均能量两项指标作为对语音信号端点检测的依据,提取语音线性预测系数,通过计算分析后获得线性预测倒谱系数,作为语音特征参数。选择动态时间规整法为模板匹配算法,并针对传统匹配算法中计算量大的特点,作出改进,采用全局限制的方法以减小匹配过程中的计算量。采用上述算法设计了一种基于特定人的孤立词语音识别系统,并对该系统进行了多种背景条件下的M atlab仿真研究。仿真实验结果表明,此算法对于特点人孤立词的语音识别能达到较好的识别效果。 Voice - identification is a kind of technology that is using computer to transfer the voice signal to an associated text or command by identification and understand. Zero - crossing rate and short - term energy are selected as the basis in the endpoint detecting in preprocessing of speech signal through the studying of the characters of single - word. LPC ( Linear Prediction Coefficient) is extracted from the signal, then cepstral coefficient is obtained as the speech characteristic parameters. DTW ( Dynamic Time Warping) algorithm , which is improved to reduce the amount of data in the matching process by using global constraint, is used for the matching of the model. Based on the algorithm, a speaker- depended isolated -word speech recognition system is designed, and the simulation and analysis is carried on by using Matlab in a variety of background conditions. The experiment shows that the algorithm for speaker - depended isolated- word speech recognition can achieve good results.
出处 《控制工程》 CSCD 北大核心 2011年第3期397-400,404,共5页 Control Engineering of China
基金 国家自然科学基金资助项目(60443008)
关键词 语音识别 线性预测倒谱 动态规划 动态时间规整 voice - identification LPCC DP DTW
  • 相关文献

参考文献7

  • 1Deller John R, Proakis John G, Hansen John H L. Discrete-Time Processing of Speech Signals[ M ]. Macmillan Publishing Company, 1993.
  • 2Fei W C, Bai L. Pattern recognition method for size series of cocoon filament [ J ]. Japan Silk Science and Technology, 2005,14 ( 9 ) : 81-85.
  • 3Lau Y K, Chan C. Speech recognition based on zero-crossing rate and energy. IEEE Trans. ASSP,1985,33- 1 ) :320-323.
  • 4Sambur M R, Rabiner L R. A Speaker Independent Digit Recognition System. BSTJ,1975,54( 1 ) :81-102.
  • 5张杰,黄志同,王晓兰.语音识别中隐马尔可夫模型状态数的选取原则及研究[J].计算机工程与应用,2000,36(1):67-69. 被引量:21
  • 6Goertzel G. An Algorithm for the Evaluation of Finite Trigonometric Series. American Math Monthly, 1958.65 ( 1 ) : 34 -35.
  • 7万春,黄杰圣,曹煦晖.基于DTW的孤立词语音识别研究和算法改进[J].计算机与现代化,2003(11):4-6. 被引量:8

二级参考文献4

  • 1Deller John R, Proakis John G, Hansen John H L. Discrete-Time Processing of Speech Signals [ M ]. Macmillan Publishing Company, 1993.
  • 2Kondoz A M. Digital Speech( Coding for low bit rate communication systems) [ M]. University of Surrey, UK, 1994.
  • 3常迥,信息理论基础,1993年
  • 4万春.基于DTW的语音识别应用系统研究与实现[J].集美大学学报(自然科学版),2002,7(2):104-108. 被引量:19

共引文献27

同被引文献131

引证文献17

二级引证文献69

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部