期刊文献+

内窥镜自动定位语音识别系统 被引量:2

Speech recognition for endoscopic automatic positioning system
在线阅读 下载PDF
导出
摘要 提出一种基于特定人的内窥镜自动定位语音识别系统,通过识别特定医生的语音控制口令实现内窥镜的定位,为手持内窥镜操作提供更加智能化的解决方案。在识别算法上提出了参考模板归一化平均的动态时间规划(Normalized Average-Dynamic Time Warping,NA-DTW)算法,可获得更高的识别率,系统以片上Windows CE操作系统和ARM作为系统的软硬件平台。实验通过对10个不同测试人的共1 250组测试数据进行识别检测,NA-DTW算法与传统DTW算法相比,识别率从96.6%提高到99.76%,运算时间从469 ms缩短到241 ms。验证了NA-DTW算法可以完成基于特定人、孤立词的语音识别功能,并满足嵌入式系统中的实时检测条件。 A novel system for minimally invasive surgery is presented in this paper. The system utilizes an Endoscopic Automatic Positioner(EAP)controlled by speech recognition engine to implement the clamping and dynamical positioning of the laparoscope. The motion instructions of the EAP are transformed from voice commands of specific doctor recog-nized by speaker dependent speech recognition algorithm named Dynamic Time Warping(DTW). The DTW recognizes particular commands and rejects irrelevant items by enhancing the performance of the reference template. An ARM-core embedded platform is designed to run the DTW on Windows CE operating system. And on that basis, the performance of DTW is demonstrated by 1250 groups of experiments from 10 individual speakers. Compared with the traditional algo-rithm, the enhanced algorithm can improve the recognition rate by 3.16%and shorten the time of calculation by 51%. The results demonstrate the availability of the enhanced algorithm and its ability to satisfy the real time requirement in embed-ded system.
出处 《计算机工程与应用》 CSCD 2014年第8期207-210,共4页 Computer Engineering and Applications
基金 国家自然科学基金仪器专项(No.60827005)
关键词 内窥镜 动态时间规整 参考模板 特定人 嵌入式系统 endoscopic dynamic time warping reference template speaker dependent embedded system
  • 相关文献

参考文献8

  • 1Schuller B, Can S, Feussner H, et al.Speech control in surgery: a field analysis and strategies[C]//Proc ICME, 2009: 1214-1217.
  • 2Zhang Yuxin, Miyanaga Y.An improved dynamic time warp- ing algorithm employing nonlinear median filtering[C]// IEEE Conferences on Communications and Information Technologies(ISCIT), Hangzhou, 2011 : 439-442.
  • 3Zaharia T, Segarceanu S, Cotescu M, et al.Quantized Dynamic Time Warping (DTW) algorithm[C]//The 8th International Conference on Communications(COMM), Bucharest, 2010: 91-94.
  • 4Myers C, Rabinar L R, Rosenberg A E.Performance trade- offs in dynamic time warping algorithm for isolated word recognition[J].IEEE Transactions on Acoustics, Speech and Signal Processing, 1980,28 : 623-635.
  • 5Abdulla W, Chow D, Sin G.Cross-words reference tem- plate for DTW-based speech recognition systems[C]// IEEE TENCON, Conferences on Convergent Technolo- gies for Asia-Pacific Region,Bangalore,2003:1576-1579.
  • 6汪兵 李存斌 陈鹏.EVC高级编程及其应用开发[M].北京:中国水利水电出版社,2005..
  • 7李振静,王国胤,杨勇,罗飞.基于谱熵噪声估计的改进减谱法[J].计算机工程,2009,35(18):164-166. 被引量:5
  • 8张震,王化清.语音信号特征提取中Mel倒谱系MFCC的改进算法[J].计算机工程与应用,2008,44(22):54-55. 被引量:29

二级参考文献8

  • 1张雄伟,陈亮,杨吉斌.现代语音技术及应用[M].北京:机械工业出版社.2003.
  • 2Fakhr W,Salam A A,Hamdy N.Enhancement of mismatched conditions in speaker recognition for multimedia applications [J].IEEE International Conference on Acoustics,Speech,and Signal Processing, 2004.
  • 3Shen Jialin, Jeihweih H, Linshan L. Robust Entropy-based Endpoint Detection for Speech Recognition in Noisy Environments[C]//Proc. of ICSLP'98. Sydney, Australia: ASSTA, 1998.
  • 4Kitaoka N, Yamamoto K, Kusmizu T, et al. Development of VAD Evaluation Framework CENSREC-I-C and Investigation of Relationship between VAD and Speech Recognition Performance[C]//Proc. of ASRU'07. Kyoto, Japan: IEEE Signal Processing Society, 2007.
  • 5中国科学院自动化所.CASIA汉语情感语料库[Z].http://www.chineseldc.org.2005.
  • 6王炳锡,屈丹,彭煊.实用语音识别基础[M].北京:国防工业出版社,2004:264-286.
  • 7马义德,邱秀清,陈昱莅,刘映杰,朱敬锋.改进的基于听觉掩蔽特性的语音增强[J].电子科技大学学报,2008,37(2):255-257. 被引量:8
  • 8邵央,刘丙哲,李宗葛.基于MFCC和加权矢量量化的说话人识别系统[J].计算机工程与应用,2002,38(5):127-128. 被引量:34

共引文献189

同被引文献16

引证文献2

二级引证文献16

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部