期刊文献+

结合关键词混淆网络的关键词检出系统 被引量:2

Research of keyword spotting based on a keyword spotting confusion network
在线阅读 下载PDF
导出
摘要 为了高效地从大词汇量连续语音识别(LVCSR)的多候选中得到关键词结果,保证最小词错误率,提出了将混淆网络的思想应用到关键词检出系统中.在传统混淆网络生成方法基础上,提出一种改进的更加适合于关键词检出的关键词混淆网络作为关键词检出的中间结构,该方法只对所有关键词竞争候选生成带有得分标记的关键词混淆网络,突出候选之间竞争关系,并根据得分标记确定关键词.与传统的N-best作为中间结构的关键词检出系统比较,基于混淆网络的关键词检出系统的召回率为87.11%,提高了21.65%.实验表明,在提高召回率的同时,所提方法具有关键词直接定位的特点,因此具有较低的时间开销. In order to achieve a higher keyword recall rate from large vocabulary continuous speech recognition (LVCSR) and minimize the word error rate, a confusion network was used in a keyword spotting system. Moreover, an improved method of generating a keyword confusion network which was more suitable for keyword spotting was proposed based on the traditional algorithm. This method only focused on keyword competitions, and was capa- ble of transforming all the key-word competitions into a confusion network with a marked score, and highlighted com- petitions to all the candidates. Compared with the traditional keyword spotting system which uses N-best as the me- dium structure, the proposed method increased the recall rate of confusion network to 87.11% ; compared with the keyword spotting system based on N-best, there is a 21.65% improvement in the recall rate. Experiments show the proposed method could locate keywords directly, besides increasing the recall rate, so the system costs less time.
出处 《智能系统学报》 2010年第5期432-435,共4页 CAAI Transactions on Intelligent Systems
基金 国家自然科学基金资助项目(60702053) 黑龙江省青年骨干教师支持计划资助项目(1155G17)
关键词 关键词检出 混淆网络 语音识别 keyword spotting confusion network speech recognition
  • 相关文献

参考文献9

  • 1叶靓,王智斌,邵谦明.基于相关反馈的语音检索引擎[J].计算机工程,2007,33(17):228-230. 被引量:2
  • 2王让定,袁旭海,徐霁.一种新颖的混合语音检索算法[J].计算机应用研究,2008,25(5):1349-1351. 被引量:1
  • 3陈立伟,宋宪晨,章东升,杨洪利.一种基于优化小波神经网络的语音识别[J].应用科技,2008,35(2):17-20. 被引量:3
  • 4郑铁然,韩纪庆.汉语语音检索中基于音节的索引方法研究[C]//第八届全国人机语音通讯学术会议论文集.北京,中国,2005:419-424.
  • 5MANGU L, BRILL E, STOLCKE A. Finding consensus in speech recognition: word error minimization and other applications of confusion networks [ J ]. Computer Speech and Language, 2000, 14(4): 373-400.
  • 6XUE Jian, ZHAO Yunxin. Improved confusion network algorithm and shortest path search from word Lattice [ C ]// Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. Philadelphia, USA, 2005: 853 -856.
  • 7ZHANG Pengyuan, SHAO Jian, ZHAO Qingwei, et al. Keyword spotting based on syllable confusion network [ C ]//The Third International Conference on Natural Computing. Haikou, China, 2007: 656-659.
  • 8YONG S, EVERMANN G, GALES M. The HTK book (for HTK 3.3 ) [ EB/OL ]. [ 2009-11-25]. Http: //htk. eng. cam. ac. uk.
  • 9GOODMAN J T. A bit of progress in language modeling[ J]. Computer Speech and Language, 2001, 15(4) : 403434.

二级参考文献22

  • 1Patel N,Sethi I.Audio Characterization for Video Indexing[C]// Proceedings of the SPIE on Storage and Retrieval for Still Image and Video Databases.1996.
  • 2Foote J.An Overview of Audio Information Retrieval[J].Multimedia Systems,1999,7(1).
  • 3Christian S,Emmanuel F.Soundspotter--A Prototype System for Content-based Audio Retrieval[C]//Proc.of Digital Audio Effects.2002.
  • 4Baeza-Yates R,Ribeiro-Neto B.Modern Information Retrieval[M].[S.l.]:Addison Wesley,1999.
  • 5Willie W,Paul L,Philip K,et al.Sphinx-4:A Flexible Open Source Framework for Speech Recognition[Z].(2006-05).http:// cmusphinx.sourceforge.net/sphinx4/doc/Sphinx4Whitepaper.pdf.
  • 6Erik H,Otis G.Lucene in Action[M].[S.l.]:Manning Publishing,2004.
  • 7SHI Y, EBERHART R C. A modified swarm optimizer [ C ]//IEEE International Conference on Evolutionary Computation. Anchorage, USA, 1998.
  • 8CLERK M, KENNEDY J. The particle swarm-explosion, stability, and convergence in a multidimensional complex space [ J ]. IEEE Transaction on Evolutionary Computer, 2002, 6( 1 ) : 58-73.
  • 9EBERHART R C, SHI Y. Particle swarm optimization: developments, applications and resources [ C ]//Proc Congress on Evolutionary Computation 2001, Piscataway, USA IEEE press, 2001.
  • 10KENNEDY J, EBERHART R C. Particle swarm optimization [ C ]//IEEE International Conference on Neural Networks. [ s. 1 ] , 1995.

共引文献3

同被引文献20

引证文献2

二级引证文献25

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部