期刊文献+

面向海量数据的语音敏感信息检测系统 被引量:2

Massive Data-Based Sensitive Information Detection System
在线阅读 下载PDF
导出
摘要 为了解决海量数据条件下敏感语音信息检测的问题,设计并实现了一个面向内容的语音检索系统。该系统采用先进的语音识别、语音搜索技术,可以快速、准确地在海量数据中搜索需要的敏感词汇。系统可以作为海量音频条件下信息监测、控制的重要辅助手段。实验证明,系统具有灵活的配制能力与较高的搜索性能。 A content based information retrieval system is developed to detect sensitive information from massive data.The system is designed on the basis of advanced speech technologies,such as speech recognition and voice retrieval.It could find out sensitive words from a large amount of audios fast and precisely.The system could be considered as an important auxiliary means of information monitoring and control for large amount of voice data.Results have shown the system is flexible to configure and has high searching performance.
出处 《信息工程大学学报》 2010年第5期544-548,共5页 Journal of Information Engineering University
关键词 语音搜索 语音识别 关键词检测 spoken information retrieval speech recognition spoken term detection
  • 相关文献

参考文献12

  • 1Rose RC,Paul DB.A hidden Markov model based keyword recognition system[C] //Proc.ICASSP 90.1990:129-132.
  • 2Wilcox LD,Bush MA.Training and search algorithms for an interactive word spotting system[C] //Proc.ICASSP 92.1992,2:97-100.
  • 3Jeanrenaud P,Ng K,Siu M,et al.Gish H Phonetic-based word spotter:Various configurations and application to event spotting[C] //Eurospeech 93.1993:2145-2148.
  • 4Fiscus J G,Ajot J,Garofolo JS,et al.Results of the 2006 Spoken Term Detection Evaluation[C] //2007 SIGIR Workshop on Searching Spontaneous Conversational Speech.2007:45-51.
  • 5Huang X D.Acero A,Hon H W.Spoken Language Processing:A Guide to Theory,Algorithm,and System Development[M].Prentice Hall,2001.
  • 6Young S J,Evermann G,Gales M,et al.The HTK Book (for HTK Version 3.4)[M].University of Cambridge,2006.
  • 7Mangu L,Brill E,Stolcke A.Finding Consensus Among Words:Lattice-Based Word Error Minimization[C] //Eurospeech'99.1999:495-498.
  • 8孟莎,刘加.汉语语音检索的集外词问题与两阶段检索方法[J].中文信息学报,2009,23(6):91-97. 被引量:8
  • 9Wang Dong.Out-of-Vocabulary Spoken Term Detection[D].PhD Thesis,University of Edinburgh,2010.
  • 10Mamou J,Ramabhadran B,Siohan O.Vocabulary independent spoken term detection[C] //Proc.of ACM SIGIR.2007:615-622.

二级参考文献16

  • 1孟莎,余鹏,Frank Seide,刘加.基于后验概率词格的汉语自然对话语音索引[J].清华大学学报(自然科学版),2008,48(S1):673-677. 被引量:2
  • 2周梁,高鹏,丁鹏,徐波.语音识别准确率与检索性能的关联性研究[J].中文信息学报,2006,20(3):99-104. 被引量:2
  • 3M. Saraclar and R. Sproat. Lattice-based Search for Spoken Utterance[C]//Proceeding of Human Language Technology Conference. Boston, 2004: 129-136.
  • 4C. Chelba and A. Acero. Position specific posterior lattices for indexing speech [C]//Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics. Ann Arbor, 2005: 443-450.
  • 5F. Seide, P. Yu and Y. Shi. Towards Spoken-Document Retrieval for the Enterprise: Approximate Word- Lattice Indexing with Text Indexers [C]//Proceeding of IEEE Workshop on Automatic Speech Recognition and Understanding. Kyoto, 2007: 629-634.
  • 6B. Logan, P. Moreno, J. M. Van Tong et al. An Experimental Study of an Audio Indexing System for the Web [C]//Proceeding of Sixth International Conference on Spoken Language Processing. Beijing, 2000: 676-679.
  • 7K. Ng. Subword-Based Approaches for Spoken Document Retrieval [D]. Ph. D. thesis, Massachusetts In- stitute of Technology, 2000.
  • 8P. Yu and F. Seide. A Hybrid Word/Phoneme-based Approach for Improved Vocabulary-independent Search in Spontaneous Speech [C]//Proceeding of Sixth International Conference on Spoken Language Processing, Korean, 2004: 293-296.
  • 9J. Shao, P Yu, Q. Zhao, Y. Yan. F. Seide. Towards Vocabulary-Independent Speech Indexing for Large-Scale Repositories [C]//Proceeding of Inter- speech. Brisbane, 2008:2150-2153.
  • 10H. M. Wang, H. Meng, P. Schone, B. Chen, W. K. Lo. Multi-Scale Audio Indexing for Translingual Spoken Document Retrieval [C]//Proceedings of IEEE Interna- tional Conference on Acoustics, Speech and Signal Processing. Salt Lake City, 2001: 605-608.

共引文献7

同被引文献8

引证文献2

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部