摘要
为了解决海量数据条件下敏感语音信息检测的问题,设计并实现了一个面向内容的语音检索系统。该系统采用先进的语音识别、语音搜索技术,可以快速、准确地在海量数据中搜索需要的敏感词汇。系统可以作为海量音频条件下信息监测、控制的重要辅助手段。实验证明,系统具有灵活的配制能力与较高的搜索性能。
A content based information retrieval system is developed to detect sensitive information from massive data.The system is designed on the basis of advanced speech technologies,such as speech recognition and voice retrieval.It could find out sensitive words from a large amount of audios fast and precisely.The system could be considered as an important auxiliary means of information monitoring and control for large amount of voice data.Results have shown the system is flexible to configure and has high searching performance.
出处
《信息工程大学学报》
2010年第5期544-548,共5页
Journal of Information Engineering University
关键词
语音搜索
语音识别
关键词检测
spoken information retrieval
speech recognition
spoken term detection