期刊文献+

基于音频分析的视频场景检测 被引量:3

Inspection of Video Frequency Scene Based on Audio Frequency Analysis
在线阅读 下载PDF
导出
摘要 目前场景检测的研究,主要是基于图像和视频。但音频同样具有丰富的场景信息,基于音频分析的计算量是比较少的,对自动或者半自动的场景检测,基于音频分析的方法也是更为让用户接受的。可以把基于音频分析的方法作为视频场景检测的辅助手段,以获得更为准确的场景检测和分割。该文提出了一个基于内容的音频分析系统,对视频序列实现基于音频分析的场景检测和分割。该系统能有效的解决许多诸如图像变化了,而实际场景并未变化的情形。且本系统整体运算复杂度较基于视频/图像的场景检测与分割系统要低。 The present study on scene detection is mainly based on image and video sequence. Audio sequence, however, also has rich information. Therefore, a method based on audio analysis can be applied as an auxiliary, so as to get more accurate scene detection and split. In this paper, a content - based audio data analysis system is proposed, which accepts video sequence and implements scene detection and segmentation from audio part. The system can solve problems such as changing of image without changing of scene, and the complexity of operation in this system is much lower than that of the system based on image and video sequence analysis.
出处 《计算机仿真》 CSCD 2006年第8期184-187,195,共5页 Computer Simulation
关键词 音频分析 场景检测 音频特征 隐马尔可夫模型 Audio analysis Scene detection Audio feature Hidden markov model
  • 相关文献

参考文献8

  • 1J Saunders.Real-Time Discrimination of Broadcast Speech/Music[C].Proc.ICASSP'96,Atlanta,May,1996,II:993-996.
  • 2E Scheirer,M Slaney.Construction and Evaluation of a Robust Multifeature Speech/Music Discrimination[C].Proc.ICASSP'97.Munich,Germany,April,1997.
  • 3A Ghias.Query By Humming-Musical Information Retrieval in An Audio Database[C].Proc.ACM Multimedia Conference,Anaheim,CA,1995.231-235.
  • 4J Foote.Content-Based Retrieval of Music and Audio[A],Proc.SPIE'97[C].Dallas,1997.
  • 5Z Liu,et al.Audio Feature Extraction and Analysis for Scene Classification[C].Proc.Of IEEE 1st Multimedia Workshop,1997.
  • 6D Kimber,L Wilcox.Acoustic Segmentation for Audio Browsers[C].Proc.Interface Conference.Sydney,Australia,July,1996.
  • 7Tong Zhang,C C JayKuo.Content-based classification andretrieval of audio[C].SPIE's 43rd Annual Meeting-Conference on advanced Signal Processing Algorithms,Architectures,and Implementations[J].VII,SPIE San Diego,July 1998,3461:432-443.
  • 8L Rabinar,B Juang.Fundamentals of Speech Recognition[M].New Jersey:Prentice-Hall,Inc.,1993.

同被引文献108

引证文献3

二级引证文献34

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部