期刊文献+

音频高层语义分析 被引量:4

Semantic-audio Content Analysis at High Level
在线阅读 下载PDF
导出
摘要 为跨越语义鸿沟,提出了一种提取音频中高层语义概念的方法。该方法先用隐马尔可夫模型(HMM)建立对应于分析窗口的低层语义概念,即基本声音语义事件(basic semantic-audio event,BE);然后以音框为单位将声音信号通过短时傅里叶变换及ICA处理来得到对应于HMM模型的可观察符号;接着用贝叶斯决策排除语义窗口对应声音段中的非预定义BE后,按贝叶斯公式所得最大后验概率为准则得到此语义窗口的一个基本声音语义事件组(group of BE,)GBE;最后采用高层语义逻辑定义来描述GBE与高层声音语义概念间的联系,结合由实例训练得到的高层语义逻辑定义最终得到相应语义窗口的高层语义声音概念(high level audio semantic concept,HC)。实验表明此方法能提取与人思维中相似的高层语义概念,在一定程度上可跨越语义鸿沟。 To bridge the semantic gap between audio feature and high-level semantic concept, an approach for semantic- audio content Analysis is presented in this paper. Hidden Markov model(HMM) is trained for modeling BE. In order to extract GsE corresponding to a semantic window, Bayesian decision theory is used to eliminate the analysis window not belonging to any predefined HMM. Then, each of the residual analysis windows within the semantic window is classified to BE class by criterion of maximum Bayesian posterior probability. Ignoring the order and repetition of BE, GSE is got. Logic definition of high level audio semantic concept is the connection of GSE and HC, through which HC can be extracted. The experimental results demonstrate that the proposal approach could extract HC like human thoughts, and could bridge the semantic gap to some degree.
出处 《中国图象图形学报》 CSCD 北大核心 2007年第1期141-147,共7页 Journal of Image and Graphics
基金 国家自然科学基金项目(60273035) 江苏省科技攻关项目(BE2003064)
关键词 声音语义内容分析 高层语义概念 语义视频分析 隐马尔可夫模型 semantic-audio content analysis, high level semantic-concept, semantic-video analysis, HMM
  • 相关文献

参考文献8

  • 1Chu W T,Cheng W H,Wu J L.Generative and discriminative modeling toward semantic context detection in audio tracks[A].In:Proceedings of the 1 1th International Multimedia Modelling Conference,2004.MMM 2005.[C],Melbourne,Australia,2005:38 - 45.
  • 2Umapathy K,Krishnan S,Jimaa S.Multigroup classification of audio signals using time-frequency parameters[J].IEEE Transactions on Multimedia,2005,7(2):308 -315.
  • 3Kim H-G,Moreau N,Sikora T.Audio classification based on MPEG-7 spectral basis representations[J].IEEE Transactions on Circuits and Systems for Video Technology,2004,14(5):716 - 725.
  • 4Cai Rui,Lu Lie,Zhang Hong-jiang,et al.Highlight sound effects detection in audio stream[A].In:2003 IEEE International Conference on Multimedia & Expo (ICME ' 03)[C],Baltimore,Maryland,USA,2003,Ⅲ:37-40.
  • 5ISO/IEC JTC 1/SC 29.Information technology multimedia content description interface-Part 4:Audio[S],15938-4,ISO,June,2001.
  • 6Panagiotakis C,Tziritas G.A speech/music discriminator based on RMS and zero-crossings[J].IEEE Transactions on Multimedia,2005,7(1):155 - 166.
  • 7Rabiner L R.A tutorial on hidden Markov models and selected applications in speech recognition[J].Proceedings of IEEE,1989,77(2):257 -286.
  • 8Richard O.D.,Peter E H,David G S 等著,李宏东等译.模式分类(第二版)[M].北京:机械工业出版社,2003.

同被引文献56

引证文献4

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部