期刊文献+

基于K-L距离的两步固定音频检索方法 被引量:8

Two-stage Specific Audio Retrieval Method Based on K-L Distance
在线阅读 下载PDF
导出
摘要 根据音频文件数据量大、数据间存在一定相关性的特点,提出一种基于K-L距离的两步固定音频检索方法。该方法采用基于可变门限的直方图检索方法快速筛选出相似度较高的语音文件,利用特征矩阵的K-L距离对剩余语音进行精确比较,取得较好的效果。实验结果证明,该方法能使检索准确率达到90%左右。 Due to the huge amount of audio data,and some relation among them,this paper proposes a two-stage specific audio retrieval method based on K-L Distance.The method uses histogram retrieval method based on variable threshold to choose audio file of high similarity,compares precisely with residual audio using K-L distance of feature matrix,and obtains good effect.Experimental results show that the retrieval accuracy is over 90%.
出处 《计算机工程》 CAS CSCD 北大核心 2011年第19期160-162,共3页 Computer Engineering
基金 国家"863"计划基金资助项目(2008AA011002)
关键词 固定音频检索 过零率 直方图 美尔频率倒谱系数 K-L距离 specific audio retrieval Zero Crossing Rate(ZCR) histogram Mel Frequency Cepstral Coefficient(MFCC) K-L distance
  • 相关文献

参考文献10

  • 1Hanesn J H L, Huang Rongqing. Speech Find: Advances in Spoken Document Retrieval for a National Gallery of the Spoken Word[J]. IEEE Transactions on Speech and Audio Processing, 2005, 13(5): 712-730.
  • 2Chechil G, Le E, Rehn M, et al. Large Scale Content Based Audio Retrieval from Text Queries[C]//Proceedings of the 1st ACM International Conference on Multimedia Information Retrieval. New York, USA: ACM Press, 2008: 105-112.
  • 3张卫强,刘加.网络音频数据检索技术[J].通信学报,2007,28(12):152-155. 被引量:10
  • 4张卫强,刘加,陈恩庆.一种基于仿生模式识别思想的固定音频检索方法[J].自然科学进展,2008,18(7):808-813. 被引量:7
  • 5Smith G, Murase H, Kashino K. Quick Audio Retrieval Using Active Search[C]//Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing. New York, USA: IEEE Press, 1998: 3777-3780.
  • 6Kashino K, Kurozumi T, Murase H. A Quick Search Method for Audio and Video Signals Based on Histogram Pruning[J]. IEEE Transactions on Multimedia, 2003, 5(3): 384-357.
  • 7Kedem B. Spectral Analysis and Discrimination by Zero- crossings[J]. Proceedings of the IEEE, 1986, 74(11): 1477-1493.
  • 8Saunders J. Real-time Discrimination of Broadcast Speech Music[C]//Proceedings of IEEE ICASSP’96. [S. 1.]: IEEE Press, 1996: 993-996.
  • 9Li S Z. Content-based Classification and Retrieval of Audio Using the Nearest Feature Line Method[J]. IEEE Trans. on Speech Audio Processing, 2000, 8(5): 619-625.
  • 10江星华,李应.基于LPCMCC的音频数据检索方法[J].计算机工程,2009,35(11):246-247. 被引量:5

二级参考文献26

  • 1张成,蒋皓石,林嘉宇.基于16位单片机的语音电子门锁系统[J].电子技术应用,2005,31(7):18-21. 被引量:9
  • 2王守觉,潘晓霞,徐春燕,陈旭,安冬,曹文明.一种基于高维空间覆盖动态搜索方法的非特定人连续数字语音识别的研究[J].电子学报,2005,33(10):1790-1793. 被引量:7
  • 3Wang Y, Liu Z, Huang JC. Multimedia content analysis-using both audio and visual clues. IEEE Signal Processing Magazine, 2000, 17(6): 12-36
  • 4Foote J. An overview of audio information retrieval. Multimedia Systems, 1999, 7(1):2-10
  • 5Hansen JHL, Huang R, Zhou B, et al. Speechfind.. Advances in spoken document retrieval for a national gallery of the spoken word. IEEE Transactions on Speech and Audio Processing, 2005, 13(5): 712-730
  • 6Kashino K, KurozumiT, Murase H. A quick search method for audio and video signals based on histogram pruning. IEEE Transactions on Multimedia, 2003, 5(3) : 348-357
  • 7Kim KM, Kim SY, Jeon JK, et al. Quick audio retrieval using multiple feature vectors. IEEE Transactions on Consumer Electronics, 2006, 52(1): 200-205
  • 8Zhang WQ, Liu J. Two-stage method for specific audio retrieval. IEEE International Conference on Acoustics, Speech, and Signa Processing(ICASSP), Hawaii, 2007. New Jersey: IEEE Press 2007, Ⅳ 85-88
  • 9Wang SJ, Liu YY. An algorithm for removing facial makeup disturbances based on high dimensional imaginal geometry. Chinese Journal of Electronics, 2006, 15(4A): 789-792
  • 10Haykin S著,宋铁成,等译.通信系统.北京:电子工业出版社,2003,56-58

共引文献15

同被引文献69

  • 1熊福生.对数伽玛与负对数伽玛分布的再生性[J].经济数学,2003,20(4):63-69. 被引量:10
  • 2郑贵滨,韩纪庆,李海峰,郑铁然.基于分段的实时声频检索方法[J].声学学报,2006,31(2):101-108. 被引量:5
  • 3李超,熊璋,朱成军.基于距离相关图的音频相似性度量方法[J].北京航空航天大学学报,2006,32(2):224-227. 被引量:7
  • 4蔡择林,李开灿.常见分布的最大Kullback-Leibler距离[J].武汉大学学报(理学版),2007,53(5):513-517. 被引量:12
  • 5周颀.基于音频匹配的广告智能建波系统[D].南京:南京理工大学,2013.
  • 6Pruzansky S.Pattern-matching procedure for automatictalker recognition[J].The Journal of the Acoustical Societyof America,1963,50:637-655.
  • 7Atal B S.Automatic speaker recognition based on pitchcontour[D].Brooklyn:Polytechnic Inst,1968.
  • 8Doddington G R.A new method of speaker verification[J].The Journal of the Acoustical Society of America,1971,139(A).
  • 9Itakura F.Line spectrum representation of linear predictivecoefficients[J].The Journal of the Acoustical Societyof Japan,1975,75(S).
  • 10Colombi J M,Ruck D W,Anderson T R,et al.Cohortselection and word grammar effects for speaker recognition[C]//IEEE International Conference on Acoustics,Speech,and Signal Processing,1996:85-88.

引证文献8

二级引证文献19

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部