期刊文献+

基于加权PCA的声音指纹降维技术 被引量:5

Dimensionality reduction in audio fingerprint based on weighted PCA
在线阅读 下载PDF
导出
摘要 声音指纹技术现在已经广泛的应用到了歌曲搜索、乐曲识别、声音修复等各个领域,但其关键技术———音频降维技术仍存在分类效果不好、可靠性不高等问题。针对音频数据高维化存在较大随意性,提出了基于模式识别的音频数据高维化的最优方法。并在此基础上,提出了采用加权PCA方法作为声音指纹的降维技术,不仅分类效果大为明显,且由于方法还保持了线性方法的简单性,保证了大批量处理数据成为可能。 Audio fingerprint technology has been widely used in the music searching, melody identification, and sound restoration. However, dimensionality reduction, the key to audio fingerprint technology, still cannot achieve satisfactory classification and reliability. Firstly, this paper introduced an optimal audio-dimensionality-segment method based on pattern recognition. Secondly, weighted PCA(Principal Component Analysis) was suggested as the kernel dimensionality reduction technology in audio fingerprint processing. This method not only enhances the classification of music data, but also keeps the merits of linear dimensionality reduction, simplicity and fast computation, which makes the heavy-data-precessing become feasible.
出处 《计算机应用》 CSCD 北大核心 2006年第9期2250-2254,共5页 journal of Computer Applications
关键词 加权主成分分析 声音指纹 线性降维 weighted PCA(Principal Component Analysis) audio fingerprint linear dimensionality reduction
  • 相关文献

参考文献12

  • 1ZHANG T, KUO CCJ. Heirarehical classiciation of audio data for archiving and retrieving[A]. Proc. IEEE Int. Conf. Acoustics,Speech, and Signal Processing[ C], 1999,6:3001 -3004.
  • 2LU L, JIANG H, ZHANG H. A robust audio classification and segmentation method[ R]. Microsoft Research, Redmond, WA, Tech.Rep., 2001.
  • 3Burges CJC, Platt JC, Jana S. Distortion discriminant analysis for audio fingerprinting[ A]. IEEE Trans. Speech and Audio Processing[ C], 2003, 11:165 - 174.
  • 4FOOTE J. Content-based retrieval of music and audio[ A]. Proc.SPIE[ C], 1997. 138 - 147.
  • 5SEO JS, JIN M, LEE S, et al.Audio fingerprinting based on nor-realized spectral subband centroids[ EB/OL]. http://mmp.kaist.ac. kr/- sunillee/papers/conf-ICASSP-2005_aufing.pdf, 2005.
  • 6DIAMANTARAS K , KUNG S . Principal Component Neural Networks[ M]. New York: Wiley, 1996.
  • 7BURGES C, PLATY J, JANA S. Extracting noise robust features from audio data[A]. Proc. Int. Conf. Acoustics, Speech, Signal Processing[ C], 2002. 1021 - 1024.
  • 8谭璐.高维数据集的结构[D].长沙:国防科技大学,2005.
  • 9谭璐,易东云,冯国柱,吴翊.局部不变投影[J].自然科学进展,2004,14(3):282-287. 被引量:10
  • 10ROWELS ST, SAUL LK. Nonlinear dimensionality reduction by locally, linear embedding[ J]. Science, 2000, 290 ( 12): 2323 -2326.

二级参考文献13

  • 1[1]Donoho D L. High-dimensional data analysis: The curses and blessings of dimensionality. Am Math Soc Conf, Los Angels, 2000http: //www-stat. stanford. edu/~ donoho/Lectures/AMS2000/Curses. pdf
  • 2[2]Enrique F, et al. Lose less coding through the concentration of measure phenomenon. AMS Subject Classification, May 2002http: //www. math. gatech. edu/~ houdre/research/papers/lossless. pdf
  • 3[3]K' egl Bal'azs. Intrinsic dimension estimation using packing numbers. Neural Information Processing Systems, December 2002http: //www. cse. msu. edu/~ lawhiu/manifold
  • 4[4]Belman, et al. Adaptive Control Processes: A Guided Tour.Princeton: Princeton University Press, 2000
  • 5[5]Kevin B, et al. When is' nearest neighbor' meaningful? In: 7th International Conference on Database Theory (ICDT-1999),Jerusalem, Israel, 1999. 217http: //citeseer. nj. nec. com/beyer99when. html
  • 6[6]Chen Tsuhan, et al. Principle component analysis and its variants for biometrics. In: IEEE 2002 International Conference on Image Processing, 2002http: //amp. ece. cmu. edu/Publication/Wende/01037959. pdf
  • 7[7]Griffiths T L, et al. A multidimensional scaling approach to mental multiplication. Memory & Cognition, 2002, 30(1): 97
  • 8[8]Balakrishnama S, et al. Linear discriminant analysis - a brief tutorial. Institute for Signal and Information Processing, March 1998http: //www. isip. msstate. edu/publications/reports/isip- internal/1998/linear- discrim- analysis/Ida- theory- vi. 1. pdf
  • 9[9]Belkin M, et al. Laplacian eigenmaps and spectral techniques for embedding and clustering. Advances in Neural Information Processlng Systems 15, Vancouver, British Columbia, Canada, 2001http: //citeseer. nj. nec. com/belkin021aplacian. html
  • 10[10]Rowels S T, et al. Nonlinear dimensionality reduction by locally linear embedding. Science, 2000, 290:2323

共引文献10

同被引文献43

  • 1谭璐,易东云,吴翊,袁伟.基于非线性降维的图像识别[J].计算机工程,2005,31(13):54-55. 被引量:4
  • 2孟爱国,章登勇,陈志坚,李峰.基于小波包变换和支持向量机的虹膜识别方法[J].计算机工程与设计,2006,27(10):1769-1771. 被引量:2
  • 3钟家强,王润生.基于独立成分分析的多时相遥感图像变化检测[J].电子与信息学报,2006,28(6):994-998. 被引量:30
  • 4John Daugman,How iris recognition works[J].IEEE Transactions on Circuits and Systems for Video Technology,2004,14:21-30.
  • 5Daugman J.The importance of being random:Statistical principles of iris recognition[J].Pattem Recognition,2003,36(2):279- 291.
  • 6MaLi,Tan Tieniu,Wang Yunhong,et al.Efficient iris recognition by characterizing key local variation[J].IEEE Transaction on Image Processing,2004,13:739-750.
  • 7Wildes P.Iris recognition emerging biometric technology[J].Proceeding of IEEE, 1997(5): 1347-1363.
  • 8Mikhail Belkin,Partha Niyogi.Laplacian eigenmaps for dimensionality reduction and data representation[J].Neural Computation,2003,15(6): 1373-1396.
  • 9Mikhail Belkin,Partha Niyogi.Laplacian eigenmaps and spectral techniques for embedding and clustering[C].Vancouver, Bdtish Columbia, Canada:Advances in Neural Information Processing Systems,2001.
  • 10中科院虹膜图像库CASIA iris image database(version2.O)[EB/OL].http://www.sinobiometrics.com.

引证文献5

二级引证文献24

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部