基于加权PCA的声音指纹降维技术被引量：5

Dimensionality reduction in audio fingerprint based on weighted PCA

下载PDF

导出

摘要声音指纹技术现在已经广泛的应用到了歌曲搜索、乐曲识别、声音修复等各个领域,但其关键技术———音频降维技术仍存在分类效果不好、可靠性不高等问题。针对音频数据高维化存在较大随意性,提出了基于模式识别的音频数据高维化的最优方法。并在此基础上,提出了采用加权PCA方法作为声音指纹的降维技术,不仅分类效果大为明显,且由于方法还保持了线性方法的简单性,保证了大批量处理数据成为可能。 Audio fingerprint technology has been widely used in the music searching, melody identification, and sound restoration. However, dimensionality reduction, the key to audio fingerprint technology, still cannot achieve satisfactory classification and reliability. Firstly, this paper introduced an optimal audio-dimensionality-segment method based on pattern recognition. Secondly, weighted PCA（Principal Component Analysis） was suggested as the kernel dimensionality reduction technology in audio fingerprint processing. This method not only enhances the classification of music data, but also keeps the merits of linear dimensionality reduction, simplicity and fast computation, which makes the heavy-data-precessing become feasible.

作者胡永刚吴翊卜江

机构地区国防科学技术大学数学与系统科学系

出处《计算机应用》 CSCD 北大核心 2006年第9期2250-2254,共5页 journal of Computer Applications

关键词加权主成分分析声音指纹线性降维 weighted PCA（Principal Component Analysis） audio fingerprint linear dimensionality reduction

分类号 TP181 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献12

1ZHANG T, KUO CCJ. Heirarehical classiciation of audio data for archiving and retrieving[A]. Proc. IEEE Int. Conf. Acoustics,Speech, and Signal Processing[ C], 1999,6:3001 -3004.
2LU L, JIANG H, ZHANG H. A robust audio classification and segmentation method[ R]. Microsoft Research, Redmond, WA, Tech.Rep., 2001.
3Burges CJC, Platt JC, Jana S. Distortion discriminant analysis for audio fingerprinting[ A]. IEEE Trans. Speech and Audio Processing[ C], 2003, 11:165 - 174.
4FOOTE J. Content-based retrieval of music and audio[ A]. Proc.SPIE[ C], 1997. 138 - 147.
5SEO JS, JIN M, LEE S, et al.Audio fingerprinting based on nor-realized spectral subband centroids[ EB/OL]. http://mmp.kaist.ac. kr/- sunillee/papers/conf-ICASSP-2005_aufing.pdf, 2005.
6DIAMANTARAS K , KUNG S . Principal Component Neural Networks[ M]. New York: Wiley, 1996.
7BURGES C, PLATY J, JANA S. Extracting noise robust features from audio data[A]. Proc. Int. Conf. Acoustics, Speech, Signal Processing[ C], 2002. 1021 - 1024.
8谭璐．高维数据集的结构[D]．长沙：国防科技大学，2005．
9谭璐,易东云,冯国柱,吴翊.局部不变投影[J].自然科学进展,2004,14(3):282-287. 被引量：10
10ROWELS ST, SAUL LK. Nonlinear dimensionality reduction by locally, linear embedding[ J]. Science, 2000, 290 ( 12): 2323 -2326.

二级参考文献13

1[1]Donoho D L. High-dimensional data analysis: The curses and blessings of dimensionality. Am Math Soc Conf, Los Angels, 2000http: //www-stat. stanford. edu/～ donoho/Lectures/AMS2000/Curses. pdf
2[2]Enrique F, et al. Lose less coding through the concentration of measure phenomenon. AMS Subject Classification, May 2002http: //www. math. gatech. edu/～ houdre/research/papers/lossless. pdf
3[3]K' egl Bal'azs. Intrinsic dimension estimation using packing numbers. Neural Information Processing Systems, December 2002http: //www. cse. msu. edu/～ lawhiu/manifold
4[4]Belman, et al. Adaptive Control Processes: A Guided Tour.Princeton: Princeton University Press, 2000
5[5]Kevin B, et al. When is' nearest neighbor' meaningful? In: 7th International Conference on Database Theory (ICDT-1999),Jerusalem, Israel, 1999. 217http: //citeseer. nj. nec. com/beyer99when. html
6[6]Chen Tsuhan, et al. Principle component analysis and its variants for biometrics. In: IEEE 2002 International Conference on Image Processing, 2002http: //amp. ece. cmu. edu/Publication/Wende/01037959. pdf
7[7]Griffiths T L, et al. A multidimensional scaling approach to mental multiplication. Memory & Cognition, 2002, 30(1): 97
8[8]Balakrishnama S, et al. Linear discriminant analysis - a brief tutorial. Institute for Signal and Information Processing, March 1998http: //www. isip. msstate. edu/publications/reports/isip- internal/1998/linear- discrim- analysis/Ida- theory- vi. 1. pdf
9[9]Belkin M, et al. Laplacian eigenmaps and spectral techniques for embedding and clustering. Advances in Neural Information Processlng Systems 15, Vancouver, British Columbia, Canada, 2001http: //citeseer. nj. nec. com/belkin021aplacian. html
10[10]Rowels S T, et al. Nonlinear dimensionality reduction by locally linear embedding. Science, 2000, 290:2323

共引文献10

1谭璐,易东云,吴翊,袁伟.基于非线性降维的图像识别[J].计算机工程,2005,31(13):54-55. 被引量：4
2胡永刚,吴翊,王洪志,卜江.高维数据降维的DCT变换[J].计算机工程与应用,2006,42(32):21-23. 被引量：9
3黄启宏,刘钊.流形学习中非线性维数约简方法概述[J].计算机应用研究,2007,24(11):19-25. 被引量：24
4欧海英,张为华,赵经成,付战平.非线性主轴降维映射法在固体火箭发动机设计优化中的应用[J].推进技术,2007,28(4):346-351. 被引量：2
5涂腾涛,顾嗣扬.基于非线性降维的人脸识别新算法[J].计算机应用,2008,28(8):2030-2032.
6高小方.流形学习方法中的若干问题分析[J].计算机科学,2009,36(4):25-28. 被引量：15
7张付志,张启凤.一种改进的基于流形对齐的协同过滤算法[J].模式识别与人工智能,2009,22(4):614-618.
8张付志,张启凤.融合多系统用户信息的协同过滤算法[J].计算机工程,2009,35(21):258-260. 被引量：2
9黄玮.高光谱遥感分类与信息提取综述[J].数字技术与应用,2010,28(5):134-136. 被引量：8
10李伟生,张勤.基于局部线性嵌入和Haar小波的人脸识别方法[J].计算机工程与应用,2011,47(4):181-184. 被引量：9

同被引文献43

1谭璐,易东云,吴翊,袁伟.基于非线性降维的图像识别[J].计算机工程,2005,31(13):54-55. 被引量：4
2孟爱国,章登勇,陈志坚,李峰.基于小波包变换和支持向量机的虹膜识别方法[J].计算机工程与设计,2006,27(10):1769-1771. 被引量：2
3钟家强,王润生.基于独立成分分析的多时相遥感图像变化检测[J].电子与信息学报,2006,28(6):994-998. 被引量：30
4John Daugman,How iris recognition works[J].IEEE Transactions on Circuits and Systems for Video Technology,2004,14:21-30.
5Daugman J.The importance of being random:Statistical principles of iris recognition[J].Pattem Recognition,2003,36(2):279- 291.
6MaLi,Tan Tieniu,Wang Yunhong,et al.Efficient iris recognition by characterizing key local variation[J].IEEE Transaction on Image Processing,2004,13:739-750.
7Wildes P.Iris recognition emerging biometric technology[J].Proceeding of IEEE, 1997(5): 1347-1363.
8Mikhail Belkin,Partha Niyogi.Laplacian eigenmaps for dimensionality reduction and data representation[J].Neural Computation,2003,15(6): 1373-1396.
9Mikhail Belkin,Partha Niyogi.Laplacian eigenmaps and spectral techniques for embedding and clustering[C].Vancouver, Bdtish Columbia, Canada:Advances in Neural Information Processing Systems,2001.
10中科院虹膜图像库CASIA iris image database(version2.O)[EB/OL].http://www.sinobiometrics.com.

引证文献5

1刘爱林,张天桥.基于非线性降维的虹膜识别方法[J].计算机工程与设计,2009,30(10):2442-2443. 被引量：2
2关耀铧,申凌,吴云,赵勇.音频指纹搜索中数据预处理的改进算法[J].计算机工程与应用,2010,46(21):145-147. 被引量：1
3高欣.基于投影和主成分分析的偏心检测[J].电子测试,2010,21(10):19-22. 被引量：2
4张兴忠,王运生,曾智,牛保宁.一种高效过滤提纯音频大数据检索方法[J].计算机研究与发展,2015,52(9):2025-2032. 被引量：8
5赵桂儒.较大规模数据应用PCA降维的一种方法[J].电脑知识与技术（过刊）,2014,20(3X):1835-1837. 被引量：11

二级引证文献24

1张伟平,马宏伟,李培华.改进的互相关虹膜匹配算法[J].计算机工程与设计,2010,31(15):3476-3479.
2蔚慧甜,薛迎,杨海,赵榉云,周学威.三维CT变螺距螺旋投影数据模拟方法[J].电子测试,2011,22(6):19-21.
3魏选平,安石,孟庆勋,王晓林.锁相环工作原理及仿真分析[J].电子测试,2011,22(6):50-53. 被引量：4
4贾振华,庄连英.基于切空间判别的稀疏数据降维方法[J].计算机工程与设计,2012,33(11):4268-4271.
5沈崇德,童思木.医院智能语音客户服务系统的创新研究与应用示范[J].中国医学装备,2013,10(1):71-73. 被引量：7
6宋卫华,张青.PCA算法在图像特征降维中的应用研究[J].黄山学院学报,2014,16(5):20-22. 被引量：4
7王晓英.海量冗余数据干扰下数据库中数据优化检索方法[J].华侨大学学报（自然科学版）,2016,37(6):758-761. 被引量：7
8万晓桐.出版行业的数据信息资源优化管理方法研究[J].计算机仿真,2017,34(4):323-326. 被引量：3
9张虹.数据库中工业产品资源信息准确定位仿真[J].计算机仿真,2017,34(10):406-409. 被引量：1
10赵欢,刘旭红,李宁.基于SVM的汉字字体识别研究[J].北京信息科技大学学报（自然科学版）,2017,32(5):62-66. 被引量：3

1王德芬,高建强,李莉.基于中值PCA和加权PCA数据分类的研究[J].信息技术,2014,38(2):14-18. 被引量：3
2王进军,王汇源.基于WPCA和修正的最大间距准则的人脸识别[J].计算机工程与应用,2010,46(5):151-153. 被引量：2
3高晓红.巧借工具,找出另类重复文件[J].电脑知识与技术（经验技巧）,2012(5):31-33.
4李嘉頔,陈振学,刘成云.分块CS-LBP和加权PCA的低分辨率人脸识别[J].光电子．激光,2016,27(2):210-216. 被引量：12
5方陵生.“声音指纹”识别技术[J].世界科学,2015,0(3):40-40. 被引量：1
6乔蕊,李靖.利用FW-PCA检测遮挡区域的人脸识别[J].量子电子学报,2015,32(3):270-277. 被引量：1
7曹林,杜康宁.基于加权PCA的人眼定位算法[J].北京信息科技大学学报（自然科学版）,2010,25(3):52-55. 被引量：1
8杨开睿,孟凡荣,梁志贞.一种自适应权值的PCA算法[J].计算机工程与应用,2012,48(3):189-191. 被引量：14
9康珮珮,于凤芹,陈莹.车辆检测中可变形部件模型的改进与应用[J].计算机工程与应用,2016,52(20):209-213. 被引量：4
10蔡平胜,闫乐林.主成分分析法在掌纹图像识别中的应用[J].计算机系统应用,2010,19(9):187-190. 被引量：5

计算机应用

2006年第9期

浏览历史

内容加载中请稍等...

基于加权PCA的声音指纹降维技术被引量：5

参考文献12

二级参考文献13

共引文献10

同被引文献43

引证文献5

二级引证文献24

相关作者

相关机构

相关主题

浏览历史

基于加权PCA的声音指纹降维技术 被引量：5

参考文献12

二级参考文献13

共引文献10

同被引文献43

引证文献5

二级引证文献24

相关作者

相关机构

相关主题

浏览历史

基于加权PCA的声音指纹降维技术被引量：5