视觉特征空间中大规模聚类问题的一种鲁棒近似算法被引量：1

A robust approximate algorithm for large-scale clustering of visual features

下载PDF

导出

摘要视觉特征空间中的大规模聚类问题是图像识别和检索中亟待解决的问题.当前最好的算法是近似k-means算法,它是Lloyd算法的近似算法,只能依靠采用高准确率的近似搜索近似地保证聚类结果的性能.为此针对近似k-means算法提出改进的基本不增加时间、空间代价新算法,具有更好的算法收敛性和聚类性能.该算法利用了迭代求解过程中更多的信息,更有效地更新子类划分,使得聚类损失单调不增并且快速减小.理论证明,采用任意准确率的近似搜索,该算法都可以在有限轮迭代后收敛到Lloyd算法的收敛解.实验结果表明,分别采用最优参数产生同等性能结果时,所提出的算法比近似k-means算法快10倍.此外,通过比较全局特征聚类实验中的子类的图像,也直观地验证了其聚类效果. The large scale clustering problem of visual features is crucial for image recognition and retrieval. The state-of-the-art algorithm, the approximate k means, approximately guarantees the clustering performance by applying the high-precision approximate search. An improved algorithm was proposed, which requires no extra memory cost and nearly no extra time consumption. The robust approximate algorithm can better guarantee its convergence and clustering performance by utilizing more information in the iteration to update the partition, so that clustering loss is non-increasing and reduced rapidly. Theoretical proofs guarantee that the algorithm converges to the converged solution of the Lloyd algorithm, regard less of the precision values of approximate search. The experiment results show that the algorithm has about 10 times the speed of the approximate k-means algorithm. Besides, the clustering performance is also directly verified by comparing the images in the clustering results of global features.

作者李大瑞杨林军华先胜张宏江

机构地区中国科学技术大学教育部-微软联合实验室微软必应微软研究院金山软件

出处《中国科学技术大学学报》 CAS CSCD 北大核心 2014年第10期844-852,共9页 JUSTC

关键词大规模聚类 K-MEANS 近似算法 large scale elustering k-means approximate algorithm

分类号 TN911.73 [电子电信—通信与信息系统] TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献14

1Nister D, Stewenius H. Scalable recognition with a vocabulary tree [C]// 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognitior:New York, USA: IEEE Press, 2006, 2:2 161-2 168.
2Philbin J, Chum O, Isard M, et al. Object retrieval with large vocabularies and fast spatial matching[C]// IEEE Conference on Computer Vision and PatternRecognition. Minneapolis, USA= IEEE Press, 2007.. 1 8.
3Tuytelaars T, Lampert C 14, Blaschko M B, et al. Unsupervised object discovery: A comparison [J]. International Journal of Computer Vision, 2010, 88(2): 284- 302.
4Philbin J, Zisserman A. Obiect mining using a matching graph on very large image collections[C]// Proceedings of the Inidan Conference on Computer Vision, Graphics &Image Processing. Bhubaneswar, India: IEEE Press, 2008: 738-745.
5Li X W, Wu C C, Zach C, et al. Modeling and recognition of landmark image collections using iconic scene graphs [C]// Proceedings of 10th European Conference on Computer Vision. Marseilie, France: Springer, 2008: 427-440.
6Torralba A, Fergus R, Freeman W T. 80 million tiny images: A large data set for nonparametric object and scene recognition [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008, 30 ( 11 ) .. 1 958-1 970.
7Lloyd S. Least squares quantization in PCM[J]. IEEE Transactions on Information Theory, 1982,28(2) : 129-136.
8Selim S Z, Ismail M A. K-means-type algorithms: A generalized convergence theorem and characterization of local optimality [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1984, 6(1) : 81-87.
9Jain A K. Data clustering: 50 years beyond Keans[J]. Pattern Recognition Letters, 2010,31(8) ..651- 666.
10Judd D, McKinley P K, Jain A K. Large>scale parallel data clustering[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1998,20(8): 871-876.

同被引文献1

1陈舒,蒋志会,陆恒,缪天翔.路网环境中关于模糊组最近邻问题的研究[J].计算机应用研究,2016,33(2):343-346. 被引量：3

引证文献1

1邹蕾.一种近似的K最近邻图算法[J].江苏科技大学学报（自然科学版）,2017,31(4):513-518. 被引量：1

二级引证文献1

1陈功平,王红.基于邻近图拓扑构造算法的仿真设计[J].绥化学院学报,2020,40(2):158-160. 被引量：1

1荆晓远,金忠,杨静宇.基于子类划分的分类器设计方法[J].南京理工大学学报,1999,23(4):293-296.
2熊子源,徐振海,肖顺平.聚类子阵划分及子阵级单脉冲测角性能分析[J].系统工程与电子技术,2013,35(9):1867-1872. 被引量：7
3韩海.K进制遗传算法在聚类问题求解中的应用[J].无线互联科技,2016,13(17):135-136.
4毕文武,范良志,张楠.基于ANSYS的超薄扁平永磁直线电机的热设计研究[J].机电产品开发与创新,2012,25(1):52-54.
5俞国燕,王筱珍.改进遗传算法的应用研究[J].机械制造,2007,45(5):58-60. 被引量：3
6周进.Windows XP中增加时间服务器方法[J].软件,2002,23(5):41-41.
7萧萍,曹钧.构造基于RBAC模型的安全数据库[J].现代计算机,2009,15(1):146-148.
8ZHANG HuaZi,ZHANG ZhaoYang,YUEN Chau.Energy-efficient spectrum-aware clustering for cognitive radio sensor networks[J].Chinese Science Bulletin,2012,57(28):3731-3739. 被引量：2
9刘立民,靳晨霞,杨丽芸,李法朝.两阶段遗传算法的结构及性能分析[J].河北科技大学学报,2007,28(1):44-48. 被引量：2
10耿卫江.基于计算机的虚拟仪器技术的设计与应用[J].信息技术与信息化,2015(7):88-88. 被引量：1

中国科学技术大学学报

2014年第10期

浏览历史

内容加载中请稍等...

视觉特征空间中大规模聚类问题的一种鲁棒近似算法被引量：1

参考文献14

同被引文献1

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

视觉特征空间中大规模聚类问题的一种鲁棒近似算法 被引量：1

参考文献14

同被引文献1

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

视觉特征空间中大规模聚类问题的一种鲁棒近似算法被引量：1