局部Gist特征匹配核的场景分类被引量：25

Scene categorization of local Gist feature match kernel

导出

摘要针对场景分类任务中全局Gist特征粒度较为粗糙的问题,提出一种基于稠密网格的局部Gist特征描述,利用空间金字塔结构加入空间信息,通过引入RGB颜色空间加入颜色信息,并基于词汇包(BOW)模型设计一种高效匹配核来度量局部特征间的相似性,核化特征匹配过程,使用线性SVM完成场景分类。实验考察了不同尺度、方向、粒度和不同匹配核的局部Gist特征以及训练样本集的大小对分类结果的影响,并通过在OT场景图像集上与全局Gist特征和稠密SIFT特征的场景分类结果进行比较,充分说明了本文特征构造方法和分类模型的有效性。 Due to the coarse fineness of global Gist features in scene categorization tasks, we propose a local Gist feature description based on a dense grid. It uses a spatial pyramid structure to add distribution information and introduces the RGB color space to add color information. The feature matching process is kernelized by an efficient match kernel which mea- sures the similarity between local features based on the BOW model. The scene categorization task can be done with linear SVM. Experiment shows the influence to the classification accuracy with local Gist features which have different scale, orientation, fineness, match kernels and numbers of training samples. By using the classification result of the global Gist feature and dense SIFT features on the OT scene dataset, we demonstrate that the proposed feature construction method and classification model are efficient.

作者杨昭高隽谢昭吴克伟

机构地区合肥工业大学计算机与信息学院

出处《中国图象图形学报》 CSCD 北大核心 2013年第3期264-270,共7页 Journal of Image and Graphics

基金国家自然科学基金项目(60905005 6875012 61273237) 教育部博士点基金项目(20090111110015)

关键词局部Gist特征空间金字塔高效匹配核场景分类 local Gist feature spatial pyramid efficient match kernel scene classification

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献11

1Sivic J, Zisserman A. Video google: a text retrieval approach to object matching in videos [ C ] //Proceedings of International Conference on Computer Vision. Washington DC: [ s. n. ], 2003,1470-1477.
2Jurie F, Triggs B. Creating efficient codebooks for visual recogni- tion [ C ]//Proceedings of International Conference on Computer Vision. Beijing: [s. n. ], 2005: 604-610.
3Lazebnik S, Schmid C. Beyond bags of features : spatial pyramid matching for recognizing natural scene categories [ C ]//Procee- dings of IEEE Conference on Computer Vision and Pattern Recog- nition. New York: IEEE, 2006, 2:2169-2178.
4Oliva A, Torralba A. Modeling the shape of the scene a holistie representation of the spatial envelope [ J ]. International Journal in Computer Vision, 2001,42(3) : 145-175.
5Oliva A, Torralba A. Building the gist of a scene: the role of global image features in recognition [ J ]. Progress in Brain Research : Visual Perception, 2006, 155 : 23-36.
6Muller K R, Mika S, Ratsch G, et al. An introduction to kernel based learning algorithms [ J]. IEEE Transactions on Neural Net- works, 2001, 12(2) : 181-201.
7Hofman T, Sch~lkopf B. Kernel methods in machine learning [J]. The Annals of Statistics, 2008, 36(3) : 1171-1220.
8Vapnik V N. Statistical Learning Theory [ M ]. New York: Wiley, 1998.
9Scholkopf B, Smola A J. Learning with Kernels [ M ]. Massa- chusetts: The MIT Press, 2002.
10Daugman J. Uncertainty relation for resolution in space, spatial, frequency, and orientation optimized by two-dimensional visual cortical filters [ J]. Journal of the Optical Society of America, 1985, 2(7) : 1160-1169.

同被引文献192

1王改梅,刘瑞光,刘芳.基于小波包变换的纹理图像检索[J].计算机工程与应用,2004,40(18):44-46. 被引量：14
2王冰,赵志伟.基于内容的图像检索技术[J].信息技术与信息化,2005(5):81-82. 被引量：6
3李晓宇,张新峰,沈兰荪.支持向量机(SVM)的研究进展[J].测控技术,2006,25(5):7-12. 被引量：46
4郑芳炫,杨志强.以消失点为基础下从单张影像中估测深度[J].信息技术与应用,2006,1(3):229-235.
5KonradJ. Brown G. Wang M. et al . Automatic 2D-to-3D image conversion using 3D examples[rom the internet[CJ //Proceedings of SPIE. Bellingham: Society of Photo?Optical Instrumentation Engineers Press. 2012. 8288: 82880F.
6KonradJ, Wang M, Ishwar P. 2D-to-3D image conversion by learning depth from examples[CJ / /Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. Los Alamitos: IEEE Computer Society Press, 2012: 16-22.
7Barnes C. Goldman D B, Shechtman E, et al. The PatchMatch randomized matching algorithm for image manipulation[J]. Communications of the ACM, 2011. 54 (1): 103-110.
8Lai K, Bo L F, Ren X F, et al. A large-scale hierarchical multi-view RGB-D object dataset[CJ //Proceedings of IEEE International Conference on Robotics and Automation. Los Alamitos: IEEE Computer Society Press, 2011: 1817-1824.
9Janoch A, Karayev S,Jia Y Q, et al. A category-level 3-D object dataset: putting the Kinect to work[CJ //Proceedings of the 13th International Conference on Computer Vision. Los Alamitos: IEEE Computer Society Press, 2011: 1168-1174.
10Silberman N, Hoiem D, Kohli P, et al. Indoor segmentation and support inference from RGBD images[CJ //Proceedings of the 12th European Conference on Computer Vision. Berlin: Springer, 2012: 746-760.

引证文献25

1袁红星,吴少群,朱仁祥,胡劲松,安鹏.利用深度传感器大数据的单目图像深度估计[J].计算机辅助设计与图形学学报,2013,25(12):1786-1792. 被引量：2
2龚成清.改进的BEMD在纹理图像检索中的应用[J].现代计算机,2013,19(22):28-32. 被引量：2
3季海峰,高隽,郑鹏,王婧.多尺度空间判别性概率潜在语义分析的场景分类[J].中国图象图形学报,2014,19(1):109-118. 被引量：2
4申晓霞,张桦,高赞,徐光平,薛彦兵.基于Kinect和金字塔特征的行为识别算法[J].光电子．激光,2014,25(2):357-363. 被引量：13
5蔡丽娟.引入图片相似性的PatchMatchGraph方法[J].闽南师范大学学报（自然科学版）,2014,27(3):53-58.
6肖保良.基于Gist特征与PHOG特征融合的多类场景分类[J].中北大学学报（自然科学版）,2014,35(6):690-694. 被引量：6
7袁红星,吴少群,朱仁祥,安鹏.加权SIFT流深度迁移的单幅图像2D转3D[J].电子学报,2015,43(2):242-247.
8刘静,郭建,贺遵亮.基于Gist和PHOG特征的场景分类[J].计算机工程,2015,41(4):232-235. 被引量：5
9胡正平,陈俊岭,王蒙,赵淑欢.卷积神经网络分类模型在模式识别中的新进展[J].燕山大学学报,2015,39(4):283-291. 被引量：31
10魏凤梅.基于视觉内容的图书检索系统研究与实现[J].高校图书情报论坛,2016,0(1):48-52.

二级引证文献123

1刘中涛,胡凡,王淦,李钊,王磊,葛平高,王建娟.基于特征融合的深度学习场景识别与应用[J].计算机应用研究,2020,37(S01):418-420. 被引量：1
2张新刚.低分辨率模糊车辆的人工智能识别研究[J].信息通信,2019,0(12):121-123.
3徐世武,曾珏,张诗慧,李长征,李亭谕.一种深度卷积神经网络土地利用场景照片的分类方法[J].测绘通报,2020(2):24-28. 被引量：2
4秦进红.外语音像资料员的情报意识和服务意识[J].图书馆建设,2000(3):88-89. 被引量：1
5杨志安.我国中小企业发展的模式选择[J].经济管理,2000,26(4):13-14. 被引量：12
6腾云,贾勇勇,杨景刚,谢天喜.基于多算法融合的移动相机视频识别技术研究[J].自动化与仪器仪表,2019(1):60-63. 被引量：3
7孔颉,孙权森,纪则轩,刘亚洲.基于仿射不变离散哈希的遥感图像快速目标检测新方法[J].南京大学学报（自然科学版）,2019,55(1):49-60. 被引量：3
8温旭杰,卢辉斌,李强.基于局部语义上下文的场景分类方法[J].燕山大学学报,2014,38(6):551-556.
9张飞燕,李俊峰,沈军民.基于梯度和光流统计特性的人体行为识别[J].光电子．激光,2015,26(8):1593-1601. 被引量：5
10许嘉琳,朱耀麟,武桐.基于Kinect的人物抠图算法[J].西安工程大学学报,2015,29(6):724-727. 被引量：3

1郭兰图,余芳,陈金凤.一种局部与全局特征结合的图像检索算法[J].微型机与应用,2013,32(18):44-46.
2哈力旦·阿布都热依木.基于GIST特征和多特征融合的维吾尔文字幕关键帧提取方法研究[J].新疆大学学报（自然科学维文版）,2016,0(1):1-10.
3梁雪琦.基于Gist特征与CNN的场景分类方法[J].电视技术,2016,40(11):7-11.
4张雪松.基于全局Gist特征和局部碎片特征的物体检测研究[J].自动化与仪器仪表,2015(2):85-88.
5徐涛,庹红娅,方正,刘力,敬忠良.基于特征筛选的码本区分性增强方法[J].计算机应用研究,2014,31(5):1597-1600.
6岳占峰,汤丰.基于图像嵌入空间集成学习的图像分类[J].中国传媒科技,2016,0(9):35-36.
7刘宏,普杰信.一种改进的自然场景特征提取方法[J].计算机工程,2011,37(21):182-184. 被引量：3
8刘静,郭建,贺遵亮.基于Gist和PHOG特征的场景分类[J].计算机工程,2015,41(4):232-235. 被引量：5
9蔡淞,鲁帅.基于多特征融合的人脸识别研究[J].计算机应用与软件,2015,32(12):140-144. 被引量：4
10王斌,常发亮,刘春生.基于多特征融合的交通标志分类[J].山东大学学报（工学版）,2016,46(4):34-40. 被引量：5

中国图象图形学报

2013年第3期

浏览历史

内容加载中请稍等...

局部Gist特征匹配核的场景分类被引量：25

参考文献11

同被引文献192

引证文献25

二级引证文献123

相关作者

相关机构

相关主题

浏览历史

局部Gist特征匹配核的场景分类 被引量：25

参考文献11

同被引文献192

引证文献25

二级引证文献123

相关作者

相关机构

相关主题

浏览历史

局部Gist特征匹配核的场景分类被引量：25