期刊文献+

局部Gist特征匹配核的场景分类 被引量:25

Scene categorization of local Gist feature match kernel
原文传递
导出
摘要 针对场景分类任务中全局Gist特征粒度较为粗糙的问题,提出一种基于稠密网格的局部Gist特征描述,利用空间金字塔结构加入空间信息,通过引入RGB颜色空间加入颜色信息,并基于词汇包(BOW)模型设计一种高效匹配核来度量局部特征间的相似性,核化特征匹配过程,使用线性SVM完成场景分类。实验考察了不同尺度、方向、粒度和不同匹配核的局部Gist特征以及训练样本集的大小对分类结果的影响,并通过在OT场景图像集上与全局Gist特征和稠密SIFT特征的场景分类结果进行比较,充分说明了本文特征构造方法和分类模型的有效性。 Due to the coarse fineness of global Gist features in scene categorization tasks, we propose a local Gist feature description based on a dense grid. It uses a spatial pyramid structure to add distribution information and introduces the RGB color space to add color information. The feature matching process is kernelized by an efficient match kernel which mea- sures the similarity between local features based on the BOW model. The scene categorization task can be done with linear SVM. Experiment shows the influence to the classification accuracy with local Gist features which have different scale, orientation, fineness, match kernels and numbers of training samples. By using the classification result of the global Gist feature and dense SIFT features on the OT scene dataset, we demonstrate that the proposed feature construction method and classification model are efficient.
出处 《中国图象图形学报》 CSCD 北大核心 2013年第3期264-270,共7页 Journal of Image and Graphics
基金 国家自然科学基金项目(60905005 6875012 61273237) 教育部博士点基金项目(20090111110015)
关键词 局部Gist特征 空间金字塔 高效匹配核 场景分类 local Gist feature spatial pyramid efficient match kernel scene classification
  • 相关文献

参考文献11

  • 1Sivic J, Zisserman A. Video google: a text retrieval approach to object matching in videos [ C ] //Proceedings of International Conference on Computer Vision. Washington DC: [ s. n. ], 2003,1470-1477.
  • 2Jurie F, Triggs B. Creating efficient codebooks for visual recogni- tion [ C ]//Proceedings of International Conference on Computer Vision. Beijing: [s. n. ], 2005: 604-610.
  • 3Lazebnik S, Schmid C. Beyond bags of features : spatial pyramid matching for recognizing natural scene categories [ C ]//Procee- dings of IEEE Conference on Computer Vision and Pattern Recog- nition. New York: IEEE, 2006, 2:2169-2178.
  • 4Oliva A, Torralba A. Modeling the shape of the scene a holistie representation of the spatial envelope [ J ]. International Journal in Computer Vision, 2001,42(3) : 145-175.
  • 5Oliva A, Torralba A. Building the gist of a scene: the role of global image features in recognition [ J ]. Progress in Brain Research : Visual Perception, 2006, 155 : 23-36.
  • 6Muller K R, Mika S, Ratsch G, et al. An introduction to kernel based learning algorithms [ J]. IEEE Transactions on Neural Net- works, 2001, 12(2) : 181-201.
  • 7Hofman T, Sch~lkopf B. Kernel methods in machine learning [J]. The Annals of Statistics, 2008, 36(3) : 1171-1220.
  • 8Vapnik V N. Statistical Learning Theory [ M ]. New York: Wiley, 1998.
  • 9Scholkopf B, Smola A J. Learning with Kernels [ M ]. Massa- chusetts: The MIT Press, 2002.
  • 10Daugman J. Uncertainty relation for resolution in space, spatial, frequency, and orientation optimized by two-dimensional visual cortical filters [ J]. Journal of the Optical Society of America, 1985, 2(7) : 1160-1169.

同被引文献192

  • 1王改梅,刘瑞光,刘芳.基于小波包变换的纹理图像检索[J].计算机工程与应用,2004,40(18):44-46. 被引量:14
  • 2王冰,赵志伟.基于内容的图像检索技术[J].信息技术与信息化,2005(5):81-82. 被引量:6
  • 3李晓宇,张新峰,沈兰荪.支持向量机(SVM)的研究进展[J].测控技术,2006,25(5):7-12. 被引量:46
  • 4郑芳炫,杨志强.以消失点为基础下从单张影像中估测深度[J].信息技术与应用,2006,1(3):229-235.
  • 5KonradJ. Brown G. Wang M. et al . Automatic 2D-to-3D image conversion using 3D examples[rom the internet[CJ //Proceedings of SPIE. Bellingham: Society of Photo?Optical Instrumentation Engineers Press. 2012. 8288: 82880F.
  • 6KonradJ, Wang M, Ishwar P. 2D-to-3D image conversion by learning depth from examples[CJ / /Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. Los Alamitos: IEEE Computer Society Press, 2012: 16-22.
  • 7Barnes C. Goldman D B, Shechtman E, et al. The PatchMatch randomized matching algorithm for image manipulation[J]. Communications of the ACM, 2011. 54 (1): 103-110.
  • 8Lai K, Bo L F, Ren X F, et al. A large-scale hierarchical multi-view RGB-D object dataset[CJ //Proceedings of IEEE International Conference on Robotics and Automation. Los Alamitos: IEEE Computer Society Press, 2011: 1817-1824.
  • 9Janoch A, Karayev S,Jia Y Q, et al. A category-level 3-D object dataset: putting the Kinect to work[CJ //Proceedings of the 13th International Conference on Computer Vision. Los Alamitos: IEEE Computer Society Press, 2011: 1168-1174.
  • 10Silberman N, Hoiem D, Kohli P, et al. Indoor segmentation and support inference from RGBD images[CJ //Proceedings of the 12th European Conference on Computer Vision. Berlin: Springer, 2012: 746-760.

引证文献25

二级引证文献123

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部