期刊文献+

面向复杂环境的图像语义分割方法综述 被引量:51

Research on Image Semantic Segmentation for Complex Environments
在线阅读 下载PDF
导出
摘要 图像语义分割是视觉智能方向最重要的基础性技术之一,语义分割效果关系着智能系统对其应用场景的理解能力,因此在诸如无人驾驶、机器人认知与导航、安防监控与无人机着陆系统等重要领域均具有较大的应用价值。由于复杂环境下的目标存在非结构化、目标多样化、形状不规则化以及光照变化、视角变化、尺度变化与物体遮挡等各种干扰因素,给图像的语义分割带来了较大挑战。近年来,受益于深度学习理论的快速发展,图像语义分割方向涌现了一大批具有典型意义的研究成果。为启发图像语义分割领域的学术研究及其相关智能系统的工程化开发,文中首先全面阐述了图像语义分割方法的研究发展历程,并将其划分为:传统的图像语义分割方法、传统方法与深度学习相结合的图像语义分割方法、基于深度学习的图像语义分割方法;其次从复杂环境下图像语义分割面临的问题出发,重点对近年来涌现的各种面向复杂环境的语义分割方法的模型、算法、性能及存在的问题进行了详细地分析与对比,并按照强监督、弱监督、无监督图像语义分割方法分类进行阐述;然后归纳了当前主流的PASCALVOC,Cityscape,SUNRGB-D等9类包含各种复杂环境的数据集,以及3项评估指标PA,mPA和mIoU;最后对面向复杂环境的图像语义分割研究工作进行了总结,并对其在实时视频分割、三维场景重构及无监督语义分割等方向的发展进行了展望。 Image semantic segmentation is one of the most important fundamental technologies for visual intelligence.Semantic segmentation can greatly enable intelligent systems to understand their surrounding scenarios,so it has enormous value in application domains such as unmanned vehicles, robot cognition and navigation,video surveillance and drone landing systems.Great challenges also exist in the semantic segmentation of images,due to various interfering factors of targets in complex environments,such as unstructured targets,diversity of objectives, irregular shapes,illumination changes,different viewing angles,scale variation,object occlusion,etc.In recent years,benefiting from the great advancements in deep learning techniques,a large number of research approaches with practical significance emerge in ima- ge semantic segmentation.For having a comprehensive survey and inspiring the academic research,this paper extensively discussed the existing state-of-the-art image semantic segmentation methods,and further classified them into the traditional image semantic segmentation ones,the ones combining traditional and deep learning techniques,and those based purely on deep learning.In order to address these problems in complex environments, various semantic segmentation methods for complex environment emerged in recent years were analyzed and compared in detail,including the mo- dels ,algorithms and performance with the category of strong supervised,weak supervised and unsupervised semantic segmentation methods.Furthermore, the current main datasets such as PASCAL VOC,Cityscape,SUN RGB-D,which contains various complex environments and 3 evaluation indicators of PA, mPA,mIoU were summarized.Finally,the existing research of image semantic segmentation for complex environment was summarized,and its future trends were prospected such as optimization in real-time video,3d scene reconstruction and unsupervised semantic segmentation techniques.
作者 王嫣然 陈清亮 吴俊君 WANG Yan-ran;CHEN Qing-liang;WU Jun-jun(College of Information Science and Technology,Jinan University,Guangzhou 510632,China;School of Mechatronics Engineering,Foshan University,Foshan,Guangdong528225,China)
出处 《计算机科学》 CSCD 北大核心 2019年第9期36-46,共11页 Computer Science
基金 国家自然科学基金(61603103,61673125) 广东省自然科学基金(2016A030310293) 广州市科技计划科学研究专项(201707010013)资助
关键词 语义分割 视觉智能 深度学习 图像分割 卷积神经网络 Semantic segmentation Visual intelligence Deep learning Image segmentation Convolutional neural network
  • 相关文献

参考文献4

二级参考文献119

  • 1唐鹏,高琳,盛鹏.基于动态形状的红外目标提取算法[J].光电子.激光,2009,20(8):1049-1052. 被引量:3
  • 2闫成新,桑农,张天序.基于图论的图像分割研究进展[J].计算机工程与应用,2006,42(5):11-14. 被引量:34
  • 3陶文兵,金海.一种新的基于图谱理论的图像阈值分割方法[J].计算机学报,2007,30(1):110-119. 被引量:61
  • 4LI XiaoBin,TIAN Zheng.Multiscale stochastic hierarchical image segmentation by spectral clustering[J].Science in China(Series F),2007,50(2):198-211. 被引量:14
  • 5Pal N R, Pal S K. A review on image segmentation tech- niques. Pattern Recognition, 1993, 26(9): 1277-1294.
  • 6Veksler O. Efficient Graph-based Energy Minimization Methods in Computer Vision [Ph.D. dissertation], Cornell University, USA, 1999.
  • 7Bhandarkar S M, Zhang H. A comparison of stochastic op- timization techniques for image segmentation. International Journal o? Intelligent Systems, 2000, 15(5): 441-476.
  • 8Wang J S, Swendsen R H. Cluster Monte Carlo algorithms. Physica A: Statistical Mechanics and Its Applications, 1990, 167(3): 565--578.
  • 9Tu Z W, Zhu S C. Image segmentation by data-driven Markov chain Conte Carlo. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(5): 657-673.
  • 10Barbu A, Zhu S C. Generalizing Swendsen-Wang to sam- pling arbitrary posterior probabilities. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2005, 27(8): 1239-1253.

共引文献298

同被引文献327

引证文献51

二级引证文献271

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部