Ground military target recognition plays a crucial role in unmanned equipment and grasping the battlefield dynamics for military applications, but is disturbed by low-resolution and noisyrepresentation. In this paper,...Ground military target recognition plays a crucial role in unmanned equipment and grasping the battlefield dynamics for military applications, but is disturbed by low-resolution and noisyrepresentation. In this paper, a recognition method, involving a novel visual attention mechanismbased Gabor region proposal sub-network(Gabor RPN) and improved refinement generative adversarial sub-network(GAN), is proposed. Novel central-peripheral rivalry 3D color Gabor filters are proposed to simulate retinal structures and taken as feature extraction convolutional kernels in low-level layer to improve the recognition accuracy and framework training efficiency in Gabor RPN. Improved refinement GAN is used to solve the problem of blurry target classification, involving a generator to directly generate large high-resolution images from small blurry ones and a discriminator to distinguish not only real images vs. fake images but also the class of targets. A special recognition dataset for ground military target, named Ground Military Target Dataset(GMTD), is constructed. Experiments performed on the GMTD dataset effectively demonstrate that our method can achieve better energy-saving and recognition results when low-resolution and noisy-representation targets are involved, thus ensuring this algorithm a good engineering application prospect.展开更多
Vision-based technologies have been extensively applied for on-street parking space sensing,aiming at providing timely and accurate information for drivers and improving daily travel convenience.However,it faces great...Vision-based technologies have been extensively applied for on-street parking space sensing,aiming at providing timely and accurate information for drivers and improving daily travel convenience.However,it faces great challenges as a partial visualization regularly occurs owing to occlusion from static or dynamic objects or a limited perspective of camera.This paper presents an imagery-based framework to infer parking space status by generating 3D bounding box of the vehicle.A specially designed convolutional neural network based on ResNet and feature pyramid network is proposed to overcome challenges from partial visualization and occlusion.It predicts 3D box candidates on multi-scale feature maps with five different 3D anchors,which generated by clustering diverse scales of ground truth box according to different vehicle templates in the source data set.Subsequently,vehicle distribution map is constructed jointly from the coordinates of vehicle box and artificially segmented parking spaces,where the normative degree of parked vehicle is calculated by computing the intersection over union between vehicle’s box and parking space edge.In space status inference,to further eliminate mutual vehicle interference,three adjacent spaces are combined into one unit and then a multinomial logistic regression model is trained to refine the status of the unit.Experiments on KITTI benchmark and Shanghai road show that the proposed method outperforms most monocular approaches in 3D box regression and achieves satisfactory accuracy in space status inference.展开更多
基金the National Key Research and Development Program of China(No.2016YFC0802904)National Natural Science Foundation of China(No.61671470)Natural Science Foundation of Jiangsu Province(BK20161470).
文摘Ground military target recognition plays a crucial role in unmanned equipment and grasping the battlefield dynamics for military applications, but is disturbed by low-resolution and noisyrepresentation. In this paper, a recognition method, involving a novel visual attention mechanismbased Gabor region proposal sub-network(Gabor RPN) and improved refinement generative adversarial sub-network(GAN), is proposed. Novel central-peripheral rivalry 3D color Gabor filters are proposed to simulate retinal structures and taken as feature extraction convolutional kernels in low-level layer to improve the recognition accuracy and framework training efficiency in Gabor RPN. Improved refinement GAN is used to solve the problem of blurry target classification, involving a generator to directly generate large high-resolution images from small blurry ones and a discriminator to distinguish not only real images vs. fake images but also the class of targets. A special recognition dataset for ground military target, named Ground Military Target Dataset(GMTD), is constructed. Experiments performed on the GMTD dataset effectively demonstrate that our method can achieve better energy-saving and recognition results when low-resolution and noisy-representation targets are involved, thus ensuring this algorithm a good engineering application prospect.
基金This work was supported in part by National Natural Science Foundation of China(No.51805312)in part by Shanghai Sailing Program(No.18YF1409400)+2 种基金in part by Training and Funding Program of Shanghai College young teachers(No.ZZGCD15102)in part by Scientific Research Project of Shanghai University of Engineering Science(No.2016-19)in part by the Shanghai University of Engineering Science Innovation Fund for Graduate Students(No.18KY0613).
文摘Vision-based technologies have been extensively applied for on-street parking space sensing,aiming at providing timely and accurate information for drivers and improving daily travel convenience.However,it faces great challenges as a partial visualization regularly occurs owing to occlusion from static or dynamic objects or a limited perspective of camera.This paper presents an imagery-based framework to infer parking space status by generating 3D bounding box of the vehicle.A specially designed convolutional neural network based on ResNet and feature pyramid network is proposed to overcome challenges from partial visualization and occlusion.It predicts 3D box candidates on multi-scale feature maps with five different 3D anchors,which generated by clustering diverse scales of ground truth box according to different vehicle templates in the source data set.Subsequently,vehicle distribution map is constructed jointly from the coordinates of vehicle box and artificially segmented parking spaces,where the normative degree of parked vehicle is calculated by computing the intersection over union between vehicle’s box and parking space edge.In space status inference,to further eliminate mutual vehicle interference,three adjacent spaces are combined into one unit and then a multinomial logistic regression model is trained to refine the status of the unit.Experiments on KITTI benchmark and Shanghai road show that the proposed method outperforms most monocular approaches in 3D box regression and achieves satisfactory accuracy in space status inference.