Object detection plays a crucial role in the field of computer vision,and small object detection has long been a challenging issue within this domain.In order to improve the performance of object detection on small ta...Object detection plays a crucial role in the field of computer vision,and small object detection has long been a challenging issue within this domain.In order to improve the performance of object detection on small targets,this paper proposes an enhanced structure for YOLOv5,termed ATC-YOLOv5.Firstly,a novel structure,AdaptiveTrans,is introduced into YOLOv5 to facilitate efficient communication between the encoder and the detector.Consequently,the network can better address the adaptability challenge posed by objects of different sizes in object detection.Additionally,the paper incorporates the CBAM(Convolutional Block Attention Module)attention mechanism,which dynamically adjusts the weights of different channels in the feature map by introducing a channel attention mechanism.Finally,the paper addresses small object detection by increasing the number of detection heads,specifically designed for detecting high-resolution andminute target objects.Experimental results demonstrate that on the VisDrone2019 dataset,ATC-YOLOv5 outperforms the original YOLOv5,with an improvement in mAP@0.5 from 34.32%to 42.72%and an increase in mAP@[0.5:0.95]from 18.93%to 24.48%.展开更多
文摘Object detection plays a crucial role in the field of computer vision,and small object detection has long been a challenging issue within this domain.In order to improve the performance of object detection on small targets,this paper proposes an enhanced structure for YOLOv5,termed ATC-YOLOv5.Firstly,a novel structure,AdaptiveTrans,is introduced into YOLOv5 to facilitate efficient communication between the encoder and the detector.Consequently,the network can better address the adaptability challenge posed by objects of different sizes in object detection.Additionally,the paper incorporates the CBAM(Convolutional Block Attention Module)attention mechanism,which dynamically adjusts the weights of different channels in the feature map by introducing a channel attention mechanism.Finally,the paper addresses small object detection by increasing the number of detection heads,specifically designed for detecting high-resolution andminute target objects.Experimental results demonstrate that on the VisDrone2019 dataset,ATC-YOLOv5 outperforms the original YOLOv5,with an improvement in mAP@0.5 from 34.32%to 42.72%and an increase in mAP@[0.5:0.95]from 18.93%to 24.48%.