The different imaging conditions of high spatial resolution remote sensing images(HSRRSIs)tend to cause large differences in the background information of bridges from the images,including problems of difficult detect...The different imaging conditions of high spatial resolution remote sensing images(HSRRSIs)tend to cause large differences in the background information of bridges from the images,including problems of difficult detection of multiscale bridges,leakage of small bridges and insufficient detection accuracy for their detection.To address these problems,a YOLOv5 network with a decoupled head for the automatic detection of bridges in HSRRIs is proposed in this paper.First,the problem of inconsistent scale of information fusion of each feature in the feature pyramid network is solved using a weighted bi-directional feature pyramid network(BiFPN).Then,the convolutional block attention module(CBAM)is fused into the three effective feature layers after feature pyramid network processing.The bridge feature information is effectively extracted from the channel and spatial dimensions.Next,the decoupled head is fused in the YOLO Head to separate the classifier and regressor to speed up the network convergence and improve the network detection accuracy simultaneously.Finally,the practical effect is evaluated by calculating the average precision(AP).According to the experimental results,the AP of the proposed method is 98.1%,which is improved by 4.1%∼23.5%compared with other models.展开更多
A forest fire is a natural disaster characterized by rapid spread,difficulty in extinguishing,and widespread destruction,which requires an efficient response.Existing detection methods fail to balance global and local...A forest fire is a natural disaster characterized by rapid spread,difficulty in extinguishing,and widespread destruction,which requires an efficient response.Existing detection methods fail to balance global and local fire features,resulting in the false detection of small or hidden fires.In this paper,we propose a novel detection technique based on an improved YOLO v5 model to enhance the visual representation of forest fires and retain more information about global interactions.We add a plug-and-play global attention mechanism to improve the efficiency of neck and backbone feature extraction of the YOLO v5 model.Then,a re-parameterized convolutional module is designed,and a decoupled detection head is used to accelerate the convergence speed.Finally,a weighted bi-directional feature pyramid network(BiFPN)is introduced to merge feature information for local information processing.In the evaluation,we use the complete intersection over union(CIoU)loss function to optimize the multi-task loss for different kinds of forest fires.Experiments show that the precision,recall,and mean average precision are increased by 4.2%,3.8%,and 4.6%,respectively,compared with the classic YOLO v5 model.In particular,the mAP@0.5:0.95 is 2.2% higher than the other detection methods,while meeting the requirements of real-time detection.展开更多
基金funded by National Natural Science Foundation of China[grant nulmber 41961039]Yunnan Fun-damental Research Projects[grant numbers 202201 AT070164,202101AT070102].
文摘The different imaging conditions of high spatial resolution remote sensing images(HSRRSIs)tend to cause large differences in the background information of bridges from the images,including problems of difficult detection of multiscale bridges,leakage of small bridges and insufficient detection accuracy for their detection.To address these problems,a YOLOv5 network with a decoupled head for the automatic detection of bridges in HSRRIs is proposed in this paper.First,the problem of inconsistent scale of information fusion of each feature in the feature pyramid network is solved using a weighted bi-directional feature pyramid network(BiFPN).Then,the convolutional block attention module(CBAM)is fused into the three effective feature layers after feature pyramid network processing.The bridge feature information is effectively extracted from the channel and spatial dimensions.Next,the decoupled head is fused in the YOLO Head to separate the classifier and regressor to speed up the network convergence and improve the network detection accuracy simultaneously.Finally,the practical effect is evaluated by calculating the average precision(AP).According to the experimental results,the AP of the proposed method is 98.1%,which is improved by 4.1%∼23.5%compared with other models.
基金supported by the Graduate Research and Innovation Projects of Jiangsu Province(No.SJCX23_0320).
文摘A forest fire is a natural disaster characterized by rapid spread,difficulty in extinguishing,and widespread destruction,which requires an efficient response.Existing detection methods fail to balance global and local fire features,resulting in the false detection of small or hidden fires.In this paper,we propose a novel detection technique based on an improved YOLO v5 model to enhance the visual representation of forest fires and retain more information about global interactions.We add a plug-and-play global attention mechanism to improve the efficiency of neck and backbone feature extraction of the YOLO v5 model.Then,a re-parameterized convolutional module is designed,and a decoupled detection head is used to accelerate the convergence speed.Finally,a weighted bi-directional feature pyramid network(BiFPN)is introduced to merge feature information for local information processing.In the evaluation,we use the complete intersection over union(CIoU)loss function to optimize the multi-task loss for different kinds of forest fires.Experiments show that the precision,recall,and mean average precision are increased by 4.2%,3.8%,and 4.6%,respectively,compared with the classic YOLO v5 model.In particular,the mAP@0.5:0.95 is 2.2% higher than the other detection methods,while meeting the requirements of real-time detection.