Funding: Supported by the National Natural Science Foundation of China (No. 62241109) and the Tianjin Science and Technology Commissioner Project (No. 20YDTPJC01110).
Abstract: An improved model based on You Only Look Once version 8 (YOLOv8) is proposed to address the low detection accuracy caused by the diversity of object sizes in optical remote sensing images. First, the feature pyramid network (FPN) structure of the original YOLOv8 model is replaced by the generalized FPN (GFPN) structure from GiraffeDet to realize "cross-layer" and "cross-scale" adaptive feature fusion, which enriches the semantic and spatial information in the feature maps and improves the model's detection ability. Second, a pyramid pooling module, multi atrous spatial pyramid pooling (MASPP), is designed by combining atrous convolution with a feature pyramid structure to extract multi-scale features and thereby improve the model's handling of multi-scale objects. Experimental results show that the improved YOLOv8 model achieves a detection accuracy of 92% and a mean average precision (mAP) of 87.9% on the DIOR dataset, which are 3.5% and 1.7% higher, respectively, than those of the original model. These results indicate that the detection and classification ability of the proposed model on multi-dimensional optical remote sensing targets has been improved.
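The atrous-convolution idea behind a module such as MASPP can be illustrated with a minimal one-dimensional sketch. This is not the authors' module: the function names, the dilation rates (1, 2, 4), and the all-ones kernel are assumptions chosen only to show how parallel branches with different dilation rates see different spans of the input while using the same number of weights.

```python
def effective_kernel_size(kernel_size: int, dilation: int) -> int:
    """Span covered by a dilated (atrous) kernel: k + (k - 1) * (d - 1)."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def dilated_conv1d(signal, kernel, dilation):
    """Zero-padded, same-length 1-D atrous convolution (cross-correlation form).

    Assumes an odd kernel length so the kernel can be centred on each sample.
    """
    half = (effective_kernel_size(len(kernel), dilation) - 1) // 2
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for j, weight in enumerate(kernel):
            idx = i - half + j * dilation  # taps are spaced `dilation` apart
            if 0 <= idx < len(signal):
                acc += weight * signal[idx]
        out.append(acc)
    return out


def multi_rate_features(signal, kernel, dilations=(1, 2, 4)):
    """Run parallel branches with different (assumed) dilation rates."""
    return [dilated_conv1d(signal, kernel, d) for d in dilations]


# A unit impulse makes the tap positions of each branch visible.
signal = [0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0]
branches = multi_rate_features(signal, kernel=[1.0, 1.0, 1.0])
for rate, branch in zip((1, 2, 4), branches):
    print(rate, effective_kernel_size(3, rate), branch)
```

A real pyramid pooling module would apply two-dimensional convolutions with learned weights and fuse the branch outputs; the sketch only shows why larger dilation rates widen the receptive field without adding parameters.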
Funding: Supported in part by the National Natural Science Foundation of China (Nos. 52472334, U2368204).
Abstract: In response to the challenges posed by complex backgrounds, diverse target angles, and numerous small targets in remote sensing images, together with the high resource consumption that hinders model deployment, we propose an enhanced, lightweight You Only Look Once version 8 small (YOLOv8s) detection algorithm. For the network improvements, we first replace traditional horizontal boxes with rotated boxes for target detection, which addresses the feature extraction difficulties caused by varying target angles. Second, we design a module that integrates convolutional neural network (CNN) and Transformer components to replace specific C2f modules in the backbone network, thereby expanding the model’s receptive field and enhancing feature extraction in complex backgrounds. Finally, we introduce a feature calibration structure to mitigate potential feature mismatches during feature fusion. For model compression, we employ a lightweight channel pruning technique based on localized mean average precision (LMAP) to eliminate redundancies in the enhanced model. Although this approach causes some loss of detection accuracy, it effectively reduces the number of parameters, the computational load, and the model size. Additionally, we employ channel-level knowledge distillation to recover accuracy in the pruned model, further enhancing detection performance. Experimental results indicate that the enhanced algorithm achieves a 6.1% increase in mAP50 compared with YOLOv8s, while reducing parameters, computational load, and model size by 57.7%, 28.8%, and 52.3%, respectively.
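To make the difference between horizontal and rotated boxes concrete, the following minimal sketch converts a rotated box into its four corner points. The (cx, cy, w, h, angle) parameterization and the function names are assumptions for illustration; the abstract does not state which box encoding the authors use.

```python
import math


def rotated_box_corners(cx, cy, w, h, angle_rad):
    """Corners of a w-by-h box centred at (cx, cy) and rotated by angle_rad.

    The rotation is counter-clockwise in a y-up frame; in image coordinates
    (y pointing down) the same formula appears clockwise on screen.
    """
    cos_a, sin_a = math.cos(angle_rad), math.sin(angle_rad)
    offsets = ((-w / 2, -h / 2), (w / 2, -h / 2), (w / 2, h / 2), (-w / 2, h / 2))
    return [
        (cx + dx * cos_a - dy * sin_a, cy + dx * sin_a + dy * cos_a)
        for dx, dy in offsets
    ]


def polygon_area(points):
    """Shoelace formula for the area of a simple polygon."""
    total = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:] + points[:1]):
        total += x0 * y1 - x1 * y0
    return abs(total) / 2


# With angle 0 the rotated box reduces to an ordinary horizontal box.
horizontal = rotated_box_corners(10, 5, 4, 2, 0.0)
tilted = rotated_box_corners(10, 5, 4, 2, math.radians(30))
print(horizontal)
print(polygon_area(tilted))
```

Rotation leaves the enclosed area unchanged, so a rotated box can hug an elongated, tilted target that a horizontal box could only enclose together with a large amount of background.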
Funding: Supported in part by the National Natural Science Foundation of China (No. 62471034) and the Hebei Natural Science Foundation (No. F2023105001).
Abstract: In the field of remote sensing, the rapid and accurate acquisition of the category and location of airplanes has emerged as a prominent research topic. However, blurry remote sensing imagery and complex environmental interference degrade airplane detection. In addition, the inconsistent sizes of remote sensing images and the low accuracy of small target detection are crucial challenges that need to be addressed. To tackle these issues, we propose a novel network, SDaDCS (SAHI-data augmentation-dilation-channel and spatial attention), based on the YOLOX model, the slicing aided hyper inference (SAHI) framework, a new data augmentation technique, and a dilation-channel and spatial (DCS) attention mechanism. First, we create a remote sensing dataset for airplane targets and introduce a new data augmentation technique based on Rotate-Mixup and mixed data augmentation to enhance data diversity. The DCS attention mechanism, which comprises a dilated convolution block, channel attention, and spatial attention, is designed to strengthen the feature extraction and discrimination of the network. To address the difficulty of detecting small targets, we integrate the YOLOX model with the SAHI framework. Experimental results show that, compared with the original YOLOX model, the proposed SDaDCS remote sensing target detection algorithm improves overall accuracy by 13.6%, which validates the effectiveness of the proposed algorithm.
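The slicing step that sliced inference relies on can be sketched as follows. This is a hand-written illustration of the general idea (cut the image into overlapping tiles, detect per tile, map boxes back), not the SAHI library's API; the function names, the 512-pixel tile size, and the 0.2 overlap ratio are assumptions.

```python
def slice_windows(image_w, image_h, tile=512, overlap=0.2):
    """Return (x0, y0, x1, y1) tiles that cover the image with fractional overlap."""
    if not 0 <= overlap < 1:
        raise ValueError("overlap must be in [0, 1)")
    stride = max(1, int(tile * (1 - overlap)))

    def starts(length):
        # One tile is enough when the image side fits inside a tile.
        if length <= tile:
            return [0]
        positions = list(range(0, length - tile, stride))
        positions.append(length - tile)  # last tile sits flush with the border
        return positions

    return [
        (x, y, min(x + tile, image_w), min(y + tile, image_h))
        for y in starts(image_h)
        for x in starts(image_w)
    ]


def to_full_image(box, window):
    """Shift a box predicted in tile coordinates back to full-image coordinates."""
    x0, y0 = window[0], window[1]
    bx0, by0, bx1, by1 = box
    return (bx0 + x0, by0 + y0, bx1 + x0, by1 + y0)


windows = slice_windows(1000, 600)
print(len(windows), windows[0], windows[-1])
print(to_full_image((10, 20, 30, 40), windows[-1]))
```

Small targets occupy a larger share of each tile than of the full image, which is why per-tile detection helps; the shifted boxes from overlapping tiles would still need to be merged, for example with non-maximum suppression, which this sketch omits.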
Abstract: Given its important military and civilian applications, ship detection and classification based on optical remote sensing images has attracted considerable attention in the field of sea-surface remote sensing. This article surveys ship detection and classification methods that have been tested in practice on optical remote sensing images, and presents their corresponding feature extraction strategies and statistical data. Basic feature extraction strategies and algorithms are analyzed in terms of their performance and application in ship detection and classification. Furthermore, publicly available datasets that can serve as benchmarks to verify the effectiveness and objectivity of ship detection and classification methods are summarized. Based on this analysis, the remaining problems and future development trends of ship detection and classification methods based on optical remote sensing images are discussed.
Funding: Supported by the National Natural Science Foundation of China (No. 52422901).
Abstract: The latest UN environmental report warns that the world is facing three interconnected crises: climate change, environmental pollution, and biodiversity loss. Notably, under the dual pressures of extreme weather events and long-term environmental pollution, both the rate of species extinction and the extent of habitat degradation have reached unprecedented levels. Efficient monitoring and prediction of high-risk areas and critical periods are essential to mitigating biodiversity loss. Traditional methods, which rely heavily on in-situ ground observations, often provide only localized snapshots of ecosystem change. These approaches fall short of the need for global, cross-scale identification and early warning of biodiversity degradation risks. A multi-scale remote sensing network offers a powerful solution, enabling early detection and timely mitigation by integrating satellite observations, aerial drones, ground-based monitoring stations, and oceanic sensors. The fusion of complementary data sources enhances both spatial resolution and coverage, allowing for dynamic tracking of biodiversity across scales. To operationalize such a system globally, international collaboration, open data access, standardized indicators, and strategic technological investments are crucial. Taken together, this integrated framework will strengthen real-time ecological forecasting and support proactive policy and conservation responses.
Abstract: In oriented bounding box (OBB) object detection for high-resolution remote sensing images, small targets are often missed or falsely detected because they are very small and vary in orientation. Existing OBB detection methods for remote sensing images have made good progress but focus mainly on orientation modeling and give less consideration to object size and missed detections. In this study, a method based on an improved YOLOv8 is proposed for detecting oriented objects in remote sensing images with higher precision. First, a ResCBAMG module is designed to better extract channel and spatial correlation information. Second, a top-down feature fusion layer structure is proposed in conjunction with the Efficient Channel Attention (ECA) module, which helps to capture local cross-channel interaction information. Finally, a ResCBAMG module is introduced between the C2f modules and the detection heads of the bottom-up feature fusion layer. This structure helps the model focus on the target area and improves the precision and robustness of oriented target detection. Experimental results on the DOTA-v1.5 dataset show that the improved model outperforms the original model in Precision, mAP@0.5, and mAP@0.5:0.95. The improvement is effective for detecting small targets and for complex scenes.
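The local cross-channel interaction that ECA-style attention captures can be sketched in plain Python: average each channel, run a small 1-D convolution across neighbouring channel averages, squash with a sigmoid, and rescale each channel. The fixed kernel (0.25, 0.5, 0.25) and the function names are assumptions for illustration; in a trained network the kernel weights are learned, and this is not the authors' ResCBAMG module.

```python
import math


def eca_weights(channel_means, kernel):
    """Sigmoid of a zero-padded, same-length 1-D convolution across channels."""
    half = len(kernel) // 2
    n = len(channel_means)
    weights = []
    for c in range(n):
        acc = 0.0
        for j, k in enumerate(kernel):
            idx = c - half + j  # neighbouring channels only: local interaction
            if 0 <= idx < n:
                acc += k * channel_means[idx]
        weights.append(1.0 / (1.0 + math.exp(-acc)))
    return weights


def eca(feature_map, kernel=(0.25, 0.5, 0.25)):
    """Reweight a feature map given as a list of channels, each a list of rows."""
    means = [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature_map
    ]
    weights = eca_weights(means, kernel)
    return [
        [[value * w for value in row] for row in channel]
        for channel, w in zip(feature_map, weights)
    ]


features = [[[2.0, 2.0]], [[0.0, 0.0]], [[4.0, 4.0]]]  # 3 channels, 1x2 each
gates = eca_weights([2.0, 0.0, 4.0], (0.25, 0.5, 0.25))
print(gates)
print(eca(features))
```

Because each gate depends only on a channel and its immediate neighbours, the mechanism adds very few parameters (one small kernel) compared with fully connected channel attention.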
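Abstract: To address the insufficient learning of large-aspect-ratio objects in remote sensing images, which is caused by a shortage of positive samples, a remote sensing oriented object detection network based on shape-adaptive label assignment (shape-adaptive label assignment for oriented object detection network, SALANet) is proposed. First, an aspect-ratio sensitivity coefficient is introduced to establish a dynamic mapping between an object's geometric features and its number of positive samples, alleviating the imbalanced sample distribution caused by the fixed assignment rules of traditional methods. Second, an adaptive label assignment strategy is designed that selects high-quality positive samples by ranking candidates by intersection over union (IoU). Finally, a central-axis prior is proposed that extends the circular center prior region to a rectangular region along the object's central axis, strengthening the geometric feature representation of large-aspect-ratio objects. Comparative experiments on the DOTAv1.0 and HRSC2016 datasets show that SALANet achieves a mean average precision (mAP) of 0.7771 and 0.9323, respectively, which is 8.15% and 2.87% higher than the baseline method RoI Transformer.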
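The general shape of an IoU-ranked, shape-adaptive assignment can be sketched as below. The abstract does not give SALANet's formulas, so everything specific here is hypothetical: the budget rule `base_k + int(alpha * log2(aspect))`, the parameter names, and the use of axis-aligned IoU in place of rotated-box IoU are assumptions made only to keep the example short and runnable.

```python
import math


def iou(a, b):
    """IoU of two axis-aligned boxes (x0, y0, x1, y1); a simplification of rotated IoU."""
    ix0, iy0 = max(a[0], b[0]), max(a[1], b[1])
    ix1, iy1 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix1 - ix0) * max(0.0, iy1 - iy0)
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def positives_budget(gt, base_k=3, alpha=1.0):
    """HYPOTHETICAL mapping: elongated boxes receive more positive samples.

    Assumes the ground-truth box has non-zero width and height.
    """
    w, h = gt[2] - gt[0], gt[3] - gt[1]
    aspect = max(w, h) / min(w, h)
    return base_k + int(alpha * math.log2(aspect))


def assign_positives(gt, candidates, base_k=3, alpha=1.0):
    """Indices of the top-k candidates ranked by IoU with the ground truth."""
    k = positives_budget(gt, base_k, alpha)
    ranked = sorted(
        range(len(candidates)),
        key=lambda i: iou(gt, candidates[i]),
        reverse=True,
    )
    return ranked[:k]


gt_box = (0, 0, 10, 10)
candidates = [(20, 20, 30, 30), (0, 0, 10, 10), (5, 0, 15, 10), (1, 0, 11, 10), (8, 0, 18, 10)]
print(positives_budget(gt_box), positives_budget((0, 0, 80, 10)))
print(assign_positives(gt_box, candidates))
```

With a fixed budget, a long thin object would receive the same number of positives as a square one even though fewer candidates overlap it well; letting the budget grow with the aspect ratio is one simple way to express the imbalance the abstract describes.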