High-resolution remote sensing images(HRSIs)are now an essential data source for gathering surface information due to advancements in remote sensing data capture technologies.However,their significant scale changes an...High-resolution remote sensing images(HRSIs)are now an essential data source for gathering surface information due to advancements in remote sensing data capture technologies.However,their significant scale changes and wealth of spatial details pose challenges for semantic segmentation.While convolutional neural networks(CNNs)excel at capturing local features,they are limited in modeling long-range dependencies.Conversely,transformers utilize multihead self-attention to integrate global context effectively,but this approach often incurs a high computational cost.This paper proposes a global-local multiscale context network(GLMCNet)to extract both global and local multiscale contextual information from HRSIs.A detail-enhanced filtering module(DEFM)is proposed at the end of the encoder to refine the encoder outputs further,thereby enhancing the key details extracted by the encoder and effectively suppressing redundant information.In addition,a global-local multiscale transformer block(GLMTB)is proposed in the decoding stage to enable the modeling of rich multiscale global and local information.We also design a stair fusion mechanism to transmit deep semantic information from deep to shallow layers progressively.Finally,we propose the semantic awareness enhancement module(SAEM),which further enhances the representation of multiscale semantic features through spatial attention and covariance channel attention.Extensive ablation analyses and comparative experiments were conducted to evaluate the performance of the proposed method.Specifically,our method achieved a mean Intersection over Union(mIoU)of 86.89%on the ISPRS Potsdam dataset and 84.34%on the ISPRS Vaihingen dataset,outperforming existing models such as ABCNet and BANet.展开更多
While algorithms have been created for land usage in urban settings,there have been few investigations into the extraction of urban footprint(UF).To address this research gap,the study employs several widely used imag...While algorithms have been created for land usage in urban settings,there have been few investigations into the extraction of urban footprint(UF).To address this research gap,the study employs several widely used image classification method classified into three categories to evaluate their segmentation capabilities for extracting UF across eight cities.The results indicate that pixel-based methods only excel in clear urban environments,and their overall accuracy is not consistently high.RF and SVM perform well but lack stability in object-based UF extraction,influenced by feature selection and classifier performance.Deep learning enhances feature extraction but requires powerful computing and faces challenges with complex urban layouts.SAM excels in medium-sized urban areas but falters in intricate layouts.Integrating traditional and deep learning methods optimizes UF extraction,balancing accuracy and processing efficiency.Future research should focus on adapting algorithms for diverse urban landscapes to enhance UF extraction accuracy and applicability.展开更多
Landslide is one of the multitudinous serious geological hazards. The key to its control and reduction lies on dynamic monitoring and early warning. The article points out the insufficiency of traditional measuring me...Landslide is one of the multitudinous serious geological hazards. The key to its control and reduction lies on dynamic monitoring and early warning. The article points out the insufficiency of traditional measuring means applied for large-scale landslide monitoring and proposes the method for extensive landslide displacement field monitoring using high- resolution remote images. Matching of cognominal points is realized by using the invariant features of SIFT algorithm in image translation, rotation, zooming, and affine transformation, and through recognition and comparison of characteristics of high-resolution images in different landsliding periods. Following that, landslide displacement vector field can be made known by measuring the distances and directions between cognominal points. As evidenced by field application of the method for landslide monitoring at West Open Mine in Fushun city of China, the method has the attraction of being able to make areal measurement through satellite observation and capable of obtaining at the same time the information of large- area intensive displacement field, for facilitating automatic delimitation of extent of landslide displacement vector field and sliding mass. This can serve as a basis for making analysis of laws governing occurrence of landslide and adoption of countermeasures.展开更多
Objective Nowadays, high-resolution remote sensing technology has brought new changes to surveys of earthquakes, and the quantitative study of seismic faults based on this technology has become a trend in the world(Ba...Objective Nowadays, high-resolution remote sensing technology has brought new changes to surveys of earthquakes, and the quantitative study of seismic faults based on this technology has become a trend in the world(Barzegari et al., 2017). An Mw 7.2 earthquake occurred in Yutian of Xinjiang on the western end of the Altyn Tagh fault on March 21 st, 2008. It is difficult to access this depopulated zone because of the high altitude and only 1–2 months of snowmelt. This study utilized high-resolution展开更多
On the basis of realization of beach information and its differentiating of high-resolution remote sensing image on coastal zone, extracting objects are carried through RS multi-scale diagnostic analysis, and fast inf...On the basis of realization of beach information and its differentiating of high-resolution remote sensing image on coastal zone, extracting objects are carried through RS multi-scale diagnostic analysis, and fast information extraction methods and key technologies are put forward. Meanwhile image segmentation methods are set forth for objects of coastal zone. And through the application of Otsu2D to the segmentation of water area and dock and the applying of Gabor filter to the separation and extraction of construction, some typical applications of high-resolution RS image are presented in the field of coastal zone surface objects' recognition. Quantizing high-resolution RS information on the coastal zone proved to be of great scientific and practical significance for coastal development and management.展开更多
The exploration of building detection plays an important role in urban planning,smart city and military.Aiming at the problem of high overlapping ratio of detection frames for dense building detection in high resoluti...The exploration of building detection plays an important role in urban planning,smart city and military.Aiming at the problem of high overlapping ratio of detection frames for dense building detection in high resolution remote sensing images,we present an effective YOLOv3 framework,corner regression-based YOLOv3(Correg-YOLOv3),to localize dense building accurately.This improved YOLOv3 algorithm establishes a vertex regression mechanism and an additional loss item about building vertex offsets relative to the center point of bounding box.By extending output dimensions,the trained model is able to output the rectangular bounding boxes and the building vertices meanwhile.Finally,we evaluate the performance of the Correg-YOLOv3 on our self-produced data set and provide a comparative analysis qualitatively and quantitatively.The experimental results achieve high performance in precision(96.45%),recall rate(95.75%),F1 score(96.10%)and average precision(98.05%),which were 2.73%,5.4%,4.1%and 4.73%higher than that of YOLOv3.Therefore,our proposed algorithm effectively tackles the problem of dense building detection in high resolution images.展开更多
It is proposed a high resolution remote sensing image segmentation method which combines static minimum spanning tree(MST)tessellation considering shape information and the RHMRF-FCM algorithm.It solves the problems i...It is proposed a high resolution remote sensing image segmentation method which combines static minimum spanning tree(MST)tessellation considering shape information and the RHMRF-FCM algorithm.It solves the problems in the traditional pixel-based HMRF-FCM algorithm in which poor noise resistance and low precision segmentation in a complex boundary exist.By using the MST model and shape information,the object boundary and geometrical noise can be expressed and reduced respectively.Firstly,the static MST tessellation is employed for dividing the image domain into some sub-regions corresponding to the components of homogeneous regions needed to be segmented.Secondly,based on the tessellation results,the RHMRF model is built,and regulation terms considering the KL information and the information entropy are introduced into the FCM objective function.Finally,the partial differential method and Lagrange function are employed to calculate the parameters of the fuzzy objective function for obtaining the global optimal segmentation results.To verify the robustness and effectiveness of the proposed algorithm,the experiments are carried out with WorldView-3(WV-3)high resolution image.The results from proposed method with different parameters and comparing methods(multi-resolution method and watershed segmentation method in eCognition software)are analyzed qualitatively and quantitatively.展开更多
While executing tasks such as ocean pollution monitoring,maritime rescue,geographic mapping,and automatic navigation utilizing remote sensing images,the coastline feature should be determined.Traditional methods are n...While executing tasks such as ocean pollution monitoring,maritime rescue,geographic mapping,and automatic navigation utilizing remote sensing images,the coastline feature should be determined.Traditional methods are not satisfactory to extract coastline in high-resolution panchromatic remote sensing image.Active contour model,also called snakes,have proven useful for interactive specification of image contours,so it is used as an effective coastlines extraction technique.Firstly,coastlines are detected by water segmentation and boundary tracking,which are considered initial contours to be optimized through active contour model.As better energy functions are developed,the power assist of snakes becomes effective.New internal energy has been done to reduce problems caused by convergence to local minima,and new external energy can greatly enlarge the capture region around features of interest.After normalization processing,energies are iterated using greedy algorithm to accelerate convergence rate.The experimental results encompassed examples in images and demonstrated the capabilities and efficiencies of the improvement.展开更多
Cloud detection from satellite and drone imagery is crucial for applications such as weather forecasting and environmentalmonitoring.Addressing the limitations of conventional convolutional neural networks,we propose ...Cloud detection from satellite and drone imagery is crucial for applications such as weather forecasting and environmentalmonitoring.Addressing the limitations of conventional convolutional neural networks,we propose an innovative transformer-based method.This method leverages transformers,which are adept at processing data sequences,to enhance cloud detection accuracy.Additionally,we introduce a Cyclic Refinement Architecture that improves the resolution and quality of feature extraction,thereby aiding in the retention of critical details often lost during cloud detection.Our extensive experimental validation shows that our approach significantly outperforms established models,excelling in high-resolution feature extraction and precise cloud segmentation.By integrating Positional Visual Transformers(PVT)with this architecture,our method advances high-resolution feature delineation and segmentation accuracy.Ultimately,our research offers a novel perspective for surmounting traditional challenges in cloud detection and contributes to the advancement of precise and dependable image analysis across various domains.展开更多
When existing deep learning models are used for road extraction tasks from high-resolution images,they are easily affected by noise factors such as tree and building occlusion and complex backgrounds,resulting in inco...When existing deep learning models are used for road extraction tasks from high-resolution images,they are easily affected by noise factors such as tree and building occlusion and complex backgrounds,resulting in incomplete road extraction and low accuracy.We propose the introduction of spatial and channel attention modules to the convolutional neural network ConvNeXt.Then,ConvNeXt is used as the backbone network,which cooperates with the perceptual analysis network UPerNet,retains the detection head of the semantic segmentation,and builds a new model ConvNeXt-UPerNet to suppress noise interference.Training on the open-source DeepGlobe and CHN6-CUG datasets and introducing the DiceLoss on the basis of CrossEntropyLoss solves the problem of positive and negative sample imbalance.Experimental results show that the new network model can achieve the following performance on the DeepGlobe dataset:79.40%for precision(Pre),97.93% for accuracy(Acc),69.28% for intersection over union(IoU),and 83.56% for mean intersection over union(MIoU).On the CHN6-CUG dataset,the model achieves the respective values of 78.17%for Pre,97.63%for Acc,65.4% for IoU,and 81.46% for MIoU.Compared with other network models,the fused ConvNeXt-UPerNet model can extract road information better when faced with the influence of noise contained in high-resolution remote sensing images.It also achieves multiscale image feature information with unified perception,ultimately improving the generalization ability of deep learning technology in extracting complex roads from high-resolution remote sensing images.展开更多
Significant advancements have been achieved in road surface extraction based on high-resolution remote sensingimage processing. Most current methods rely on fully supervised learning, which necessitates enormous human...Significant advancements have been achieved in road surface extraction based on high-resolution remote sensingimage processing. Most current methods rely on fully supervised learning, which necessitates enormous humaneffort to label the image. Within this field, other research endeavors utilize weakly supervised methods. Theseapproaches aim to reduce the expenses associated with annotation by leveraging sparsely annotated data, such asscribbles. This paper presents a novel technique called a weakly supervised network using scribble-supervised andedge-mask (WSSE-net). This network is a three-branch network architecture, whereby each branch is equippedwith a distinct decoder module dedicated to road extraction tasks. One of the branches is dedicated to generatingedge masks using edge detection algorithms and optimizing road edge details. The other two branches supervise themodel’s training by employing scribble labels and spreading scribble information throughout the image. To addressthe historical flaw that created pseudo-labels that are not updated with network training, we use mixup to blendprediction results dynamically and continually update new pseudo-labels to steer network training. Our solutiondemonstrates efficient operation by simultaneously considering both edge-mask aid and dynamic pseudo-labelsupport. The studies are conducted on three separate road datasets, which consist primarily of high-resolutionremote-sensing satellite photos and drone images. The experimental findings suggest that our methodologyperforms better than advanced scribble-supervised approaches and specific traditional fully supervised methods.展开更多
This paper introduces the applications of high-resolution remote sensing imagery and the necessity of geometric calibration for remote sensing sensors considering assurance of the geometric accuracy of remote sensing ...This paper introduces the applications of high-resolution remote sensing imagery and the necessity of geometric calibration for remote sensing sensors considering assurance of the geometric accuracy of remote sensing imagery. Then the paper analyzes the general methodology of geometric calibration. Taking the DMC sensor geometric calibration as an example, the paper discusses the whole calibration procedure. Finally, it gave some concluding remarks on geometric calibration of high-resolution remote sensing sensors.展开更多
Traditional Chinese villages,vital carriers of traditional culture,have faced significant alterations due to urbanization in recent years,urgently necessitating artificial intelligence data updates.This study integrat...Traditional Chinese villages,vital carriers of traditional culture,have faced significant alterations due to urbanization in recent years,urgently necessitating artificial intelligence data updates.This study integrates high spatial resolution remote sensing imagery with deep learning techniques,proposing a novel method for identifying rooftops of traditional Chinese village buildings using high-definition remote sensing images.Using 0.54 m spatial resolution imagery of traditional village areas as the data source,this method analyzes the geometric and spectral image characteristics of village building rooftops.It constructs a deep learning feature sample library tailored to the target types.Employing a semantically enhanced version of the improved Mask R-CNN(Mask Region-based Convolutional Neural Network)for building recognition,the study conducts experiments on localized imagery from different regions.The results demonstrated that the modified Mask R-CNN effectively identifies traditional village building rooftops,achieving an of 0.7520 and an of 0.7400.It improves the current problem of misidentification and missed detection caused by feature heterogeneity.This method offers a viable and effective approach for industrialized data monitoring of traditional villages,contributing to their sustainable development.展开更多
Tailings ponds are critical facilities in the mining industry,and accurate monitoring and management of these ponds are of paramount importance.However,conventional object detection methodologies,including recent adva...Tailings ponds are critical facilities in the mining industry,and accurate monitoring and management of these ponds are of paramount importance.However,conventional object detection methodologies,including recent advancements,often face significant challenges in addressing the complexities inherent to tailings pond environments.This is particularly due to deficiencies in their loss function design,which can result in protracted convergence times and suboptimal performance when detecting smaller targets.In this study,we introduce an innovative loss function termed the Rapid Intersection over Union(RIoU)loss function,which incorporates a focal weight and is integrated into the YOLOv5 object detection framework to develop the YOLOv5-RF model.This approach aims to enhance both convergence speed and improve convergence accuracy in the tailings pond identification process by comprehensively addressing the specific challenges posed by complex environmental conditions,thereby enhancing the precision and robustness of tailings pond target detection.It integrates the concepts of the central triangle and the aspect ratio of the circumscribed rectangle,assigning specific weights and penalty terms to optimize the model’s performance in object detection tasks.We validated the efficacy of YOLOv5-RF through simulation experiments and high-resolution remote sensing images of tailings ponds.The experimental results indicate that RIoU facilitates faster convergence rates.Specifically,YOLOv5-RF achieves accuracy and recall rates that are 2%and 2.1%higher than those of YOLOv5,respectively.Furthermore,it completes 120 iterations in 1.08 hours less time compared to its predecessor model while exhibiting an inference time that is 11.7 ms shorter than that for YOLOv5.These findings suggest that our model significantly enhances processing speed without compromising accuracy levels.This research offers novel technical approaches as well as theoretical support for monitoring tailings ponds using computer vision and remote sensing technologies.展开更多
Remote sensing images contain a wealth of geospatial information.To accurately identify different geospatial categories and extract relevant data,image semantic segmentation plays a crucial role.In recent years,deep l...Remote sensing images contain a wealth of geospatial information.To accurately identify different geospatial categories and extract relevant data,image semantic segmentation plays a crucial role.In recent years,deep learning technology has brought significant breakthroughs to semantic segmentation of remote sensing images,significantly enhancing its performance.This paper investigates the application of deep learning technologies in remote sensing image semantic segmentation,based on Convolutional Neural Networks(CNN)and Transformer-based semantic segmentation methods.It conducts an in-depth comparison of their structural characteristics and applicable scenarios,summarizes the achievements and shortcomings of existing research,and provides technical references and theoretical support for future studies,thereby contributing to the further development of deep learning technology in the field of remote sensing.Research results indicate that CNN-based semantic segmentation methods still hold advantages in extracting local features and achieving efficient segmentation,whereas Transformers address CNN's limitations in global context modeling and long-range dependency capture.Therefore,the collaborative integration of CNN and Transformers will become an important research direction for enhancing model performance in the future.展开更多
Accurate and efficient detection of building changes in remote sensing imagery is crucial for urban planning,disaster emergency response,and resource management.However,existing methods face challenges such as spectra...Accurate and efficient detection of building changes in remote sensing imagery is crucial for urban planning,disaster emergency response,and resource management.However,existing methods face challenges such as spectral similarity between buildings and backgrounds,sensor variations,and insufficient computational efficiency.To address these challenges,this paper proposes a novel Multi-scale Efficient Wavelet-based Change Detection Network(MewCDNet),which integrates the advantages of Convolutional Neural Networks and Transformers,balances computational costs,and achieves high-performance building change detection.The network employs EfficientNet-B4 as the backbone for hierarchical feature extraction,integrates multi-level feature maps through a multi-scale fusion strategy,and incorporates two key modules:Cross-temporal Difference Detection(CTDD)and Cross-scale Wavelet Refinement(CSWR).CTDD adopts a dual-branch architecture that combines pixel-wise differencing with semanticaware Euclidean distance weighting to enhance the distinction between true changes and background noise.CSWR integrates Haar-based Discrete Wavelet Transform with multi-head cross-attention mechanisms,enabling cross-scale feature fusion while significantly improving edge localization and suppressing spurious changes.Extensive experiments on four benchmark datasets demonstrate MewCDNet’s superiority over comparison methods:achieving F1 scores of 91.54%on LEVIR,93.70%on WHUCD,and 64.96%on S2Looking for building change detection.Furthermore,MewCDNet exhibits optimal performance on the multi-class⋅SYSU dataset(F1:82.71%),highlighting its exceptional generalization capability.展开更多
Deep learning has made significant progress in the field of oriented object detection for remote sensing images.However,existing methods still face challenges when dealing with difficult tasks such as multi-scale targ...Deep learning has made significant progress in the field of oriented object detection for remote sensing images.However,existing methods still face challenges when dealing with difficult tasks such as multi-scale targets,complex backgrounds,and small objects in remote sensing.Maintaining model lightweight to address resource constraints in remote sensing scenarios while improving task completion for remote sensing tasks remains a research hotspot.Therefore,we propose an enhanced multi-scale feature extraction lightweight network EM-YOLO based on the YOLOv8s architecture,specifically optimized for the characteristics of large target scale variations,diverse orientations,and numerous small objects in remote sensing images.Our innovations lie in two main aspects:First,a dynamic snake convolution(DSC)is introduced into the backbone network to enhance the model’s feature extraction capability for oriented targets.Second,an innovative focusing-diffusion module is designed in the feature fusion neck to effectively integrate multi-scale feature information.Finally,we introduce Layer-Adaptive Sparsity for magnitude-based Pruning(LASP)method to perform lightweight network pruning to better complete tasks in resource-constrained scenarios.Experimental results on the lightweight platform Orin demonstrate that the proposed method significantly outperforms the original YOLOv8s model in oriented remote sensing object detection tasks,and achieves comparable or superior performance to state-of-the-art methods on three authoritative remote sensing datasets(DOTA v1.0,DOTA v1.5,and HRSC2016).展开更多
Desert shrubs are indispensable in maintaining ecological stability by reducing soil erosion,enhancing water retention,and boosting soil fertility,which are critical factors in mitigating desertification processes.Due...Desert shrubs are indispensable in maintaining ecological stability by reducing soil erosion,enhancing water retention,and boosting soil fertility,which are critical factors in mitigating desertification processes.Due to the complex topography,variable climate,and challenges in field surveys in desert regions,this paper proposes YOLO-Desert-Shrub(YOLO-DS),a detection method for identifying desert shrubs in UAV remote sensing images based on an enhanced YOLOv8n framework.This method accurately identifying shrub species,locations,and coverage.To address the issue of small individual plants dominating the dataset,the SPDconv convolution module is introduced in the Backbone and Neck layers of the YOLOv8n model,replacing conventional convolutions.This structural optimization mitigates information degradation in fine-grained data while strengthening discriminative feature capture across spatial scales within desert shrub datasets.Furthermore,a structured state-space model is integrated into the main network,and the MambaLayer is designed to dynamically extract and refine shrub-specific features from remote sensing images,effectively filtering out background noise and irrelevant interference to enhance feature representation.Benchmark evaluations reveal the YOLO-DS framework attains 79.56%mAP40weight,demonstrating 2.2%absolute gain versus the baseline YOLOv8n architecture,with statistically significant advantages over contemporary detectors in cross-validation trials.The predicted plant coverage exhibits strong consistency with manually measured coverage,with a coefficient of determination(R^(2))of 0.9148 and a Root Mean Square Error(RMSE)of1.8266%.The proposed UAV-based remote sensing method utilizing the YOLO-DS effectively identify and locate desert shrubs,monitor canopy sizes and distribution,and provide technical support for automated desert shrub monitoring.展开更多
High-resolution remote sensing imagery is essential for critical applications such as precision agriculture,urban management planning,and military reconnaissance.Although significant progress has been made in singleim...High-resolution remote sensing imagery is essential for critical applications such as precision agriculture,urban management planning,and military reconnaissance.Although significant progress has been made in singleimage super-resolution(SISR)using generative adversarial networks(GANs),existing approaches still face challenges in recovering high-frequency details,effectively utilizing features,maintaining structural integrity,and ensuring training stability—particularly when dealing with the complex textures characteristic of remote sensing imagery.To address these limitations,this paper proposes the Improved ResidualModule and AttentionMechanism Network(IRMANet),a novel architecture specifically designed for remote sensing image reconstruction.IRMANet builds upon the Super-Resolution Generative Adversarial Network(SRGAN)framework and introduces several key innovations.First,the Enhanced Residual Unit(ERU)enhances feature reuse and stabilizes training through deep residual connections.Second,the Self-Attention Residual Block(SARB)incorporates a self-attentionmechanism into the Improved Residual Module(IRM)to effectivelymodel long-range dependencies and automatically emphasize salient features.Additionally,the IRM adopts amulti-scale feature fusion strategy to facilitate synergistic interactions between local detail and global semantic information.The effectiveness of each component is validated through ablation studies,while comprehensive comparative experiments on standard remote sensing datasets demonstrate that IRMANet significantly outperforms both the baseline and state-of-the-art methods in terms of perceptual quality and quantitative metrics.Specifically,compared to the baseline model,at a magnification factor of 2,IRMANet achieves an improvement of 0.24 dB in peak signal-to-noise ratio(PSNR)and 0.54 in structural similarity index(SSIM);at a magnification factor of 4,it achieves gains of 0.22 dB in PSNR and 0.51 in SSIM.These results confirm that the proposedmethod effectively enhances detail representation and structural reconstruction accuracy in complex remote sensing scenarios,offering robust technical support for high-precision detection and identification of both military and civilian aircraft.展开更多
Agricultural greenhouses(AGHs)are increasingly used globally to control the crop growth environment,which are vital for food production,resource conservation,and rural economies.Advances in high-quality data acquisiti...Agricultural greenhouses(AGHs)are increasingly used globally to control the crop growth environment,which are vital for food production,resource conservation,and rural economies.Advances in high-quality data acquisition methods and information retrieval algorithms have improved the ability to extract AGHs from remote sensing images(e.g.,satellite and uncrewed aerial vehicle(UAV)).Research on this topic began in 1989,and the number of related studies has increased annually.This paper provides a review of the development of remote sensing of AGHs and research hotspots.It summarizes the current status and trends of data sources,identification features,methods,and accuracy of AGHs extraction.Due to the unique spectral,textural,and geometric characteristics of AGHs,research studies have primarily utilized optical remote sensing data from sensors with spatial resolutions of 30 m or more,such as Landsat,Sentinel,Gaofen(GF),and Worldview,to extract AGHs.Machine learning and deep learning methods have provided more precise results for extracting AGHs than threshold segmentation methods.In contrast,deep learning algorithms have been primarily used with high-spatial resolution data and small-scale study areas,with accuracy rates generally exceeding 90.00%.However,future research may use higher spatial resolution images to improve the accuracy and detail of AGH extraction.Recent studies have integrated multiple data sources and performed time-series analysis to improve monitoring of dynamic changes in AGHs.Moreover,emphasis should be placed on optimizing data fusion techniques,implementing sample transfer methods,expanding the number of sensors,and increasing the application of artificial intelligence(AI)in monitoring AGHs.These efforts will provide more reliable methods and tools to improve agricultural production and resource utilization efficiency.This review provides resources for researchers and decision-makers involved in modern agricultural development,as well as scientific evidence for the sustainable development of rural areas.展开更多
基金provided by the Science Research Project of Hebei Education Department under grant No.BJK2024115.
文摘High-resolution remote sensing images(HRSIs)are now an essential data source for gathering surface information due to advancements in remote sensing data capture technologies.However,their significant scale changes and wealth of spatial details pose challenges for semantic segmentation.While convolutional neural networks(CNNs)excel at capturing local features,they are limited in modeling long-range dependencies.Conversely,transformers utilize multihead self-attention to integrate global context effectively,but this approach often incurs a high computational cost.This paper proposes a global-local multiscale context network(GLMCNet)to extract both global and local multiscale contextual information from HRSIs.A detail-enhanced filtering module(DEFM)is proposed at the end of the encoder to refine the encoder outputs further,thereby enhancing the key details extracted by the encoder and effectively suppressing redundant information.In addition,a global-local multiscale transformer block(GLMTB)is proposed in the decoding stage to enable the modeling of rich multiscale global and local information.We also design a stair fusion mechanism to transmit deep semantic information from deep to shallow layers progressively.Finally,we propose the semantic awareness enhancement module(SAEM),which further enhances the representation of multiscale semantic features through spatial attention and covariance channel attention.Extensive ablation analyses and comparative experiments were conducted to evaluate the performance of the proposed method.Specifically,our method achieved a mean Intersection over Union(mIoU)of 86.89%on the ISPRS Potsdam dataset and 84.34%on the ISPRS Vaihingen dataset,outperforming existing models such as ABCNet and BANet.
文摘While algorithms have been created for land usage in urban settings,there have been few investigations into the extraction of urban footprint(UF).To address this research gap,the study employs several widely used image classification method classified into three categories to evaluate their segmentation capabilities for extracting UF across eight cities.The results indicate that pixel-based methods only excel in clear urban environments,and their overall accuracy is not consistently high.RF and SVM perform well but lack stability in object-based UF extraction,influenced by feature selection and classifier performance.Deep learning enhances feature extraction but requires powerful computing and faces challenges with complex urban layouts.SAM excels in medium-sized urban areas but falters in intricate layouts.Integrating traditional and deep learning methods optimizes UF extraction,balancing accuracy and processing efficiency.Future research should focus on adapting algorithms for diverse urban landscapes to enhance UF extraction accuracy and applicability.
文摘Landslide is one of the multitudinous serious geological hazards. The key to its control and reduction lies on dynamic monitoring and early warning. The article points out the insufficiency of traditional measuring means applied for large-scale landslide monitoring and proposes the method for extensive landslide displacement field monitoring using high- resolution remote images. Matching of cognominal points is realized by using the invariant features of SIFT algorithm in image translation, rotation, zooming, and affine transformation, and through recognition and comparison of characteristics of high-resolution images in different landsliding periods. Following that, landslide displacement vector field can be made known by measuring the distances and directions between cognominal points. As evidenced by field application of the method for landslide monitoring at West Open Mine in Fushun city of China, the method has the attraction of being able to make areal measurement through satellite observation and capable of obtaining at the same time the information of large- area intensive displacement field, for facilitating automatic delimitation of extent of landslide displacement vector field and sliding mass. This can serve as a basis for making analysis of laws governing occurrence of landslide and adoption of countermeasures.
基金supported by the National Natural Science Foundation of China (grants No. 41461164002 and 41631073)
文摘Objective Nowadays, high-resolution remote sensing technology has brought new changes to surveys of earthquakes, and the quantitative study of seismic faults based on this technology has become a trend in the world(Barzegari et al., 2017). An Mw 7.2 earthquake occurred in Yutian of Xinjiang on the western end of the Altyn Tagh fault on March 21 st, 2008. It is difficult to access this depopulated zone because of the high altitude and only 1–2 months of snowmelt. This study utilized high-resolution
文摘On the basis of realization of beach information and its differentiating of high-resolution remote sensing image on coastal zone, extracting objects are carried through RS multi-scale diagnostic analysis, and fast information extraction methods and key technologies are put forward. Meanwhile image segmentation methods are set forth for objects of coastal zone. And through the application of Otsu2D to the segmentation of water area and dock and the applying of Gabor filter to the separation and extraction of construction, some typical applications of high-resolution RS image are presented in the field of coastal zone surface objects' recognition. Quantizing high-resolution RS information on the coastal zone proved to be of great scientific and practical significance for coastal development and management.
基金National Natural Science Foundation of China(No.41871305)National Key Research and Development Program of China(No.2017YFC0602204)+2 种基金Fundamental Research Funds for the Central Universities,China University of Geosciences(Wuhan)(No.CUGQY1945)Open Fund of Key Laboratory of Geological Survey and Evaluation of Ministry of Education and the Fundamental Research Funds for the Central Universities(No.GLAB2019ZR02)Open Fund of Laboratory of Urban Land Resources Monitoring and Simulation,Ministry of Natural Resources,China(No.KF-2020-05-068)。
文摘The exploration of building detection plays an important role in urban planning,smart city and military.Aiming at the problem of high overlapping ratio of detection frames for dense building detection in high resolution remote sensing images,we present an effective YOLOv3 framework,corner regression-based YOLOv3(Correg-YOLOv3),to localize dense building accurately.This improved YOLOv3 algorithm establishes a vertex regression mechanism and an additional loss item about building vertex offsets relative to the center point of bounding box.By extending output dimensions,the trained model is able to output the rectangular bounding boxes and the building vertices meanwhile.Finally,we evaluate the performance of the Correg-YOLOv3 on our self-produced data set and provide a comparative analysis qualitatively and quantitatively.The experimental results achieve high performance in precision(96.45%),recall rate(95.75%),F1 score(96.10%)and average precision(98.05%),which were 2.73%,5.4%,4.1%and 4.73%higher than that of YOLOv3.Therefore,our proposed algorithm effectively tackles the problem of dense building detection in high resolution images.
基金National Natural Science Foundation of China(No.41271435)National Natural Science Foundation of China Youth Found(No.41301479)。
文摘It is proposed a high resolution remote sensing image segmentation method which combines static minimum spanning tree(MST)tessellation considering shape information and the RHMRF-FCM algorithm.It solves the problems in the traditional pixel-based HMRF-FCM algorithm in which poor noise resistance and low precision segmentation in a complex boundary exist.By using the MST model and shape information,the object boundary and geometrical noise can be expressed and reduced respectively.Firstly,the static MST tessellation is employed for dividing the image domain into some sub-regions corresponding to the components of homogeneous regions needed to be segmented.Secondly,based on the tessellation results,the RHMRF model is built,and regulation terms considering the KL information and the information entropy are introduced into the FCM objective function.Finally,the partial differential method and Lagrange function are employed to calculate the parameters of the fuzzy objective function for obtaining the global optimal segmentation results.To verify the robustness and effectiveness of the proposed algorithm,the experiments are carried out with WorldView-3(WV-3)high resolution image.The results from proposed method with different parameters and comparing methods(multi-resolution method and watershed segmentation method in eCognition software)are analyzed qualitatively and quantitatively.
基金Sponsoreds by the National Natural Science Foundation of China (Grant No. 60575016)
文摘While executing tasks such as ocean pollution monitoring,maritime rescue,geographic mapping,and automatic navigation utilizing remote sensing images,the coastline feature should be determined.Traditional methods are not satisfactory to extract coastline in high-resolution panchromatic remote sensing image.Active contour model,also called snakes,have proven useful for interactive specification of image contours,so it is used as an effective coastlines extraction technique.Firstly,coastlines are detected by water segmentation and boundary tracking,which are considered initial contours to be optimized through active contour model.As better energy functions are developed,the power assist of snakes becomes effective.New internal energy has been done to reduce problems caused by convergence to local minima,and new external energy can greatly enlarge the capture region around features of interest.After normalization processing,energies are iterated using greedy algorithm to accelerate convergence rate.The experimental results encompassed examples in images and demonstrated the capabilities and efficiencies of the improvement.
基金funded by the Chongqing Normal University Startup Foundation for PhD(22XLB021)supported by the Open Research Project of the State Key Laboratory of Industrial Control Technology,Zhejiang University,China(No.ICT2023B40).
文摘Cloud detection from satellite and drone imagery is crucial for applications such as weather forecasting and environmentalmonitoring.Addressing the limitations of conventional convolutional neural networks,we propose an innovative transformer-based method.This method leverages transformers,which are adept at processing data sequences,to enhance cloud detection accuracy.Additionally,we introduce a Cyclic Refinement Architecture that improves the resolution and quality of feature extraction,thereby aiding in the retention of critical details often lost during cloud detection.Our extensive experimental validation shows that our approach significantly outperforms established models,excelling in high-resolution feature extraction and precise cloud segmentation.By integrating Positional Visual Transformers(PVT)with this architecture,our method advances high-resolution feature delineation and segmentation accuracy.Ultimately,our research offers a novel perspective for surmounting traditional challenges in cloud detection and contributes to the advancement of precise and dependable image analysis across various domains.
基金This work was supported in part by the Key Project of Natural Science Research of Anhui Provincial Department of Education under Grant KJ2017A416in part by the Fund of National Sensor Network Engineering Technology Research Center(No.NSNC202103).
文摘When existing deep learning models are used for road extraction tasks from high-resolution images,they are easily affected by noise factors such as tree and building occlusion and complex backgrounds,resulting in incomplete road extraction and low accuracy.We propose the introduction of spatial and channel attention modules to the convolutional neural network ConvNeXt.Then,ConvNeXt is used as the backbone network,which cooperates with the perceptual analysis network UPerNet,retains the detection head of the semantic segmentation,and builds a new model ConvNeXt-UPerNet to suppress noise interference.Training on the open-source DeepGlobe and CHN6-CUG datasets and introducing the DiceLoss on the basis of CrossEntropyLoss solves the problem of positive and negative sample imbalance.Experimental results show that the new network model can achieve the following performance on the DeepGlobe dataset:79.40%for precision(Pre),97.93% for accuracy(Acc),69.28% for intersection over union(IoU),and 83.56% for mean intersection over union(MIoU).On the CHN6-CUG dataset,the model achieves the respective values of 78.17%for Pre,97.63%for Acc,65.4% for IoU,and 81.46% for MIoU.Compared with other network models,the fused ConvNeXt-UPerNet model can extract road information better when faced with the influence of noise contained in high-resolution remote sensing images.It also achieves multiscale image feature information with unified perception,ultimately improving the generalization ability of deep learning technology in extracting complex roads from high-resolution remote sensing images.
基金the National Natural Science Foundation of China(42001408,61806097).
文摘Significant advancements have been achieved in road surface extraction based on high-resolution remote sensingimage processing. Most current methods rely on fully supervised learning, which necessitates enormous humaneffort to label the image. Within this field, other research endeavors utilize weakly supervised methods. Theseapproaches aim to reduce the expenses associated with annotation by leveraging sparsely annotated data, such asscribbles. This paper presents a novel technique called a weakly supervised network using scribble-supervised andedge-mask (WSSE-net). This network is a three-branch network architecture, whereby each branch is equippedwith a distinct decoder module dedicated to road extraction tasks. One of the branches is dedicated to generatingedge masks using edge detection algorithms and optimizing road edge details. The other two branches supervise themodel’s training by employing scribble labels and spreading scribble information throughout the image. To addressthe historical flaw that created pseudo-labels that are not updated with network training, we use mixup to blendprediction results dynamically and continually update new pseudo-labels to steer network training. Our solutiondemonstrates efficient operation by simultaneously considering both edge-mask aid and dynamic pseudo-labelsupport. The studies are conducted on three separate road datasets, which consist primarily of high-resolutionremote-sensing satellite photos and drone images. The experimental findings suggest that our methodologyperforms better than advanced scribble-supervised approaches and specific traditional fully supervised methods.
基金This work is supported by Chinese Academy of Sciences‘Hundred Talents’project (No:KZCX0415)
文摘This paper introduces the applications of high-resolution remote sensing imagery and the necessity of geometric calibration for remote sensing sensors considering assurance of the geometric accuracy of remote sensing imagery. Then the paper analyzes the general methodology of geometric calibration. Taking the DMC sensor geometric calibration as an example, the paper discusses the whole calibration procedure. Finally, it gave some concluding remarks on geometric calibration of high-resolution remote sensing sensors.
文摘Traditional Chinese villages,vital carriers of traditional culture,have faced significant alterations due to urbanization in recent years,urgently necessitating artificial intelligence data updates.This study integrates high spatial resolution remote sensing imagery with deep learning techniques,proposing a novel method for identifying rooftops of traditional Chinese village buildings using high-definition remote sensing images.Using 0.54 m spatial resolution imagery of traditional village areas as the data source,this method analyzes the geometric and spectral image characteristics of village building rooftops.It constructs a deep learning feature sample library tailored to the target types.Employing a semantically enhanced version of the improved Mask R-CNN(Mask Region-based Convolutional Neural Network)for building recognition,the study conducts experiments on localized imagery from different regions.The results demonstrated that the modified Mask R-CNN effectively identifies traditional village building rooftops,achieving an of 0.7520 and an of 0.7400.It improves the current problem of misidentification and missed detection caused by feature heterogeneity.This method offers a viable and effective approach for industrialized data monitoring of traditional villages,contributing to their sustainable development.
基金supported by the Erdos Major“Leader Recruitment”Technological Project[JBGS-2023-001]Research Grant from the National Institute of Natural Hazards,Ministry of Emergency Management of China[ZDJ2019-17]Civil Aerospace Technology Advance Research Project of China[D040405].
文摘Tailings ponds are critical facilities in the mining industry,and accurate monitoring and management of these ponds are of paramount importance.However,conventional object detection methodologies,including recent advancements,often face significant challenges in addressing the complexities inherent to tailings pond environments.This is particularly due to deficiencies in their loss function design,which can result in protracted convergence times and suboptimal performance when detecting smaller targets.In this study,we introduce an innovative loss function termed the Rapid Intersection over Union(RIoU)loss function,which incorporates a focal weight and is integrated into the YOLOv5 object detection framework to develop the YOLOv5-RF model.This approach aims to enhance both convergence speed and improve convergence accuracy in the tailings pond identification process by comprehensively addressing the specific challenges posed by complex environmental conditions,thereby enhancing the precision and robustness of tailings pond target detection.It integrates the concepts of the central triangle and the aspect ratio of the circumscribed rectangle,assigning specific weights and penalty terms to optimize the model’s performance in object detection tasks.We validated the efficacy of YOLOv5-RF through simulation experiments and high-resolution remote sensing images of tailings ponds.The experimental results indicate that RIoU facilitates faster convergence rates.Specifically,YOLOv5-RF achieves accuracy and recall rates that are 2%and 2.1%higher than those of YOLOv5,respectively.Furthermore,it completes 120 iterations in 1.08 hours less time compared to its predecessor model while exhibiting an inference time that is 11.7 ms shorter than that for YOLOv5.These findings suggest that our model significantly enhances processing speed without compromising accuracy levels.This research offers novel technical approaches as well as theoretical support for monitoring tailings ponds using computer vision and remote sensing technologies.
文摘Remote sensing images contain a wealth of geospatial information.To accurately identify different geospatial categories and extract relevant data,image semantic segmentation plays a crucial role.In recent years,deep learning technology has brought significant breakthroughs to semantic segmentation of remote sensing images,significantly enhancing its performance.This paper investigates the application of deep learning technologies in remote sensing image semantic segmentation,based on Convolutional Neural Networks(CNN)and Transformer-based semantic segmentation methods.It conducts an in-depth comparison of their structural characteristics and applicable scenarios,summarizes the achievements and shortcomings of existing research,and provides technical references and theoretical support for future studies,thereby contributing to the further development of deep learning technology in the field of remote sensing.Research results indicate that CNN-based semantic segmentation methods still hold advantages in extracting local features and achieving efficient segmentation,whereas Transformers address CNN's limitations in global context modeling and long-range dependency capture.Therefore,the collaborative integration of CNN and Transformers will become an important research direction for enhancing model performance in the future.
基金supported by the Henan Province Key R&D Project under Grant 241111210400the Henan Provincial Science and Technology Research Project under Grants 252102211047,252102211062,252102211055 and 232102210069+2 种基金the Jiangsu Provincial Scheme Double Initiative Plan JSS-CBS20230474,the XJTLU RDF-21-02-008the Science and Technology Innovation Project of Zhengzhou University of Light Industry under Grant 23XNKJTD0205the Higher Education Teaching Reform Research and Practice Project of Henan Province under Grant 2024SJGLX0126。
文摘Accurate and efficient detection of building changes in remote sensing imagery is crucial for urban planning,disaster emergency response,and resource management.However,existing methods face challenges such as spectral similarity between buildings and backgrounds,sensor variations,and insufficient computational efficiency.To address these challenges,this paper proposes a novel Multi-scale Efficient Wavelet-based Change Detection Network(MewCDNet),which integrates the advantages of Convolutional Neural Networks and Transformers,balances computational costs,and achieves high-performance building change detection.The network employs EfficientNet-B4 as the backbone for hierarchical feature extraction,integrates multi-level feature maps through a multi-scale fusion strategy,and incorporates two key modules:Cross-temporal Difference Detection(CTDD)and Cross-scale Wavelet Refinement(CSWR).CTDD adopts a dual-branch architecture that combines pixel-wise differencing with semanticaware Euclidean distance weighting to enhance the distinction between true changes and background noise.CSWR integrates Haar-based Discrete Wavelet Transform with multi-head cross-attention mechanisms,enabling cross-scale feature fusion while significantly improving edge localization and suppressing spurious changes.Extensive experiments on four benchmark datasets demonstrate MewCDNet’s superiority over comparison methods:achieving F1 scores of 91.54%on LEVIR,93.70%on WHUCD,and 64.96%on S2Looking for building change detection.Furthermore,MewCDNet exhibits optimal performance on the multi-class⋅SYSU dataset(F1:82.71%),highlighting its exceptional generalization capability.
基金funded by the Hainan Province Science and Technology Special Fund under Grant ZDYF2024GXJS292.
文摘Deep learning has made significant progress in the field of oriented object detection for remote sensing images.However,existing methods still face challenges when dealing with difficult tasks such as multi-scale targets,complex backgrounds,and small objects in remote sensing.Maintaining model lightweight to address resource constraints in remote sensing scenarios while improving task completion for remote sensing tasks remains a research hotspot.Therefore,we propose an enhanced multi-scale feature extraction lightweight network EM-YOLO based on the YOLOv8s architecture,specifically optimized for the characteristics of large target scale variations,diverse orientations,and numerous small objects in remote sensing images.Our innovations lie in two main aspects:First,a dynamic snake convolution(DSC)is introduced into the backbone network to enhance the model’s feature extraction capability for oriented targets.Second,an innovative focusing-diffusion module is designed in the feature fusion neck to effectively integrate multi-scale feature information.Finally,we introduce Layer-Adaptive Sparsity for magnitude-based Pruning(LASP)method to perform lightweight network pruning to better complete tasks in resource-constrained scenarios.Experimental results on the lightweight platform Orin demonstrate that the proposed method significantly outperforms the original YOLOv8s model in oriented remote sensing object detection tasks,and achieves comparable or superior performance to state-of-the-art methods on three authoritative remote sensing datasets(DOTA v1.0,DOTA v1.5,and HRSC2016).
基金supported by the National Public Welfare Forest Desert Shrubbery Monitoring Project。
文摘Desert shrubs are indispensable in maintaining ecological stability by reducing soil erosion,enhancing water retention,and boosting soil fertility,which are critical factors in mitigating desertification processes.Due to the complex topography,variable climate,and challenges in field surveys in desert regions,this paper proposes YOLO-Desert-Shrub(YOLO-DS),a detection method for identifying desert shrubs in UAV remote sensing images based on an enhanced YOLOv8n framework.This method accurately identifying shrub species,locations,and coverage.To address the issue of small individual plants dominating the dataset,the SPDconv convolution module is introduced in the Backbone and Neck layers of the YOLOv8n model,replacing conventional convolutions.This structural optimization mitigates information degradation in fine-grained data while strengthening discriminative feature capture across spatial scales within desert shrub datasets.Furthermore,a structured state-space model is integrated into the main network,and the MambaLayer is designed to dynamically extract and refine shrub-specific features from remote sensing images,effectively filtering out background noise and irrelevant interference to enhance feature representation.Benchmark evaluations reveal the YOLO-DS framework attains 79.56%mAP40weight,demonstrating 2.2%absolute gain versus the baseline YOLOv8n architecture,with statistically significant advantages over contemporary detectors in cross-validation trials.The predicted plant coverage exhibits strong consistency with manually measured coverage,with a coefficient of determination(R^(2))of 0.9148 and a Root Mean Square Error(RMSE)of1.8266%.The proposed UAV-based remote sensing method utilizing the YOLO-DS effectively identify and locate desert shrubs,monitor canopy sizes and distribution,and provide technical support for automated desert shrub monitoring.
基金funded by the Henan Province Key R&D Program Project,“Research and Application Demonstration of Class Ⅱ Superlattice Medium Wave High Temperature Infrared Detector Technology”,grant number 231111210400.
文摘High-resolution remote sensing imagery is essential for critical applications such as precision agriculture,urban management planning,and military reconnaissance.Although significant progress has been made in singleimage super-resolution(SISR)using generative adversarial networks(GANs),existing approaches still face challenges in recovering high-frequency details,effectively utilizing features,maintaining structural integrity,and ensuring training stability—particularly when dealing with the complex textures characteristic of remote sensing imagery.To address these limitations,this paper proposes the Improved ResidualModule and AttentionMechanism Network(IRMANet),a novel architecture specifically designed for remote sensing image reconstruction.IRMANet builds upon the Super-Resolution Generative Adversarial Network(SRGAN)framework and introduces several key innovations.First,the Enhanced Residual Unit(ERU)enhances feature reuse and stabilizes training through deep residual connections.Second,the Self-Attention Residual Block(SARB)incorporates a self-attentionmechanism into the Improved Residual Module(IRM)to effectivelymodel long-range dependencies and automatically emphasize salient features.Additionally,the IRM adopts amulti-scale feature fusion strategy to facilitate synergistic interactions between local detail and global semantic information.The effectiveness of each component is validated through ablation studies,while comprehensive comparative experiments on standard remote sensing datasets demonstrate that IRMANet significantly outperforms both the baseline and state-of-the-art methods in terms of perceptual quality and quantitative metrics.Specifically,compared to the baseline model,at a magnification factor of 2,IRMANet achieves an improvement of 0.24 dB in peak signal-to-noise ratio(PSNR)and 0.54 in structural similarity index(SSIM);at a magnification factor of 4,it achieves gains of 0.22 dB in PSNR and 0.51 in SSIM.These results confirm that the proposedmethod effectively enhances detail representation and structural reconstruction accuracy in complex remote sensing scenarios,offering robust technical support for high-precision detection and identification of both military and civilian aircraft.
基金Under the auspices of the Strategic Priority Research Program of the Chinese Academy of Sciences(No.XDA28050400)Jilin Province Key Research and Development Project(No.20230202040NC)Common Application Support Platform for National Civil Space Infrastructure Land Observation Satellites(No.2017-000052-73-01-001735)。
文摘Agricultural greenhouses(AGHs)are increasingly used globally to control the crop growth environment,which are vital for food production,resource conservation,and rural economies.Advances in high-quality data acquisition methods and information retrieval algorithms have improved the ability to extract AGHs from remote sensing images(e.g.,satellite and uncrewed aerial vehicle(UAV)).Research on this topic began in 1989,and the number of related studies has increased annually.This paper provides a review of the development of remote sensing of AGHs and research hotspots.It summarizes the current status and trends of data sources,identification features,methods,and accuracy of AGHs extraction.Due to the unique spectral,textural,and geometric characteristics of AGHs,research studies have primarily utilized optical remote sensing data from sensors with spatial resolutions of 30 m or more,such as Landsat,Sentinel,Gaofen(GF),and Worldview,to extract AGHs.Machine learning and deep learning methods have provided more precise results for extracting AGHs than threshold segmentation methods.In contrast,deep learning algorithms have been primarily used with high-spatial resolution data and small-scale study areas,with accuracy rates generally exceeding 90.00%.However,future research may use higher spatial resolution images to improve the accuracy and detail of AGH extraction.Recent studies have integrated multiple data sources and performed time-series analysis to improve monitoring of dynamic changes in AGHs.Moreover,emphasis should be placed on optimizing data fusion techniques,implementing sample transfer methods,expanding the number of sensors,and increasing the application of artificial intelligence(AI)in monitoring AGHs.These efforts will provide more reliable methods and tools to improve agricultural production and resource utilization efficiency.This review provides resources for researchers and decision-makers involved in modern agricultural development,as well as scientific evidence for the sustainable development of rural areas.