High-resolution remote sensing images(HRSIs)are now an essential data source for gathering surface information due to advancements in remote sensing data capture technologies.However,their significant scale changes an...High-resolution remote sensing images(HRSIs)are now an essential data source for gathering surface information due to advancements in remote sensing data capture technologies.However,their significant scale changes and wealth of spatial details pose challenges for semantic segmentation.While convolutional neural networks(CNNs)excel at capturing local features,they are limited in modeling long-range dependencies.Conversely,transformers utilize multihead self-attention to integrate global context effectively,but this approach often incurs a high computational cost.This paper proposes a global-local multiscale context network(GLMCNet)to extract both global and local multiscale contextual information from HRSIs.A detail-enhanced filtering module(DEFM)is proposed at the end of the encoder to refine the encoder outputs further,thereby enhancing the key details extracted by the encoder and effectively suppressing redundant information.In addition,a global-local multiscale transformer block(GLMTB)is proposed in the decoding stage to enable the modeling of rich multiscale global and local information.We also design a stair fusion mechanism to transmit deep semantic information from deep to shallow layers progressively.Finally,we propose the semantic awareness enhancement module(SAEM),which further enhances the representation of multiscale semantic features through spatial attention and covariance channel attention.Extensive ablation analyses and comparative experiments were conducted to evaluate the performance of the proposed method.Specifically,our method achieved a mean Intersection over Union(mIoU)of 86.89%on the ISPRS Potsdam dataset and 84.34%on the ISPRS Vaihingen dataset,outperforming existing models such as ABCNet and BANet.展开更多
While algorithms have been created for land usage in urban settings,there have been few investigations into the extraction of urban footprint(UF).To address this research gap,the study employs several widely used imag...While algorithms have been created for land usage in urban settings,there have been few investigations into the extraction of urban footprint(UF).To address this research gap,the study employs several widely used image classification method classified into three categories to evaluate their segmentation capabilities for extracting UF across eight cities.The results indicate that pixel-based methods only excel in clear urban environments,and their overall accuracy is not consistently high.RF and SVM perform well but lack stability in object-based UF extraction,influenced by feature selection and classifier performance.Deep learning enhances feature extraction but requires powerful computing and faces challenges with complex urban layouts.SAM excels in medium-sized urban areas but falters in intricate layouts.Integrating traditional and deep learning methods optimizes UF extraction,balancing accuracy and processing efficiency.Future research should focus on adapting algorithms for diverse urban landscapes to enhance UF extraction accuracy and applicability.展开更多
Landslide is one of the multitudinous serious geological hazards. The key to its control and reduction lies on dynamic monitoring and early warning. The article points out the insufficiency of traditional measuring me...Landslide is one of the multitudinous serious geological hazards. The key to its control and reduction lies on dynamic monitoring and early warning. The article points out the insufficiency of traditional measuring means applied for large-scale landslide monitoring and proposes the method for extensive landslide displacement field monitoring using high- resolution remote images. Matching of cognominal points is realized by using the invariant features of SIFT algorithm in image translation, rotation, zooming, and affine transformation, and through recognition and comparison of characteristics of high-resolution images in different landsliding periods. Following that, landslide displacement vector field can be made known by measuring the distances and directions between cognominal points. As evidenced by field application of the method for landslide monitoring at West Open Mine in Fushun city of China, the method has the attraction of being able to make areal measurement through satellite observation and capable of obtaining at the same time the information of large- area intensive displacement field, for facilitating automatic delimitation of extent of landslide displacement vector field and sliding mass. This can serve as a basis for making analysis of laws governing occurrence of landslide and adoption of countermeasures.展开更多
Objective Nowadays, high-resolution remote sensing technology has brought new changes to surveys of earthquakes, and the quantitative study of seismic faults based on this technology has become a trend in the world(Ba...Objective Nowadays, high-resolution remote sensing technology has brought new changes to surveys of earthquakes, and the quantitative study of seismic faults based on this technology has become a trend in the world(Barzegari et al., 2017). An Mw 7.2 earthquake occurred in Yutian of Xinjiang on the western end of the Altyn Tagh fault on March 21 st, 2008. It is difficult to access this depopulated zone because of the high altitude and only 1–2 months of snowmelt. This study utilized high-resolution展开更多
On the basis of realization of beach information and its differentiating of high-resolution remote sensing image on coastal zone, extracting objects are carried through RS multi-scale diagnostic analysis, and fast inf...On the basis of realization of beach information and its differentiating of high-resolution remote sensing image on coastal zone, extracting objects are carried through RS multi-scale diagnostic analysis, and fast information extraction methods and key technologies are put forward. Meanwhile image segmentation methods are set forth for objects of coastal zone. And through the application of Otsu2D to the segmentation of water area and dock and the applying of Gabor filter to the separation and extraction of construction, some typical applications of high-resolution RS image are presented in the field of coastal zone surface objects' recognition. Quantizing high-resolution RS information on the coastal zone proved to be of great scientific and practical significance for coastal development and management.展开更多
The exploration of building detection plays an important role in urban planning,smart city and military.Aiming at the problem of high overlapping ratio of detection frames for dense building detection in high resoluti...The exploration of building detection plays an important role in urban planning,smart city and military.Aiming at the problem of high overlapping ratio of detection frames for dense building detection in high resolution remote sensing images,we present an effective YOLOv3 framework,corner regression-based YOLOv3(Correg-YOLOv3),to localize dense building accurately.This improved YOLOv3 algorithm establishes a vertex regression mechanism and an additional loss item about building vertex offsets relative to the center point of bounding box.By extending output dimensions,the trained model is able to output the rectangular bounding boxes and the building vertices meanwhile.Finally,we evaluate the performance of the Correg-YOLOv3 on our self-produced data set and provide a comparative analysis qualitatively and quantitatively.The experimental results achieve high performance in precision(96.45%),recall rate(95.75%),F1 score(96.10%)and average precision(98.05%),which were 2.73%,5.4%,4.1%and 4.73%higher than that of YOLOv3.Therefore,our proposed algorithm effectively tackles the problem of dense building detection in high resolution images.展开更多
It is proposed a high resolution remote sensing image segmentation method which combines static minimum spanning tree(MST)tessellation considering shape information and the RHMRF-FCM algorithm.It solves the problems i...It is proposed a high resolution remote sensing image segmentation method which combines static minimum spanning tree(MST)tessellation considering shape information and the RHMRF-FCM algorithm.It solves the problems in the traditional pixel-based HMRF-FCM algorithm in which poor noise resistance and low precision segmentation in a complex boundary exist.By using the MST model and shape information,the object boundary and geometrical noise can be expressed and reduced respectively.Firstly,the static MST tessellation is employed for dividing the image domain into some sub-regions corresponding to the components of homogeneous regions needed to be segmented.Secondly,based on the tessellation results,the RHMRF model is built,and regulation terms considering the KL information and the information entropy are introduced into the FCM objective function.Finally,the partial differential method and Lagrange function are employed to calculate the parameters of the fuzzy objective function for obtaining the global optimal segmentation results.To verify the robustness and effectiveness of the proposed algorithm,the experiments are carried out with WorldView-3(WV-3)high resolution image.The results from proposed method with different parameters and comparing methods(multi-resolution method and watershed segmentation method in eCognition software)are analyzed qualitatively and quantitatively.展开更多
While executing tasks such as ocean pollution monitoring,maritime rescue,geographic mapping,and automatic navigation utilizing remote sensing images,the coastline feature should be determined.Traditional methods are n...While executing tasks such as ocean pollution monitoring,maritime rescue,geographic mapping,and automatic navigation utilizing remote sensing images,the coastline feature should be determined.Traditional methods are not satisfactory to extract coastline in high-resolution panchromatic remote sensing image.Active contour model,also called snakes,have proven useful for interactive specification of image contours,so it is used as an effective coastlines extraction technique.Firstly,coastlines are detected by water segmentation and boundary tracking,which are considered initial contours to be optimized through active contour model.As better energy functions are developed,the power assist of snakes becomes effective.New internal energy has been done to reduce problems caused by convergence to local minima,and new external energy can greatly enlarge the capture region around features of interest.After normalization processing,energies are iterated using greedy algorithm to accelerate convergence rate.The experimental results encompassed examples in images and demonstrated the capabilities and efficiencies of the improvement.展开更多
Cloud detection from satellite and drone imagery is crucial for applications such as weather forecasting and environmentalmonitoring.Addressing the limitations of conventional convolutional neural networks,we propose ...Cloud detection from satellite and drone imagery is crucial for applications such as weather forecasting and environmentalmonitoring.Addressing the limitations of conventional convolutional neural networks,we propose an innovative transformer-based method.This method leverages transformers,which are adept at processing data sequences,to enhance cloud detection accuracy.Additionally,we introduce a Cyclic Refinement Architecture that improves the resolution and quality of feature extraction,thereby aiding in the retention of critical details often lost during cloud detection.Our extensive experimental validation shows that our approach significantly outperforms established models,excelling in high-resolution feature extraction and precise cloud segmentation.By integrating Positional Visual Transformers(PVT)with this architecture,our method advances high-resolution feature delineation and segmentation accuracy.Ultimately,our research offers a novel perspective for surmounting traditional challenges in cloud detection and contributes to the advancement of precise and dependable image analysis across various domains.展开更多
When existing deep learning models are used for road extraction tasks from high-resolution images,they are easily affected by noise factors such as tree and building occlusion and complex backgrounds,resulting in inco...When existing deep learning models are used for road extraction tasks from high-resolution images,they are easily affected by noise factors such as tree and building occlusion and complex backgrounds,resulting in incomplete road extraction and low accuracy.We propose the introduction of spatial and channel attention modules to the convolutional neural network ConvNeXt.Then,ConvNeXt is used as the backbone network,which cooperates with the perceptual analysis network UPerNet,retains the detection head of the semantic segmentation,and builds a new model ConvNeXt-UPerNet to suppress noise interference.Training on the open-source DeepGlobe and CHN6-CUG datasets and introducing the DiceLoss on the basis of CrossEntropyLoss solves the problem of positive and negative sample imbalance.Experimental results show that the new network model can achieve the following performance on the DeepGlobe dataset:79.40%for precision(Pre),97.93% for accuracy(Acc),69.28% for intersection over union(IoU),and 83.56% for mean intersection over union(MIoU).On the CHN6-CUG dataset,the model achieves the respective values of 78.17%for Pre,97.63%for Acc,65.4% for IoU,and 81.46% for MIoU.Compared with other network models,the fused ConvNeXt-UPerNet model can extract road information better when faced with the influence of noise contained in high-resolution remote sensing images.It also achieves multiscale image feature information with unified perception,ultimately improving the generalization ability of deep learning technology in extracting complex roads from high-resolution remote sensing images.展开更多
Significant advancements have been achieved in road surface extraction based on high-resolution remote sensingimage processing. Most current methods rely on fully supervised learning, which necessitates enormous human...Significant advancements have been achieved in road surface extraction based on high-resolution remote sensingimage processing. Most current methods rely on fully supervised learning, which necessitates enormous humaneffort to label the image. Within this field, other research endeavors utilize weakly supervised methods. Theseapproaches aim to reduce the expenses associated with annotation by leveraging sparsely annotated data, such asscribbles. This paper presents a novel technique called a weakly supervised network using scribble-supervised andedge-mask (WSSE-net). This network is a three-branch network architecture, whereby each branch is equippedwith a distinct decoder module dedicated to road extraction tasks. One of the branches is dedicated to generatingedge masks using edge detection algorithms and optimizing road edge details. The other two branches supervise themodel’s training by employing scribble labels and spreading scribble information throughout the image. To addressthe historical flaw that created pseudo-labels that are not updated with network training, we use mixup to blendprediction results dynamically and continually update new pseudo-labels to steer network training. Our solutiondemonstrates efficient operation by simultaneously considering both edge-mask aid and dynamic pseudo-labelsupport. The studies are conducted on three separate road datasets, which consist primarily of high-resolutionremote-sensing satellite photos and drone images. The experimental findings suggest that our methodologyperforms better than advanced scribble-supervised approaches and specific traditional fully supervised methods.展开更多
This paper introduces the applications of high-resolution remote sensing imagery and the necessity of geometric calibration for remote sensing sensors considering assurance of the geometric accuracy of remote sensing ...This paper introduces the applications of high-resolution remote sensing imagery and the necessity of geometric calibration for remote sensing sensors considering assurance of the geometric accuracy of remote sensing imagery. Then the paper analyzes the general methodology of geometric calibration. Taking the DMC sensor geometric calibration as an example, the paper discusses the whole calibration procedure. Finally, it gave some concluding remarks on geometric calibration of high-resolution remote sensing sensors.展开更多
Traditional Chinese villages,vital carriers of traditional culture,have faced significant alterations due to urbanization in recent years,urgently necessitating artificial intelligence data updates.This study integrat...Traditional Chinese villages,vital carriers of traditional culture,have faced significant alterations due to urbanization in recent years,urgently necessitating artificial intelligence data updates.This study integrates high spatial resolution remote sensing imagery with deep learning techniques,proposing a novel method for identifying rooftops of traditional Chinese village buildings using high-definition remote sensing images.Using 0.54 m spatial resolution imagery of traditional village areas as the data source,this method analyzes the geometric and spectral image characteristics of village building rooftops.It constructs a deep learning feature sample library tailored to the target types.Employing a semantically enhanced version of the improved Mask R-CNN(Mask Region-based Convolutional Neural Network)for building recognition,the study conducts experiments on localized imagery from different regions.The results demonstrated that the modified Mask R-CNN effectively identifies traditional village building rooftops,achieving an of 0.7520 and an of 0.7400.It improves the current problem of misidentification and missed detection caused by feature heterogeneity.This method offers a viable and effective approach for industrialized data monitoring of traditional villages,contributing to their sustainable development.展开更多
Remote sensing data is a cheap form of surficial geoscientific data,and in terms of veracity,velocity and volume,can sometimes be considered big data.Its spatial and spectral resolution continues to improve over time,...Remote sensing data is a cheap form of surficial geoscientific data,and in terms of veracity,velocity and volume,can sometimes be considered big data.Its spatial and spectral resolution continues to improve over time,and some modern satellites,such as the Copernicus Programme’s Sentinel-2 remote sensing satellites,offer a spatial resolution of 10 m across many of their spectral bands.The abundance and quality of remote sensing data combined with accumulated primary geochemical data has provided an unprecedented opportunity to inferentially invert remote sensing data into geochemical data.The ability to derive geochemical data from remote sensing data would provide a form of secondary big geochemical data,which can be used for numerous downstream activities,particularly where data timeliness,volume and velocity are important.Major benefactors of secondary geochemical data would be environmental monitoring and applications of artificial intelligence and machine learning in geochemistry,which currently entirely relies on manually derived data that is primarily guided by scientific reduction.Furthermore,it permits the usage of well-established data analysis techniques from geochemistry to remote sensing that allows useable insights to be extracted beyond those typically associated with strictly remote sensing data analysis.Currently,no generally applicable and systematic method to derive chemical elemental concentrations from large-scale remote sensing data have been documented in geosciences.In this paper,we demonstrate that fusing geostatistically-augmented geochemical and remote sensing data produces an abundance of data that enables a more generalized machine learning-based geochemical data generation.We use gold grade data from a South African tailing storage facility(TSF)and data from both the Landsat-8 and Sentinel remote sensing satellites.We show that various machine learning algorithms can be used given the abundance of training data.Consequently,we are able to produce a high resolution(10 m grid size)gold concentration map of the TSF,which demonstrates the potential of our method to be used to guide extraction planning,online resource exploration,environmental monitoring and resource estimation.展开更多
Tailings ponds are critical facilities in the mining industry,and accurate monitoring and management of these ponds are of paramount importance.However,conventional object detection methodologies,including recent adva...Tailings ponds are critical facilities in the mining industry,and accurate monitoring and management of these ponds are of paramount importance.However,conventional object detection methodologies,including recent advancements,often face significant challenges in addressing the complexities inherent to tailings pond environments.This is particularly due to deficiencies in their loss function design,which can result in protracted convergence times and suboptimal performance when detecting smaller targets.In this study,we introduce an innovative loss function termed the Rapid Intersection over Union(RIoU)loss function,which incorporates a focal weight and is integrated into the YOLOv5 object detection framework to develop the YOLOv5-RF model.This approach aims to enhance both convergence speed and improve convergence accuracy in the tailings pond identification process by comprehensively addressing the specific challenges posed by complex environmental conditions,thereby enhancing the precision and robustness of tailings pond target detection.It integrates the concepts of the central triangle and the aspect ratio of the circumscribed rectangle,assigning specific weights and penalty terms to optimize the model’s performance in object detection tasks.We validated the efficacy of YOLOv5-RF through simulation experiments and high-resolution remote sensing images of tailings ponds.The experimental results indicate that RIoU facilitates faster convergence rates.Specifically,YOLOv5-RF achieves accuracy and recall rates that are 2%and 2.1%higher than those of YOLOv5,respectively.Furthermore,it completes 120 iterations in 1.08 hours less time compared to its predecessor model while exhibiting an inference time that is 11.7 ms shorter than that for YOLOv5.These findings suggest that our model significantly enhances processing speed without compromising accuracy levels.This research offers novel technical approaches as well as theoretical support for monitoring tailings ponds using computer vision and remote sensing technologies.展开更多
Accurate and efficient detection of building changes in remote sensing imagery is crucial for urban planning,disaster emergency response,and resource management.However,existing methods face challenges such as spectra...Accurate and efficient detection of building changes in remote sensing imagery is crucial for urban planning,disaster emergency response,and resource management.However,existing methods face challenges such as spectral similarity between buildings and backgrounds,sensor variations,and insufficient computational efficiency.To address these challenges,this paper proposes a novel Multi-scale Efficient Wavelet-based Change Detection Network(MewCDNet),which integrates the advantages of Convolutional Neural Networks and Transformers,balances computational costs,and achieves high-performance building change detection.The network employs EfficientNet-B4 as the backbone for hierarchical feature extraction,integrates multi-level feature maps through a multi-scale fusion strategy,and incorporates two key modules:Cross-temporal Difference Detection(CTDD)and Cross-scale Wavelet Refinement(CSWR).CTDD adopts a dual-branch architecture that combines pixel-wise differencing with semanticaware Euclidean distance weighting to enhance the distinction between true changes and background noise.CSWR integrates Haar-based Discrete Wavelet Transform with multi-head cross-attention mechanisms,enabling cross-scale feature fusion while significantly improving edge localization and suppressing spurious changes.Extensive experiments on four benchmark datasets demonstrate MewCDNet’s superiority over comparison methods:achieving F1 scores of 91.54%on LEVIR,93.70%on WHUCD,and 64.96%on S2Looking for building change detection.Furthermore,MewCDNet exhibits optimal performance on the multi-class⋅SYSU dataset(F1:82.71%),highlighting its exceptional generalization capability.展开更多
Remote sensing image super-resolution technology is pivotal for enhancing image quality in critical applications including environmental monitoring,urban planning,and disaster assessment.However,traditional methods ex...Remote sensing image super-resolution technology is pivotal for enhancing image quality in critical applications including environmental monitoring,urban planning,and disaster assessment.However,traditional methods exhibit deficiencies in detail recovery and noise suppression,particularly when processing complex landscapes(e.g.,forests,farmlands),leading to artifacts and spectral distortions that limit practical utility.To address this,we propose an enhanced Super-Resolution Generative Adversarial Network(SRGAN)framework featuring three key innovations:(1)Replacement of L1/L2 loss with a robust Charbonnier loss to suppress noise while preserving edge details via adaptive gradient balancing;(2)A multi-loss joint optimization strategy dynamically weighting Charbonnier loss(β=0.5),Visual Geometry Group(VGG)perceptual loss(α=1),and adversarial loss(γ=0.1)to synergize pixel-level accuracy and perceptual quality;(3)A multi-scale residual network(MSRN)capturing cross-scale texture features(e.g.,forest canopies,mountain contours).Validated on Sentinel-2(10 m)and SPOT-6/7(2.5 m)datasets covering 904 km2 in Motuo County,Xizang,our method outperforms the SRGAN baseline(SR4RS)with Peak Signal-to-Noise Ratio(PSNR)gains of 0.29 dB and Structural Similarity Index(SSIM)improvements of 3.08%on forest imagery.Visual comparisons confirm enhanced texture continuity despite marginal Learned Perceptual Image Patch Similarity(LPIPS)increases.The method significantly improves noise robustness and edge retention in complex geomorphology,demonstrating 18%faster response in forest fire early warning and providing high-resolution support for agricultural/urban monitoring.Future work will integrate spectral constraints and lightweight architectures.展开更多
Background China is progressing towards the goal of schistosomiasis elimination,but there are still some problems,such as difficult management of infection source and snail control.This study aimed to develop deep lea...Background China is progressing towards the goal of schistosomiasis elimination,but there are still some problems,such as difficult management of infection source and snail control.This study aimed to develop deep learning models with high-resolution remote sensing images for recognizing and monitoring livestock bovine,which is an intermediate source of Schistosoma japonicum infection,and to evaluate the effectiveness of the models for real-world application.Methods The dataset of livestock bovine’s spatial distribution was collected from the Chinese National Platform for Common Geospatial Information Services.The high-resolution remote sensing images were further divided into training data,test data,and validation data for model development.Two recognition models based on deep learning methods(ENVINet5 and Mask R-CNN)were developed with reference to the training datasets.The performance of the developed models was evaluated by the performance metrics of precision,recall,and F1-score.Results A total of 50 typical image areas were selected,1125 bovine objectives were labeled by the ENVINet5 model and 1277 bovine objectives were labeled by the Mask R-CNN model.For the ENVINet5 model,a total of 1598 records of bovine distribution were recognized.The model precision and recall were 81.9%and 80.2%,respectively.The F1 score was 0.81.For the Mask R-CNN mode,1679 records of bovine objectives were identified.The model precision and recall were 87.3%and 85.2%,respectively.The F1 score was 0.87.When applying the developed models to real-world schistosomiasis-endemic regions,there were 63 bovine objectives in the original image,53 records were extracted using the ENVINet5 model,and 57 records were extracted using the Mask R-CNN model.The successful recognition ratios were 84.1%and 90.5%for the respectively developed models.Conclusion The ENVINet5 model is very feasible when the bovine distribution is low in structure with few samples.The Mask R-CNN model has a good framework design and runs highly efficiently.The livestock recognition models developed using deep learning methods with high-resolution remote sensing images accurately recognize the spatial distribution of livestock,which could enable precise control of schistosomiasis.展开更多
Forest is the largest carbon reservoir and carbon absorber on earth.Thus,mapping forest cover change accurately is of great significance to achieving the global carbon neutrality goal.Accurate forest change informatio...Forest is the largest carbon reservoir and carbon absorber on earth.Thus,mapping forest cover change accurately is of great significance to achieving the global carbon neutrality goal.Accurate forest change information could be acquired by deep learning methods using high-resolution remote sensing images.However,deforestation detection based on deep learning on a large-scale region with high-resolution images required huge computational resources.Therefore,there was an urgent need for a fast and accurate deforestation detection model.In this study,we proposed an interesting but effective re-parameterization deforestation detection model,named RepDDNet.Unlike other existing models designed for deforestation detection,the main feature of RepDDNet was its decoupling feature,which means that it allowed the multi-branch structure in the training stages to be converted into a plain structure in the inference stage,thus the computation efficiency can be significantly improved in the inference stage while maintaining the accuracy unchanged.A large-scale experiment was carried out in Ankang city with 2-meter high-resolution remote sensing images(the total area of it was over 20,000 square kilometers),and the result indicated that the model computation efficiency could be improved by nearly 30%compared with the model without re-parameterization.Additionally,compared with other lightweight models,RepDDNet also displayed a trade-off between accuracy and computation efficiency.展开更多
Rapid building damage assessment following an earthquake is important for humanitarian relief and disaster emergency responses.In February 2023,two magnitude-7.8 earthquakes struck Turkey in quick succession,impacting...Rapid building damage assessment following an earthquake is important for humanitarian relief and disaster emergency responses.In February 2023,two magnitude-7.8 earthquakes struck Turkey in quick succession,impacting over 30 major cities across nearly 300 km.A quick and comprehensive understanding of the distribution of building damage is essential for e fficiently deploying rescue forces during critical rescue periods.This article presents the training of a two-stage convolutional neural network called BDANet that integrated image features captured before and after the disaster to evaluate the extent of building damage in Islahiye.Based on high-resolution remote sensing data from WorldView2,BDANet used predisaster imagery to extract building outlines;the image features before and after the disaster were then combined to conduct building damage assessment.We optimized these results to improve the accuracy of building edges and analyzed the damage to each building,and used population distribution information to estimate the population count and urgency of rescue at different disaster levels.The results indicate that the building area in the Islahiye region was 156.92 ha,with an affected area of 26.60 ha.Severely damaged buildings accounted for 15.67%of the total building area in the affected areas.WorldPop population distribution data indicated approximately 253,297,and 1,246 people in the collapsed,severely damaged,and lightly damaged areas,respectively.Accuracy verification showed that the BDANet model exhibited good performance in handling high-resolution images and can be used to directly assess building damage and provide rapid information for rescue operations in future disasters using model weights.展开更多
基金provided by the Science Research Project of Hebei Education Department under grant No.BJK2024115.
文摘High-resolution remote sensing images(HRSIs)are now an essential data source for gathering surface information due to advancements in remote sensing data capture technologies.However,their significant scale changes and wealth of spatial details pose challenges for semantic segmentation.While convolutional neural networks(CNNs)excel at capturing local features,they are limited in modeling long-range dependencies.Conversely,transformers utilize multihead self-attention to integrate global context effectively,but this approach often incurs a high computational cost.This paper proposes a global-local multiscale context network(GLMCNet)to extract both global and local multiscale contextual information from HRSIs.A detail-enhanced filtering module(DEFM)is proposed at the end of the encoder to refine the encoder outputs further,thereby enhancing the key details extracted by the encoder and effectively suppressing redundant information.In addition,a global-local multiscale transformer block(GLMTB)is proposed in the decoding stage to enable the modeling of rich multiscale global and local information.We also design a stair fusion mechanism to transmit deep semantic information from deep to shallow layers progressively.Finally,we propose the semantic awareness enhancement module(SAEM),which further enhances the representation of multiscale semantic features through spatial attention and covariance channel attention.Extensive ablation analyses and comparative experiments were conducted to evaluate the performance of the proposed method.Specifically,our method achieved a mean Intersection over Union(mIoU)of 86.89%on the ISPRS Potsdam dataset and 84.34%on the ISPRS Vaihingen dataset,outperforming existing models such as ABCNet and BANet.
文摘While algorithms have been created for land usage in urban settings,there have been few investigations into the extraction of urban footprint(UF).To address this research gap,the study employs several widely used image classification method classified into three categories to evaluate their segmentation capabilities for extracting UF across eight cities.The results indicate that pixel-based methods only excel in clear urban environments,and their overall accuracy is not consistently high.RF and SVM perform well but lack stability in object-based UF extraction,influenced by feature selection and classifier performance.Deep learning enhances feature extraction but requires powerful computing and faces challenges with complex urban layouts.SAM excels in medium-sized urban areas but falters in intricate layouts.Integrating traditional and deep learning methods optimizes UF extraction,balancing accuracy and processing efficiency.Future research should focus on adapting algorithms for diverse urban landscapes to enhance UF extraction accuracy and applicability.
文摘Landslide is one of the multitudinous serious geological hazards. The key to its control and reduction lies on dynamic monitoring and early warning. The article points out the insufficiency of traditional measuring means applied for large-scale landslide monitoring and proposes the method for extensive landslide displacement field monitoring using high- resolution remote images. Matching of cognominal points is realized by using the invariant features of SIFT algorithm in image translation, rotation, zooming, and affine transformation, and through recognition and comparison of characteristics of high-resolution images in different landsliding periods. Following that, landslide displacement vector field can be made known by measuring the distances and directions between cognominal points. As evidenced by field application of the method for landslide monitoring at West Open Mine in Fushun city of China, the method has the attraction of being able to make areal measurement through satellite observation and capable of obtaining at the same time the information of large- area intensive displacement field, for facilitating automatic delimitation of extent of landslide displacement vector field and sliding mass. This can serve as a basis for making analysis of laws governing occurrence of landslide and adoption of countermeasures.
基金supported by the National Natural Science Foundation of China (grants No. 41461164002 and 41631073)
文摘Objective Nowadays, high-resolution remote sensing technology has brought new changes to surveys of earthquakes, and the quantitative study of seismic faults based on this technology has become a trend in the world(Barzegari et al., 2017). An Mw 7.2 earthquake occurred in Yutian of Xinjiang on the western end of the Altyn Tagh fault on March 21 st, 2008. It is difficult to access this depopulated zone because of the high altitude and only 1–2 months of snowmelt. This study utilized high-resolution
文摘On the basis of realization of beach information and its differentiating of high-resolution remote sensing image on coastal zone, extracting objects are carried through RS multi-scale diagnostic analysis, and fast information extraction methods and key technologies are put forward. Meanwhile image segmentation methods are set forth for objects of coastal zone. And through the application of Otsu2D to the segmentation of water area and dock and the applying of Gabor filter to the separation and extraction of construction, some typical applications of high-resolution RS image are presented in the field of coastal zone surface objects' recognition. Quantizing high-resolution RS information on the coastal zone proved to be of great scientific and practical significance for coastal development and management.
基金National Natural Science Foundation of China(No.41871305)National Key Research and Development Program of China(No.2017YFC0602204)+2 种基金Fundamental Research Funds for the Central Universities,China University of Geosciences(Wuhan)(No.CUGQY1945)Open Fund of Key Laboratory of Geological Survey and Evaluation of Ministry of Education and the Fundamental Research Funds for the Central Universities(No.GLAB2019ZR02)Open Fund of Laboratory of Urban Land Resources Monitoring and Simulation,Ministry of Natural Resources,China(No.KF-2020-05-068)。
文摘The exploration of building detection plays an important role in urban planning,smart city and military.Aiming at the problem of high overlapping ratio of detection frames for dense building detection in high resolution remote sensing images,we present an effective YOLOv3 framework,corner regression-based YOLOv3(Correg-YOLOv3),to localize dense building accurately.This improved YOLOv3 algorithm establishes a vertex regression mechanism and an additional loss item about building vertex offsets relative to the center point of bounding box.By extending output dimensions,the trained model is able to output the rectangular bounding boxes and the building vertices meanwhile.Finally,we evaluate the performance of the Correg-YOLOv3 on our self-produced data set and provide a comparative analysis qualitatively and quantitatively.The experimental results achieve high performance in precision(96.45%),recall rate(95.75%),F1 score(96.10%)and average precision(98.05%),which were 2.73%,5.4%,4.1%and 4.73%higher than that of YOLOv3.Therefore,our proposed algorithm effectively tackles the problem of dense building detection in high resolution images.
基金National Natural Science Foundation of China(No.41271435)National Natural Science Foundation of China Youth Found(No.41301479)。
文摘It is proposed a high resolution remote sensing image segmentation method which combines static minimum spanning tree(MST)tessellation considering shape information and the RHMRF-FCM algorithm.It solves the problems in the traditional pixel-based HMRF-FCM algorithm in which poor noise resistance and low precision segmentation in a complex boundary exist.By using the MST model and shape information,the object boundary and geometrical noise can be expressed and reduced respectively.Firstly,the static MST tessellation is employed for dividing the image domain into some sub-regions corresponding to the components of homogeneous regions needed to be segmented.Secondly,based on the tessellation results,the RHMRF model is built,and regulation terms considering the KL information and the information entropy are introduced into the FCM objective function.Finally,the partial differential method and Lagrange function are employed to calculate the parameters of the fuzzy objective function for obtaining the global optimal segmentation results.To verify the robustness and effectiveness of the proposed algorithm,the experiments are carried out with WorldView-3(WV-3)high resolution image.The results from proposed method with different parameters and comparing methods(multi-resolution method and watershed segmentation method in eCognition software)are analyzed qualitatively and quantitatively.
基金Sponsoreds by the National Natural Science Foundation of China (Grant No. 60575016)
文摘While executing tasks such as ocean pollution monitoring,maritime rescue,geographic mapping,and automatic navigation utilizing remote sensing images,the coastline feature should be determined.Traditional methods are not satisfactory to extract coastline in high-resolution panchromatic remote sensing image.Active contour model,also called snakes,have proven useful for interactive specification of image contours,so it is used as an effective coastlines extraction technique.Firstly,coastlines are detected by water segmentation and boundary tracking,which are considered initial contours to be optimized through active contour model.As better energy functions are developed,the power assist of snakes becomes effective.New internal energy has been done to reduce problems caused by convergence to local minima,and new external energy can greatly enlarge the capture region around features of interest.After normalization processing,energies are iterated using greedy algorithm to accelerate convergence rate.The experimental results encompassed examples in images and demonstrated the capabilities and efficiencies of the improvement.
基金funded by the Chongqing Normal University Startup Foundation for PhD(22XLB021)supported by the Open Research Project of the State Key Laboratory of Industrial Control Technology,Zhejiang University,China(No.ICT2023B40).
文摘Cloud detection from satellite and drone imagery is crucial for applications such as weather forecasting and environmentalmonitoring.Addressing the limitations of conventional convolutional neural networks,we propose an innovative transformer-based method.This method leverages transformers,which are adept at processing data sequences,to enhance cloud detection accuracy.Additionally,we introduce a Cyclic Refinement Architecture that improves the resolution and quality of feature extraction,thereby aiding in the retention of critical details often lost during cloud detection.Our extensive experimental validation shows that our approach significantly outperforms established models,excelling in high-resolution feature extraction and precise cloud segmentation.By integrating Positional Visual Transformers(PVT)with this architecture,our method advances high-resolution feature delineation and segmentation accuracy.Ultimately,our research offers a novel perspective for surmounting traditional challenges in cloud detection and contributes to the advancement of precise and dependable image analysis across various domains.
基金This work was supported in part by the Key Project of Natural Science Research of Anhui Provincial Department of Education under Grant KJ2017A416in part by the Fund of National Sensor Network Engineering Technology Research Center(No.NSNC202103).
文摘When existing deep learning models are used for road extraction tasks from high-resolution images,they are easily affected by noise factors such as tree and building occlusion and complex backgrounds,resulting in incomplete road extraction and low accuracy.We propose the introduction of spatial and channel attention modules to the convolutional neural network ConvNeXt.Then,ConvNeXt is used as the backbone network,which cooperates with the perceptual analysis network UPerNet,retains the detection head of the semantic segmentation,and builds a new model ConvNeXt-UPerNet to suppress noise interference.Training on the open-source DeepGlobe and CHN6-CUG datasets and introducing the DiceLoss on the basis of CrossEntropyLoss solves the problem of positive and negative sample imbalance.Experimental results show that the new network model can achieve the following performance on the DeepGlobe dataset:79.40%for precision(Pre),97.93% for accuracy(Acc),69.28% for intersection over union(IoU),and 83.56% for mean intersection over union(MIoU).On the CHN6-CUG dataset,the model achieves the respective values of 78.17%for Pre,97.63%for Acc,65.4% for IoU,and 81.46% for MIoU.Compared with other network models,the fused ConvNeXt-UPerNet model can extract road information better when faced with the influence of noise contained in high-resolution remote sensing images.It also achieves multiscale image feature information with unified perception,ultimately improving the generalization ability of deep learning technology in extracting complex roads from high-resolution remote sensing images.
基金the National Natural Science Foundation of China(42001408,61806097).
文摘Significant advancements have been achieved in road surface extraction based on high-resolution remote sensingimage processing. Most current methods rely on fully supervised learning, which necessitates enormous humaneffort to label the image. Within this field, other research endeavors utilize weakly supervised methods. Theseapproaches aim to reduce the expenses associated with annotation by leveraging sparsely annotated data, such asscribbles. This paper presents a novel technique called a weakly supervised network using scribble-supervised andedge-mask (WSSE-net). This network is a three-branch network architecture, whereby each branch is equippedwith a distinct decoder module dedicated to road extraction tasks. One of the branches is dedicated to generatingedge masks using edge detection algorithms and optimizing road edge details. The other two branches supervise themodel’s training by employing scribble labels and spreading scribble information throughout the image. To addressthe historical flaw that created pseudo-labels that are not updated with network training, we use mixup to blendprediction results dynamically and continually update new pseudo-labels to steer network training. Our solutiondemonstrates efficient operation by simultaneously considering both edge-mask aid and dynamic pseudo-labelsupport. The studies are conducted on three separate road datasets, which consist primarily of high-resolutionremote-sensing satellite photos and drone images. The experimental findings suggest that our methodologyperforms better than advanced scribble-supervised approaches and specific traditional fully supervised methods.
基金This work is supported by Chinese Academy of Sciences‘Hundred Talents’project (No:KZCX0415)
文摘This paper introduces the applications of high-resolution remote sensing imagery and the necessity of geometric calibration for remote sensing sensors considering assurance of the geometric accuracy of remote sensing imagery. Then the paper analyzes the general methodology of geometric calibration. Taking the DMC sensor geometric calibration as an example, the paper discusses the whole calibration procedure. Finally, it gave some concluding remarks on geometric calibration of high-resolution remote sensing sensors.
文摘Traditional Chinese villages,vital carriers of traditional culture,have faced significant alterations due to urbanization in recent years,urgently necessitating artificial intelligence data updates.This study integrates high spatial resolution remote sensing imagery with deep learning techniques,proposing a novel method for identifying rooftops of traditional Chinese village buildings using high-definition remote sensing images.Using 0.54 m spatial resolution imagery of traditional village areas as the data source,this method analyzes the geometric and spectral image characteristics of village building rooftops.It constructs a deep learning feature sample library tailored to the target types.Employing a semantically enhanced version of the improved Mask R-CNN(Mask Region-based Convolutional Neural Network)for building recognition,the study conducts experiments on localized imagery from different regions.The results demonstrated that the modified Mask R-CNN effectively identifies traditional village building rooftops,achieving an of 0.7520 and an of 0.7400.It improves the current problem of misidentification and missed detection caused by feature heterogeneity.This method offers a viable and effective approach for industrialized data monitoring of traditional villages,contributing to their sustainable development.
基金provided by the Department of Science and Innovation(DSI)-National Research Foundation(NRF)Thuthuka Grant(Grant UID:121,973)DSI-NRF CIMERA.Yousef Ghorbani acknowledges financial support from the Centre for Advanced Mining and Metallurgy(CAMM),a strategic research environment established at the LuleåUniversity of Technology funded by the Swedish governmentWe also thank Sibanye-Stillwater Ltd.For their funding through the Wits Mining Institute(WMI).
文摘Remote sensing data is a cheap form of surficial geoscientific data,and in terms of veracity,velocity and volume,can sometimes be considered big data.Its spatial and spectral resolution continues to improve over time,and some modern satellites,such as the Copernicus Programme’s Sentinel-2 remote sensing satellites,offer a spatial resolution of 10 m across many of their spectral bands.The abundance and quality of remote sensing data combined with accumulated primary geochemical data has provided an unprecedented opportunity to inferentially invert remote sensing data into geochemical data.The ability to derive geochemical data from remote sensing data would provide a form of secondary big geochemical data,which can be used for numerous downstream activities,particularly where data timeliness,volume and velocity are important.Major benefactors of secondary geochemical data would be environmental monitoring and applications of artificial intelligence and machine learning in geochemistry,which currently entirely relies on manually derived data that is primarily guided by scientific reduction.Furthermore,it permits the usage of well-established data analysis techniques from geochemistry to remote sensing that allows useable insights to be extracted beyond those typically associated with strictly remote sensing data analysis.Currently,no generally applicable and systematic method to derive chemical elemental concentrations from large-scale remote sensing data have been documented in geosciences.In this paper,we demonstrate that fusing geostatistically-augmented geochemical and remote sensing data produces an abundance of data that enables a more generalized machine learning-based geochemical data generation.We use gold grade data from a South African tailing storage facility(TSF)and data from both the Landsat-8 and Sentinel remote sensing satellites.We show that various machine learning algorithms can be used given the abundance of training data.Consequently,we are able to produce a high resolution(10 m grid size)gold concentration map of the TSF,which demonstrates the potential of our method to be used to guide extraction planning,online resource exploration,environmental monitoring and resource estimation.
基金supported by the Erdos Major“Leader Recruitment”Technological Project[JBGS-2023-001]Research Grant from the National Institute of Natural Hazards,Ministry of Emergency Management of China[ZDJ2019-17]Civil Aerospace Technology Advance Research Project of China[D040405].
文摘Tailings ponds are critical facilities in the mining industry,and accurate monitoring and management of these ponds are of paramount importance.However,conventional object detection methodologies,including recent advancements,often face significant challenges in addressing the complexities inherent to tailings pond environments.This is particularly due to deficiencies in their loss function design,which can result in protracted convergence times and suboptimal performance when detecting smaller targets.In this study,we introduce an innovative loss function termed the Rapid Intersection over Union(RIoU)loss function,which incorporates a focal weight and is integrated into the YOLOv5 object detection framework to develop the YOLOv5-RF model.This approach aims to enhance both convergence speed and improve convergence accuracy in the tailings pond identification process by comprehensively addressing the specific challenges posed by complex environmental conditions,thereby enhancing the precision and robustness of tailings pond target detection.It integrates the concepts of the central triangle and the aspect ratio of the circumscribed rectangle,assigning specific weights and penalty terms to optimize the model’s performance in object detection tasks.We validated the efficacy of YOLOv5-RF through simulation experiments and high-resolution remote sensing images of tailings ponds.The experimental results indicate that RIoU facilitates faster convergence rates.Specifically,YOLOv5-RF achieves accuracy and recall rates that are 2%and 2.1%higher than those of YOLOv5,respectively.Furthermore,it completes 120 iterations in 1.08 hours less time compared to its predecessor model while exhibiting an inference time that is 11.7 ms shorter than that for YOLOv5.These findings suggest that our model significantly enhances processing speed without compromising accuracy levels.This research offers novel technical approaches as well as theoretical support for monitoring tailings ponds using computer vision and remote sensing technologies.
基金supported by the Henan Province Key R&D Project under Grant 241111210400the Henan Provincial Science and Technology Research Project under Grants 252102211047,252102211062,252102211055 and 232102210069+2 种基金the Jiangsu Provincial Scheme Double Initiative Plan JSS-CBS20230474,the XJTLU RDF-21-02-008the Science and Technology Innovation Project of Zhengzhou University of Light Industry under Grant 23XNKJTD0205the Higher Education Teaching Reform Research and Practice Project of Henan Province under Grant 2024SJGLX0126。
文摘Accurate and efficient detection of building changes in remote sensing imagery is crucial for urban planning,disaster emergency response,and resource management.However,existing methods face challenges such as spectral similarity between buildings and backgrounds,sensor variations,and insufficient computational efficiency.To address these challenges,this paper proposes a novel Multi-scale Efficient Wavelet-based Change Detection Network(MewCDNet),which integrates the advantages of Convolutional Neural Networks and Transformers,balances computational costs,and achieves high-performance building change detection.The network employs EfficientNet-B4 as the backbone for hierarchical feature extraction,integrates multi-level feature maps through a multi-scale fusion strategy,and incorporates two key modules:Cross-temporal Difference Detection(CTDD)and Cross-scale Wavelet Refinement(CSWR).CTDD adopts a dual-branch architecture that combines pixel-wise differencing with semanticaware Euclidean distance weighting to enhance the distinction between true changes and background noise.CSWR integrates Haar-based Discrete Wavelet Transform with multi-head cross-attention mechanisms,enabling cross-scale feature fusion while significantly improving edge localization and suppressing spurious changes.Extensive experiments on four benchmark datasets demonstrate MewCDNet’s superiority over comparison methods:achieving F1 scores of 91.54%on LEVIR,93.70%on WHUCD,and 64.96%on S2Looking for building change detection.Furthermore,MewCDNet exhibits optimal performance on the multi-class⋅SYSU dataset(F1:82.71%),highlighting its exceptional generalization capability.
基金This study was supported by:Inner Mongolia Academy of Forestry Sciences Open Research Project(Grant No.KF2024MS03)The Project to Improve the Scientific Research Capacity of the Inner Mongolia Academy of Forestry Sciences(Grant No.2024NLTS04)The Innovation and Entrepreneurship Training Program for Undergraduates of Beijing Forestry University(Grant No.X202410022268).
文摘Remote sensing image super-resolution technology is pivotal for enhancing image quality in critical applications including environmental monitoring,urban planning,and disaster assessment.However,traditional methods exhibit deficiencies in detail recovery and noise suppression,particularly when processing complex landscapes(e.g.,forests,farmlands),leading to artifacts and spectral distortions that limit practical utility.To address this,we propose an enhanced Super-Resolution Generative Adversarial Network(SRGAN)framework featuring three key innovations:(1)Replacement of L1/L2 loss with a robust Charbonnier loss to suppress noise while preserving edge details via adaptive gradient balancing;(2)A multi-loss joint optimization strategy dynamically weighting Charbonnier loss(β=0.5),Visual Geometry Group(VGG)perceptual loss(α=1),and adversarial loss(γ=0.1)to synergize pixel-level accuracy and perceptual quality;(3)A multi-scale residual network(MSRN)capturing cross-scale texture features(e.g.,forest canopies,mountain contours).Validated on Sentinel-2(10 m)and SPOT-6/7(2.5 m)datasets covering 904 km2 in Motuo County,Xizang,our method outperforms the SRGAN baseline(SR4RS)with Peak Signal-to-Noise Ratio(PSNR)gains of 0.29 dB and Structural Similarity Index(SSIM)improvements of 3.08%on forest imagery.Visual comparisons confirm enhanced texture continuity despite marginal Learned Perceptual Image Patch Similarity(LPIPS)increases.The method significantly improves noise robustness and edge retention in complex geomorphology,demonstrating 18%faster response in forest fire early warning and providing high-resolution support for agricultural/urban monitoring.Future work will integrate spectral constraints and lightweight architectures.
基金National Natural Science Foundation of China(No.32161143036,No.82173633,No.81960374)Science and Technology research project of Shanghai Municipal Health Commission(No.20194Y0359)National Key Research and Development Program of China(No.2021YFC2300800,2021YFC2300803)
文摘Background China is progressing towards the goal of schistosomiasis elimination,but there are still some problems,such as difficult management of infection source and snail control.This study aimed to develop deep learning models with high-resolution remote sensing images for recognizing and monitoring livestock bovine,which is an intermediate source of Schistosoma japonicum infection,and to evaluate the effectiveness of the models for real-world application.Methods The dataset of livestock bovine’s spatial distribution was collected from the Chinese National Platform for Common Geospatial Information Services.The high-resolution remote sensing images were further divided into training data,test data,and validation data for model development.Two recognition models based on deep learning methods(ENVINet5 and Mask R-CNN)were developed with reference to the training datasets.The performance of the developed models was evaluated by the performance metrics of precision,recall,and F1-score.Results A total of 50 typical image areas were selected,1125 bovine objectives were labeled by the ENVINet5 model and 1277 bovine objectives were labeled by the Mask R-CNN model.For the ENVINet5 model,a total of 1598 records of bovine distribution were recognized.The model precision and recall were 81.9%and 80.2%,respectively.The F1 score was 0.81.For the Mask R-CNN mode,1679 records of bovine objectives were identified.The model precision and recall were 87.3%and 85.2%,respectively.The F1 score was 0.87.When applying the developed models to real-world schistosomiasis-endemic regions,there were 63 bovine objectives in the original image,53 records were extracted using the ENVINet5 model,and 57 records were extracted using the Mask R-CNN model.The successful recognition ratios were 84.1%and 90.5%for the respectively developed models.Conclusion The ENVINet5 model is very feasible when the bovine distribution is low in structure with few samples.The Mask R-CNN model has a good framework design and runs highly efficiently.The livestock recognition models developed using deep learning methods with high-resolution remote sensing images accurately recognize the spatial distribution of livestock,which could enable precise control of schistosomiasis.
基金supported by the Shenzhen Science and Technology Innovation Project(No.ZDSYS20210623091808026)supported in part by the National Natural Science Foundation of China(General Program,No.42071351)+1 种基金the National Key Research and Development Program of China(No.2020YFA0608501)the Chongqing Science and Technology Bureau technology innovation and application development special(cstc2021jscx-gksb0116).
文摘Forest is the largest carbon reservoir and carbon absorber on earth.Thus,mapping forest cover change accurately is of great significance to achieving the global carbon neutrality goal.Accurate forest change information could be acquired by deep learning methods using high-resolution remote sensing images.However,deforestation detection based on deep learning on a large-scale region with high-resolution images required huge computational resources.Therefore,there was an urgent need for a fast and accurate deforestation detection model.In this study,we proposed an interesting but effective re-parameterization deforestation detection model,named RepDDNet.Unlike other existing models designed for deforestation detection,the main feature of RepDDNet was its decoupling feature,which means that it allowed the multi-branch structure in the training stages to be converted into a plain structure in the inference stage,thus the computation efficiency can be significantly improved in the inference stage while maintaining the accuracy unchanged.A large-scale experiment was carried out in Ankang city with 2-meter high-resolution remote sensing images(the total area of it was over 20,000 square kilometers),and the result indicated that the model computation efficiency could be improved by nearly 30%compared with the model without re-parameterization.Additionally,compared with other lightweight models,RepDDNet also displayed a trade-off between accuracy and computation efficiency.
基金supported by the Third Xinjiang Scientific Expedition Program(Grant 2022xjkk0600)。
文摘Rapid building damage assessment following an earthquake is important for humanitarian relief and disaster emergency responses.In February 2023,two magnitude-7.8 earthquakes struck Turkey in quick succession,impacting over 30 major cities across nearly 300 km.A quick and comprehensive understanding of the distribution of building damage is essential for e fficiently deploying rescue forces during critical rescue periods.This article presents the training of a two-stage convolutional neural network called BDANet that integrated image features captured before and after the disaster to evaluate the extent of building damage in Islahiye.Based on high-resolution remote sensing data from WorldView2,BDANet used predisaster imagery to extract building outlines;the image features before and after the disaster were then combined to conduct building damage assessment.We optimized these results to improve the accuracy of building edges and analyzed the damage to each building,and used population distribution information to estimate the population count and urgency of rescue at different disaster levels.The results indicate that the building area in the Islahiye region was 156.92 ha,with an affected area of 26.60 ha.Severely damaged buildings accounted for 15.67%of the total building area in the affected areas.WorldPop population distribution data indicated approximately 253,297,and 1,246 people in the collapsed,severely damaged,and lightly damaged areas,respectively.Accuracy verification showed that the BDANet model exhibited good performance in handling high-resolution images and can be used to directly assess building damage and provide rapid information for rescue operations in future disasters using model weights.