Measuring the transverse velocity field in high-resolution solar images is essential for understanding solar dynamics.This paper introduces an innovative unsupervised deep learning optical flow model designed to calcu...Measuring the transverse velocity field in high-resolution solar images is essential for understanding solar dynamics.This paper introduces an innovative unsupervised deep learning optical flow model designed to calculate the transverse velocity field,addressing the challenges of missing optical flow labels and the limited accuracy of velocity field measurements in high-resolution solar images.The proposed method converts the transverse velocity field computation problem into an optical flow computation problem,using two forward propagations of features to get rid of the reliance on optical flow labels.Additionally,it reduces the impact of the“Brightness Consistency”constraint on optical flow accuracy by identifying and handling optical flow outliers.We apply this method to compute the transverse velocity fields of high-resolution solar image sequences from the Hαand TiO bands,observed by the New Vacuum Solar Telescope.Comparative experiments with several wellestablished optical flow methods,including those based on supervised deep learning models,show that our approach outperforms the comparison methods according to key evaluation metrics such as Residual Map Mean,Residual Map Variance,Cross Correlation,and Structural Similarity Index Measure.Moreover,since optical flow captures the fundamental motion information in image sequences,the proposed method can be applied to a variety of research areas,including solar image registration,sequence alignment,image super-resolution,magnetic field calibration,and solar activity forecasting.The code is available at https://github.com/jackie-willianm/Transverse-Velocity-Field-Measurement-of-Solar-High-Resolution-Images.展开更多
Measurement of vegetation coverage on a small scale is the foundation for the monitoring of changes in vegetation coverage and of the inversion model of monitoring vegetation coverage on a large scale by remote sensin...Measurement of vegetation coverage on a small scale is the foundation for the monitoring of changes in vegetation coverage and of the inversion model of monitoring vegetation coverage on a large scale by remote sensing. Using the object-oriented analytical software, Definiens Professional 5, a new method for calculating vegetation coverage based on high-resolution images (aerial photographs or near-surface photography) is proposed. Our research supplies references to remote sensing measurements of vegetation coverage on a small scale and accurate fundamental data for the inversion model of vegetation coverage on a large and intermediate scale to improve the accuracy of remote sensing monitoring of changes in vegetation coverage.展开更多
Nowadays, remote sensing imagery, especially with its high spatialresolution, has become an indispensable tool to provide timely up-gradation of urban land use andland cover information, which is a prerequisite for pr...Nowadays, remote sensing imagery, especially with its high spatialresolution, has become an indispensable tool to provide timely up-gradation of urban land use andland cover information, which is a prerequisite for proper urban planning and management. Thepossible method described in the present paper to obtain urban land use types is based on theprinciple that land use can be derived from the land cover existing in a neighborhood. Here, movingwindow is used to represent the spatial pattern of land cover within a neighborhood and seven windowsizes (61mx61m, 68mx68m, 75mx75m, 87mx87m, 99mx99m, 110mx110m and 121mxl21m) are applied todetermining the most proper window size. Then, the unsupervised method of ISODATA is employed toclassify the layered land cover density maps obtained by the moving window. The results of accuracyevaluation show that the window size of 99mx99m is proper to infer urban land use categories and theproposed method has produced a land use map with a total accuracy of 85%.展开更多
Individual Tree Detection-and-Counting(ITDC)is among the important tasks in town areas,and numerous methods are proposed in this direction.Despite their many advantages,still,the proposed methods are inadequate to pro...Individual Tree Detection-and-Counting(ITDC)is among the important tasks in town areas,and numerous methods are proposed in this direction.Despite their many advantages,still,the proposed methods are inadequate to provide robust results because they mostly rely on the direct field investigations.This paper presents a novel approach involving high-resolution imagery and the Canopy-Height-Model(CHM)data to solve the ITDC problem.The new approach is studied in six urban scenes:farmland,woodland,park,industrial land,road and residential areas.First,it identifies tree canopy regions using a deep learning network from high-resolution imagery.It then deploys the CHM-data to detect treetops of the canopy regions using a local maximum algorithm and individual tree canopies using the region growing.Finally,it calculates and describes the number of individual trees and tree canopies.The proposed approach is experimented with the data from Shanghai,China.Our results show that the individual tree detection method had an average overall accuracy of 0.953,with a precision of 0.987 for woodland scene.Meanwhile,the R^(2) value for canopy segmentation in different urban scenes is greater than 0.780 and 0.779 for canopy area and diameter size,respectively.These results confirm that the proposed method is robust enough for urban tree planning and management.展开更多
Ephemeral gullies,which are widely developed worldwide and threaten farmlands,have aroused a growing concern.Identifying and mapping gullies are generally considered prerequisites of gully erosion assessment.However,e...Ephemeral gullies,which are widely developed worldwide and threaten farmlands,have aroused a growing concern.Identifying and mapping gullies are generally considered prerequisites of gully erosion assessment.However,ephemeral gully mapping remains a challenge.In this study,we proposed a flow-directional detection for identifying ephemeral gullies from high-resolution images and digital elevation models(DEMs).Ephemeral gullies exhibit clear linear features in high-resolution images.An edge detection operator was initially used to identify linear features from high-resolution images.Then,according to gully erosion mechanism,the flow-directional detection was designed.Edge images obtained from edge detection and flow directions obtained from DEMs were used to implement the flow-directional detection that detects ephemeral gullies along the flow direction.Results from ten study areas in the Loess Plateau of China showed that ranges of precision,recall,and Fmeasure are 6 o.66%-90.47%,65.74%-94.98%,and63.10%-91.93%,respectively.The proposed method is flexible and can be used with various images and DEMs.However,analysis of the effect of DEM resolution and accuracy showed that DEM resolution only demonstrates a minor effect on the detection results.Conversely,DEM accuracy influences the detection result and is more important than the DEM resolution.The worse the vertical accuracy of DEM,the lower the performance of the flow-directional detection will be.This work is beneficial to research related to monitoring gully erosion and assessing soil loss.展开更多
On the basis of a thorough understanding of the physical characteristics of remote sensing image, this paper employs the theories of wavelet transform and signal sampling to develop a new image fusion algorithm. The a...On the basis of a thorough understanding of the physical characteristics of remote sensing image, this paper employs the theories of wavelet transform and signal sampling to develop a new image fusion algorithm. The algorithm has been successfully applied to the image fusion of SPOT PAN and TM of Guangdong province, China. The experimental results show that a perfect image fusion can be built up by using the image analytical solution and re-construction in the image frequency domain based on the physical characteristics of the image formation. The method has demonstrated that the results of the image fusion do not change spectral characteristics of the original image.展开更多
Small-object detection has long been a challenge.High-megapixel cameras are used to solve this problem in industries.However,current detectors are inefficient for high-resolution images.In this work,we propose a new m...Small-object detection has long been a challenge.High-megapixel cameras are used to solve this problem in industries.However,current detectors are inefficient for high-resolution images.In this work,we propose a new module called Pre-Locate Net,which is a plug-and-play structure that can be combined with most popular detectors.We inspire the use of classification ideas to obtain candidate regions in images,greatly reducing the amount of calculation,and thus achieving rapid detection in high-resolution images.Pre-Locate Net mainly includes two parts,candidate region classification and behavior classification.Candidate region classification is used to obtain a candidate region,and behavior classification is used to estimate the scale of an object.Different follow-up processing is adopted according to different scales to balance the variance of the network input.Different from the popular candidate region generation method,we abandon the idea of regression of a bounding box and adopt the concept of classification,so as to realize the prediction of a candidate region in the shallow network.We build a high-resolution dataset of aircraft and landing gears covering complex scenes to verify the effectiveness of our method.Compared to state-of-the-art detectors(e.g.,Guided Anchoring,Libra-RCNN,and FASF),our method achieves the best m AP of 94.5 on 1920×1080 images at 16.7 FPS.展开更多
A large number of debris flow disasters(called Seismic debris flows) would occur after an earthquake, which can cause a great amount of damage. UAV low-altitude remote sensing technology has become a means of quickly ...A large number of debris flow disasters(called Seismic debris flows) would occur after an earthquake, which can cause a great amount of damage. UAV low-altitude remote sensing technology has become a means of quickly obtaining disaster information as it has the advantage of convenience and timeliness, but the spectral information of the image is so scarce, making it difficult to accurately detect the information of earthquake debris flow disasters. Based on the above problems, a seismic debris flow detection method based on transfer learning(TL) mechanism is proposed. On the basis of the constructed seismic debris flow disaster database, the features acquired from the training of the convolutional neural network(CNN) are transferred to the disaster information detection of the seismic debris flow. The automatic detection of earthquake debris flow disaster information is then completed, and the results of object-oriented seismic debris flow disaster information detection are compared and analyzed with the detection results supported by transfer learning.展开更多
High-resolution remote sensing images(HRSIs)are now an essential data source for gathering surface information due to advancements in remote sensing data capture technologies.However,their significant scale changes an...High-resolution remote sensing images(HRSIs)are now an essential data source for gathering surface information due to advancements in remote sensing data capture technologies.However,their significant scale changes and wealth of spatial details pose challenges for semantic segmentation.While convolutional neural networks(CNNs)excel at capturing local features,they are limited in modeling long-range dependencies.Conversely,transformers utilize multihead self-attention to integrate global context effectively,but this approach often incurs a high computational cost.This paper proposes a global-local multiscale context network(GLMCNet)to extract both global and local multiscale contextual information from HRSIs.A detail-enhanced filtering module(DEFM)is proposed at the end of the encoder to refine the encoder outputs further,thereby enhancing the key details extracted by the encoder and effectively suppressing redundant information.In addition,a global-local multiscale transformer block(GLMTB)is proposed in the decoding stage to enable the modeling of rich multiscale global and local information.We also design a stair fusion mechanism to transmit deep semantic information from deep to shallow layers progressively.Finally,we propose the semantic awareness enhancement module(SAEM),which further enhances the representation of multiscale semantic features through spatial attention and covariance channel attention.Extensive ablation analyses and comparative experiments were conducted to evaluate the performance of the proposed method.Specifically,our method achieved a mean Intersection over Union(mIoU)of 86.89%on the ISPRS Potsdam dataset and 84.34%on the ISPRS Vaihingen dataset,outperforming existing models such as ABCNet and BANet.展开更多
In complex media scattering,multiple scattering severely degrades the optical wavefront and results in blurred images,while the spectral distortion caused by the scattering effect leads to severe color distortion.Achi...In complex media scattering,multiple scattering severely degrades the optical wavefront and results in blurred images,while the spectral distortion caused by the scattering effect leads to severe color distortion.Achieving color high-resolution imaging through scattering media remains a significant challenge.Here,we propose a broadband,polarization-based method for color high-resolution imaging through scattering media.This approach enables high-resolution reconstruction by effectively separating the speckle illumination pattern from the mixed-scattering field information,leveraging polarization common-mode characteristics.Concurrently,it incorporates chromatic balance compensation to correct spectral aliasing in the scattered light field,enabling color high-resolution imaging through complex scattering media.To further optimize color distortion caused by scattering,a compensation strategy combining color constancy and white balance theory is adopted.Experimental results demonstrate that the proposed method significantly enhances both spatial resolution and color fidelity across various scattering conditions and target materials,showcasing strong adaptability and robustness.This approach provides an effective solution for achieving high-resolution color optical imaging in complex scattering environments.展开更多
While algorithms have been created for land usage in urban settings,there have been few investigations into the extraction of urban footprint(UF).To address this research gap,the study employs several widely used imag...While algorithms have been created for land usage in urban settings,there have been few investigations into the extraction of urban footprint(UF).To address this research gap,the study employs several widely used image classification method classified into three categories to evaluate their segmentation capabilities for extracting UF across eight cities.The results indicate that pixel-based methods only excel in clear urban environments,and their overall accuracy is not consistently high.RF and SVM perform well but lack stability in object-based UF extraction,influenced by feature selection and classifier performance.Deep learning enhances feature extraction but requires powerful computing and faces challenges with complex urban layouts.SAM excels in medium-sized urban areas but falters in intricate layouts.Integrating traditional and deep learning methods optimizes UF extraction,balancing accuracy and processing efficiency.Future research should focus on adapting algorithms for diverse urban landscapes to enhance UF extraction accuracy and applicability.展开更多
The commercial high-resolution imaging satellite with 1 m spatial resolution IKONOS is an important data source of information for urban planning and geographical information system (GIS) applications. In this paper, ...The commercial high-resolution imaging satellite with 1 m spatial resolution IKONOS is an important data source of information for urban planning and geographical information system (GIS) applications. In this paper, a morphological method is proposed. The proposed method combines the automatic thresholding and morphological operation techniques to extract the road centerline of the urban environment. This method intends to solve urban road centerline problems, vehicle, vegetation, building etc. Based on this morphological method, an object extractor is designed to extract road networks from highly remote sensing images. Some filters are applied in this experiment such as line reconstruction and region filling techniques to connect the disconnected road segments and remove the small redundant. Finally, the thinning algorithm is used to extract the road centerline. Experiments have been conducted on a high-resolution IKONOS and QuickBird images showing the efficiency of the proposed method.展开更多
Organoids possess immense potential for unraveling the intricate functions of human tissues and facilitating preclinical disease treatment.Their applications span from high-throughput drug screening to the modeling of...Organoids possess immense potential for unraveling the intricate functions of human tissues and facilitating preclinical disease treatment.Their applications span from high-throughput drug screening to the modeling of complex diseases,with some even achieving clinical translation.Changes in the overall size,shape,boundary,and other morphological features of organoids provide a noninvasive method for assessing organoid drug sensitivity.However,the precise segmentation of organoids in bright-field microscopy images is made difficult by the complexity of the organoid morphology and interference,including overlapping organoids,bubbles,dust particles,and cell fragments.This paper introduces the precision organoid segmentation technique(POST),which is a deep-learning algorithm for segmenting challenging organoids under simple bright-field imaging conditions.Unlike existing methods,POST accurately segments each organoid and eliminates various artifacts encountered during organoid culturing and imaging.Furthermore,it is sensitive to and aligns with measurements of organoid activity in drug sensitivity experiments.POST is expected to be a valuable tool for drug screening using organoids owing to its capability of automatically and rapidly eliminating interfering substances and thereby streamlining the organoid analysis and drug screening process.展开更多
Although Transformer-based image restoration methods have demonstrated impressive performance,existing Transformers still insufficiently exploit multiscale information.Previous non-Transformer-based studies have shown...Although Transformer-based image restoration methods have demonstrated impressive performance,existing Transformers still insufficiently exploit multiscale information.Previous non-Transformer-based studies have shown that incorporating multiscale features is crucial for improving restoration results.In this paper,we propose a multiscale Transformer(MST)that captures cross-scale attention among tokens,thereby effectively leveraging the multiscale patch recurrence prior of natural images.Furthermore,we introduce a channel-gate feed-forward network(CGFN)to enhance inter-channel information aggregation and reduce channel redundancy.To simultaneously utilise global,local and multiscale features,we design a multitype feature integration block(MFIB).Extensive experiments on both image super-resolution and HEVC compressed video artefact reduction demonstrate that the proposed MST achieves state-of-the-art performance.Ablation studies further verify the effectiveness of each proposed module.展开更多
Accurately counting dense objects in complex and diverse backgrounds is a significant challenge in computer vision,with applications ranging from crowd counting to various other object counting tasks.To address this,w...Accurately counting dense objects in complex and diverse backgrounds is a significant challenge in computer vision,with applications ranging from crowd counting to various other object counting tasks.To address this,we propose HUANNet(High-Resolution Unified Attention Network),a convolutional neural network designed to capture both local features and rich semantic information through a high-resolution representation learning framework,while optimizing computational distribution across parallel branches.HUANNet introduces three core modules:the High-Resolution Attention Module(HRAM),which enhances feature extraction by optimizing multiresolution feature fusion;the Unified Multi-Scale Attention Module(UMAM),which integrates spatial,channel,and convolutional kernel information through an attention mechanism applied across multiple levels of the network;and the Grid-Assisted Point Matching Module(GPMM),which stabilizes and improves point-to-point matching by leveraging grid-based mechanisms.Extensive experiments show that HUANNet achieves competitive results on the ShanghaiTech Part A/B crowd counting datasets and sets new state-of-the-art performance on dense object counting datasets such as CARPK and XRAY-IECCD,demonstrating the effectiveness and versatility of HUANNet.展开更多
The existence of absorption and reflection of light underwater leads to problems such as color distortion and blue-green bias in underwater images.In this study,a depthwise separable convolution-based generative adver...The existence of absorption and reflection of light underwater leads to problems such as color distortion and blue-green bias in underwater images.In this study,a depthwise separable convolution-based generative adversarial network(GAN)algorithm was proposed.Taking GAN as the basic framework,it combined a depthwise separable convolution module,attention mechanism,and reconstructed convolution module to realize the enhancement of underwater degraded images.Multi-scale features were captured by the depthwise separable convolution module,and the attention mechanism was utilized to enhance attention to important features.The reconstructed convolution module further extracts and fuses local and global features.Experimental results showed that the algorithm performs well in improving the color bias and blurring of underwater images,with PSNR reaching 27.835,SSIM reaching 0.883,UIQM reaching 3.205,and UCIQE reaching 0.713.The enhanced image outperforms the comparison algorithm in both subjective and objective metrics.展开更多
Near-infrared image sensors are widely used in fields such as material identification,machine vision,and autonomous driving.Lead sulfide colloidal quantum dot-based infrared photodiodes can be integrated with sil⁃icon...Near-infrared image sensors are widely used in fields such as material identification,machine vision,and autonomous driving.Lead sulfide colloidal quantum dot-based infrared photodiodes can be integrated with sil⁃icon-based readout circuits in a single step.Based on this,we propose a photodiode based on an n-i-p structure,which removes the buffer layer and further simplifies the manufacturing process of quantum dot image sensors,thus reducing manufacturing costs.Additionally,for the noise complexity in quantum dot image sensors when capturing images,traditional denoising and non-uniformity methods often do not achieve optimal denoising re⁃sults.For the noise and stripe-type non-uniformity commonly encountered in infrared quantum dot detector imag⁃es,a network architecture has been developed that incorporates multiple key modules.This network combines channel attention and spatial attention mechanisms,dynamically adjusting the importance of feature maps to en⁃hance the ability to distinguish between noise and details.Meanwhile,the residual dense feature fusion module further improves the network's ability to process complex image structures through hierarchical feature extraction and fusion.Furthermore,the pyramid pooling module effectively captures information at different scales,improv⁃ing the network's multi-scale feature representation ability.Through the collaborative effect of these modules,the network can better handle various mixed noise and image non-uniformity issues.Experimental results show that it outperforms the traditional U-Net network in denoising and image correction tasks.展开更多
In the image fusion field,fusing infrared images(IRIs)and visible images(VIs)excelled is a key area.The differences between IRIs and VIs make it challenging to fuse both types into a high-quality image.Accordingly,eff...In the image fusion field,fusing infrared images(IRIs)and visible images(VIs)excelled is a key area.The differences between IRIs and VIs make it challenging to fuse both types into a high-quality image.Accordingly,efficiently combining the advantages of both images while overcoming their shortcomings is necessary.To handle this challenge,we developed an end-to-end IRI andVI fusionmethod based on frequency decomposition and enhancement.By applying concepts from frequency domain analysis,we used the layering mechanism to better capture the salient thermal targets from the IRIs and the rich textural information from the VIs,respectively,significantly boosting the image fusion quality and effectiveness.In addition,the backbone network combined Restormer Blocks and Dense Blocks;Restormer blocks utilize global attention to extract shallow features.Meanwhile,Dense Blocks ensure the integration between shallow and deep features,thereby avoiding the loss of shallow attributes.Extensive experiments on TNO and MSRS datasets demonstrated that the suggested method achieved state-of-the-art(SOTA)performance in various metrics:Entropy(EN),Mutual Information(MI),Standard Deviation(SD),The Structural Similarity Index Measure(SSIM),Fusion quality(Qabf),MI of the pixel(FMI_(pixel)),and modified Visual Information Fidelity(VIF_(m)).展开更多
Microscopy imaging is fundamental in analyzing bacterial morphology and dynamics,offering critical insights into bacterial physiology and pathogenicity.Image segmentation techniques enable quantitative analysis of bac...Microscopy imaging is fundamental in analyzing bacterial morphology and dynamics,offering critical insights into bacterial physiology and pathogenicity.Image segmentation techniques enable quantitative analysis of bacterial structures,facilitating precise measurement of morphological variations and population behaviors at single-cell resolution.This paper reviews advancements in bacterial image segmentation,emphasizing the shift from traditional thresholding and watershed methods to deep learning-driven approaches.Convolutional neural networks(CNNs),U-Net architectures,and three-dimensional(3D)frameworks excel at segmenting dense biofilms and resolving antibiotic-induced morphological changes.These methods combine automated feature extraction with physics-informed postprocessing.Despite progress,challenges persist in computational efficiency,cross-species generalizability,and integration with multimodal experimental workflows.Future progress will depend on improving model robustness across species and imaging modalities,integrating multimodal data for phenotype-function mapping,and developing standard pipelines that link computational tools with clinical diagnostics.These innovations will expand microbial phenotyping beyond structural analysis,enabling deeper insights into bacterial physiology and ecological interactions.展开更多
基金supported by the National Natural Science Foundation of China(NSFC,Grant Nos.12063002 and 12163004).
文摘Measuring the transverse velocity field in high-resolution solar images is essential for understanding solar dynamics.This paper introduces an innovative unsupervised deep learning optical flow model designed to calculate the transverse velocity field,addressing the challenges of missing optical flow labels and the limited accuracy of velocity field measurements in high-resolution solar images.The proposed method converts the transverse velocity field computation problem into an optical flow computation problem,using two forward propagations of features to get rid of the reliance on optical flow labels.Additionally,it reduces the impact of the“Brightness Consistency”constraint on optical flow accuracy by identifying and handling optical flow outliers.We apply this method to compute the transverse velocity fields of high-resolution solar image sequences from the Hαand TiO bands,observed by the New Vacuum Solar Telescope.Comparative experiments with several wellestablished optical flow methods,including those based on supervised deep learning models,show that our approach outperforms the comparison methods according to key evaluation metrics such as Residual Map Mean,Residual Map Variance,Cross Correlation,and Structural Similarity Index Measure.Moreover,since optical flow captures the fundamental motion information in image sequences,the proposed method can be applied to a variety of research areas,including solar image registration,sequence alignment,image super-resolution,magnetic field calibration,and solar activity forecasting.The code is available at https://github.com/jackie-willianm/Transverse-Velocity-Field-Measurement-of-Solar-High-Resolution-Images.
基金funded by the National Natural Science Foundation of China(Grant No.40571029).
文摘Measurement of vegetation coverage on a small scale is the foundation for the monitoring of changes in vegetation coverage and of the inversion model of monitoring vegetation coverage on a large scale by remote sensing. Using the object-oriented analytical software, Definiens Professional 5, a new method for calculating vegetation coverage based on high-resolution images (aerial photographs or near-surface photography) is proposed. Our research supplies references to remote sensing measurements of vegetation coverage on a small scale and accurate fundamental data for the inversion model of vegetation coverage on a large and intermediate scale to improve the accuracy of remote sensing monitoring of changes in vegetation coverage.
基金Under the auspices of Jiangsu Provincial Natural ScienceFoundation(No .BK2002420 )
文摘Nowadays, remote sensing imagery, especially with its high spatialresolution, has become an indispensable tool to provide timely up-gradation of urban land use andland cover information, which is a prerequisite for proper urban planning and management. Thepossible method described in the present paper to obtain urban land use types is based on theprinciple that land use can be derived from the land cover existing in a neighborhood. Here, movingwindow is used to represent the spatial pattern of land cover within a neighborhood and seven windowsizes (61mx61m, 68mx68m, 75mx75m, 87mx87m, 99mx99m, 110mx110m and 121mxl21m) are applied todetermining the most proper window size. Then, the unsupervised method of ISODATA is employed toclassify the layered land cover density maps obtained by the moving window. The results of accuracyevaluation show that the window size of 99mx99m is proper to infer urban land use categories and theproposed method has produced a land use map with a total accuracy of 85%.
基金supported by the project funded by International Research Center of Big Data for Sustainable 740 Development Goals[Grant Number CBAS2022GSP07]Fundamental Research Funds for the Central Universities,Chongqing Natural Science Foundation[Grant Number CSTB2022NSCQMSX 2069]Ministry of Education of China[Grant Number 19JZD023].
文摘Individual Tree Detection-and-Counting(ITDC)is among the important tasks in town areas,and numerous methods are proposed in this direction.Despite their many advantages,still,the proposed methods are inadequate to provide robust results because they mostly rely on the direct field investigations.This paper presents a novel approach involving high-resolution imagery and the Canopy-Height-Model(CHM)data to solve the ITDC problem.The new approach is studied in six urban scenes:farmland,woodland,park,industrial land,road and residential areas.First,it identifies tree canopy regions using a deep learning network from high-resolution imagery.It then deploys the CHM-data to detect treetops of the canopy regions using a local maximum algorithm and individual tree canopies using the region growing.Finally,it calculates and describes the number of individual trees and tree canopies.The proposed approach is experimented with the data from Shanghai,China.Our results show that the individual tree detection method had an average overall accuracy of 0.953,with a precision of 0.987 for woodland scene.Meanwhile,the R^(2) value for canopy segmentation in different urban scenes is greater than 0.780 and 0.779 for canopy area and diameter size,respectively.These results confirm that the proposed method is robust enough for urban tree planning and management.
基金funded by the National Natural Science Foundation of China (Grant No. 41930102, 41971333, 41771415, and 41701449)the Priority Academic Program Development of Jiangsu Higher Education Institutions (Grant No. 164320H116)the Open Fund of Key Laboratory for Synergistic Prevention of Water and Soil Environmental Pollution (Grant No. KLSPWSEPA04)。
文摘Ephemeral gullies,which are widely developed worldwide and threaten farmlands,have aroused a growing concern.Identifying and mapping gullies are generally considered prerequisites of gully erosion assessment.However,ephemeral gully mapping remains a challenge.In this study,we proposed a flow-directional detection for identifying ephemeral gullies from high-resolution images and digital elevation models(DEMs).Ephemeral gullies exhibit clear linear features in high-resolution images.An edge detection operator was initially used to identify linear features from high-resolution images.Then,according to gully erosion mechanism,the flow-directional detection was designed.Edge images obtained from edge detection and flow directions obtained from DEMs were used to implement the flow-directional detection that detects ephemeral gullies along the flow direction.Results from ten study areas in the Loess Plateau of China showed that ranges of precision,recall,and Fmeasure are 6 o.66%-90.47%,65.74%-94.98%,and63.10%-91.93%,respectively.The proposed method is flexible and can be used with various images and DEMs.However,analysis of the effect of DEM resolution and accuracy showed that DEM resolution only demonstrates a minor effect on the detection results.Conversely,DEM accuracy influences the detection result and is more important than the DEM resolution.The worse the vertical accuracy of DEM,the lower the performance of the flow-directional detection will be.This work is beneficial to research related to monitoring gully erosion and assessing soil loss.
基金ProjectsupportedbytheNationalNaturalScienceFoundationofChina (No .40 0 2 30 0 4 ) .
文摘On the basis of a thorough understanding of the physical characteristics of remote sensing image, this paper employs the theories of wavelet transform and signal sampling to develop a new image fusion algorithm. The algorithm has been successfully applied to the image fusion of SPOT PAN and TM of Guangdong province, China. The experimental results show that a perfect image fusion can be built up by using the image analytical solution and re-construction in the image frequency domain based on the physical characteristics of the image formation. The method has demonstrated that the results of the image fusion do not change spectral characteristics of the original image.
基金the National Science Fund for Distinguished Young Scholars of China (No. 51625501)the Aeronautical Science Foundation of China (No. 201946051002)
文摘Small-object detection has long been a challenge.High-megapixel cameras are used to solve this problem in industries.However,current detectors are inefficient for high-resolution images.In this work,we propose a new module called Pre-Locate Net,which is a plug-and-play structure that can be combined with most popular detectors.We inspire the use of classification ideas to obtain candidate regions in images,greatly reducing the amount of calculation,and thus achieving rapid detection in high-resolution images.Pre-Locate Net mainly includes two parts,candidate region classification and behavior classification.Candidate region classification is used to obtain a candidate region,and behavior classification is used to estimate the scale of an object.Different follow-up processing is adopted according to different scales to balance the variance of the network input.Different from the popular candidate region generation method,we abandon the idea of regression of a bounding box and adopt the concept of classification,so as to realize the prediction of a candidate region in the shallow network.We build a high-resolution dataset of aircraft and landing gears covering complex scenes to verify the effectiveness of our method.Compared to state-of-the-art detectors(e.g.,Guided Anchoring,Libra-RCNN,and FASF),our method achieves the best m AP of 94.5 on 1920×1080 images at 16.7 FPS.
基金supported by the National Natural Science Foundation of China(41701499)the Sichuan Science and Technology Program(2018GZ0265)the Geomatics Technology and Application Key Laboratory of Qinghai Province(QHDX-2018-07)
文摘A large number of debris flow disasters(called Seismic debris flows) would occur after an earthquake, which can cause a great amount of damage. UAV low-altitude remote sensing technology has become a means of quickly obtaining disaster information as it has the advantage of convenience and timeliness, but the spectral information of the image is so scarce, making it difficult to accurately detect the information of earthquake debris flow disasters. Based on the above problems, a seismic debris flow detection method based on transfer learning(TL) mechanism is proposed. On the basis of the constructed seismic debris flow disaster database, the features acquired from the training of the convolutional neural network(CNN) are transferred to the disaster information detection of the seismic debris flow. The automatic detection of earthquake debris flow disaster information is then completed, and the results of object-oriented seismic debris flow disaster information detection are compared and analyzed with the detection results supported by transfer learning.
基金provided by the Science Research Project of Hebei Education Department under grant No.BJK2024115.
文摘High-resolution remote sensing images(HRSIs)are now an essential data source for gathering surface information due to advancements in remote sensing data capture technologies.However,their significant scale changes and wealth of spatial details pose challenges for semantic segmentation.While convolutional neural networks(CNNs)excel at capturing local features,they are limited in modeling long-range dependencies.Conversely,transformers utilize multihead self-attention to integrate global context effectively,but this approach often incurs a high computational cost.This paper proposes a global-local multiscale context network(GLMCNet)to extract both global and local multiscale contextual information from HRSIs.A detail-enhanced filtering module(DEFM)is proposed at the end of the encoder to refine the encoder outputs further,thereby enhancing the key details extracted by the encoder and effectively suppressing redundant information.In addition,a global-local multiscale transformer block(GLMTB)is proposed in the decoding stage to enable the modeling of rich multiscale global and local information.We also design a stair fusion mechanism to transmit deep semantic information from deep to shallow layers progressively.Finally,we propose the semantic awareness enhancement module(SAEM),which further enhances the representation of multiscale semantic features through spatial attention and covariance channel attention.Extensive ablation analyses and comparative experiments were conducted to evaluate the performance of the proposed method.Specifically,our method achieved a mean Intersection over Union(mIoU)of 86.89%on the ISPRS Potsdam dataset and 84.34%on the ISPRS Vaihingen dataset,outperforming existing models such as ABCNet and BANet.
基金supported by the National Natural Science Foundation of China (Grant Nos. 62405231, 62405235, and 62575229)the National Key Laboratory of Space Target Awareness (Grant Nos. STA2024KGL0203, STA2024ZCA0203, and STA-24-04-05)+3 种基金the Beijing Key Laboratory of Advanced Optical Remote Sensing Technology (Grant No. AORS202405)the China Postdoctoral Science Foundation (Grant No. 2024M762527)the Shaanxi Province High-level Innovation and Entrepreneurship Talent Program (Grant No. H02439005)the Natural Science Foundation of Shaanxi (Grant Nos. S2024-JC-JCQN-60, S2025-JCQYTS-0107, and 2025JC-QYCX-05)。
文摘In complex media scattering,multiple scattering severely degrades the optical wavefront and results in blurred images,while the spectral distortion caused by the scattering effect leads to severe color distortion.Achieving color high-resolution imaging through scattering media remains a significant challenge.Here,we propose a broadband,polarization-based method for color high-resolution imaging through scattering media.This approach enables high-resolution reconstruction by effectively separating the speckle illumination pattern from the mixed-scattering field information,leveraging polarization common-mode characteristics.Concurrently,it incorporates chromatic balance compensation to correct spectral aliasing in the scattered light field,enabling color high-resolution imaging through complex scattering media.To further optimize color distortion caused by scattering,a compensation strategy combining color constancy and white balance theory is adopted.Experimental results demonstrate that the proposed method significantly enhances both spatial resolution and color fidelity across various scattering conditions and target materials,showcasing strong adaptability and robustness.This approach provides an effective solution for achieving high-resolution color optical imaging in complex scattering environments.
文摘While algorithms have been created for land usage in urban settings,there have been few investigations into the extraction of urban footprint(UF).To address this research gap,the study employs several widely used image classification method classified into three categories to evaluate their segmentation capabilities for extracting UF across eight cities.The results indicate that pixel-based methods only excel in clear urban environments,and their overall accuracy is not consistently high.RF and SVM perform well but lack stability in object-based UF extraction,influenced by feature selection and classifier performance.Deep learning enhances feature extraction but requires powerful computing and faces challenges with complex urban layouts.SAM excels in medium-sized urban areas but falters in intricate layouts.Integrating traditional and deep learning methods optimizes UF extraction,balancing accuracy and processing efficiency.Future research should focus on adapting algorithms for diverse urban landscapes to enhance UF extraction accuracy and applicability.
文摘The commercial high-resolution imaging satellite with 1 m spatial resolution IKONOS is an important data source of information for urban planning and geographical information system (GIS) applications. In this paper, a morphological method is proposed. The proposed method combines the automatic thresholding and morphological operation techniques to extract the road centerline of the urban environment. This method intends to solve urban road centerline problems, vehicle, vegetation, building etc. Based on this morphological method, an object extractor is designed to extract road networks from highly remote sensing images. Some filters are applied in this experiment such as line reconstruction and region filling techniques to connect the disconnected road segments and remove the small redundant. Finally, the thinning algorithm is used to extract the road centerline. Experiments have been conducted on a high-resolution IKONOS and QuickBird images showing the efficiency of the proposed method.
基金supported by the National Key R&D Program of China(No.2022YFC2504403)the National Natural Science Foundation of China(No.62172202)+1 种基金the Experiment Project of China Manned Space Program(No.HYZHXM01019)the Fundamental Research Funds for the Central Universities from Southeast University(No.3207032101C3)。
文摘Organoids possess immense potential for unraveling the intricate functions of human tissues and facilitating preclinical disease treatment.Their applications span from high-throughput drug screening to the modeling of complex diseases,with some even achieving clinical translation.Changes in the overall size,shape,boundary,and other morphological features of organoids provide a noninvasive method for assessing organoid drug sensitivity.However,the precise segmentation of organoids in bright-field microscopy images is made difficult by the complexity of the organoid morphology and interference,including overlapping organoids,bubbles,dust particles,and cell fragments.This paper introduces the precision organoid segmentation technique(POST),which is a deep-learning algorithm for segmenting challenging organoids under simple bright-field imaging conditions.Unlike existing methods,POST accurately segments each organoid and eliminates various artifacts encountered during organoid culturing and imaging.Furthermore,it is sensitive to and aligns with measurements of organoid activity in drug sensitivity experiments.POST is expected to be a valuable tool for drug screening using organoids owing to its capability of automatically and rapidly eliminating interfering substances and thereby streamlining the organoid analysis and drug screening process.
基金supported in part by the National Natural Science Foundation of China under Grants 62101346 and 62301330the Guangdong Basic and Applied Basic Research Foundation under Grants 2021A1515011702 and 2022A1515110101+1 种基金the Shenzhen Science and Technology Programme under Grants JCYJ20240813141358076 and 20231121103807001the Guangdong Provincial Key Laboratory under Grant 2023B1212060076.
文摘Although Transformer-based image restoration methods have demonstrated impressive performance,existing Transformers still insufficiently exploit multiscale information.Previous non-Transformer-based studies have shown that incorporating multiscale features is crucial for improving restoration results.In this paper,we propose a multiscale Transformer(MST)that captures cross-scale attention among tokens,thereby effectively leveraging the multiscale patch recurrence prior of natural images.Furthermore,we introduce a channel-gate feed-forward network(CGFN)to enhance inter-channel information aggregation and reduce channel redundancy.To simultaneously utilise global,local and multiscale features,we design a multitype feature integration block(MFIB).Extensive experiments on both image super-resolution and HEVC compressed video artefact reduction demonstrate that the proposed MST achieves state-of-the-art performance.Ablation studies further verify the effectiveness of each proposed module.
基金funded by the National Natural Science Foundation of China(62273213,62472262,62572287)Natural Science Foundation of Shandong Province(ZR2024MF144)+1 种基金Natural Science Foundation of Shandong Province for Innovation and Development Joint Funds(ZR2022LZH001)Taishan Scholarship Construction Engineering.
文摘Accurately counting dense objects in complex and diverse backgrounds is a significant challenge in computer vision,with applications ranging from crowd counting to various other object counting tasks.To address this,we propose HUANNet(High-Resolution Unified Attention Network),a convolutional neural network designed to capture both local features and rich semantic information through a high-resolution representation learning framework,while optimizing computational distribution across parallel branches.HUANNet introduces three core modules:the High-Resolution Attention Module(HRAM),which enhances feature extraction by optimizing multiresolution feature fusion;the Unified Multi-Scale Attention Module(UMAM),which integrates spatial,channel,and convolutional kernel information through an attention mechanism applied across multiple levels of the network;and the Grid-Assisted Point Matching Module(GPMM),which stabilizes and improves point-to-point matching by leveraging grid-based mechanisms.Extensive experiments show that HUANNet achieves competitive results on the ShanghaiTech Part A/B crowd counting datasets and sets new state-of-the-art performance on dense object counting datasets such as CARPK and XRAY-IECCD,demonstrating the effectiveness and versatility of HUANNet.
文摘The existence of absorption and reflection of light underwater leads to problems such as color distortion and blue-green bias in underwater images.In this study,a depthwise separable convolution-based generative adversarial network(GAN)algorithm was proposed.Taking GAN as the basic framework,it combined a depthwise separable convolution module,attention mechanism,and reconstructed convolution module to realize the enhancement of underwater degraded images.Multi-scale features were captured by the depthwise separable convolution module,and the attention mechanism was utilized to enhance attention to important features.The reconstructed convolution module further extracts and fuses local and global features.Experimental results showed that the algorithm performs well in improving the color bias and blurring of underwater images,with PSNR reaching 27.835,SSIM reaching 0.883,UIQM reaching 3.205,and UCIQE reaching 0.713.The enhanced image outperforms the comparison algorithm in both subjective and objective metrics.
基金Supported by the National key research and development program in the 14th five year plan 2021YFA1200700)the National Natural Science Foundation of China(62535018,62431025,62561160113)the Natural Science Foundation of Shanghai(23ZR1473400).
文摘Near-infrared image sensors are widely used in fields such as material identification,machine vision,and autonomous driving.Lead sulfide colloidal quantum dot-based infrared photodiodes can be integrated with sil⁃icon-based readout circuits in a single step.Based on this,we propose a photodiode based on an n-i-p structure,which removes the buffer layer and further simplifies the manufacturing process of quantum dot image sensors,thus reducing manufacturing costs.Additionally,for the noise complexity in quantum dot image sensors when capturing images,traditional denoising and non-uniformity methods often do not achieve optimal denoising re⁃sults.For the noise and stripe-type non-uniformity commonly encountered in infrared quantum dot detector imag⁃es,a network architecture has been developed that incorporates multiple key modules.This network combines channel attention and spatial attention mechanisms,dynamically adjusting the importance of feature maps to en⁃hance the ability to distinguish between noise and details.Meanwhile,the residual dense feature fusion module further improves the network's ability to process complex image structures through hierarchical feature extraction and fusion.Furthermore,the pyramid pooling module effectively captures information at different scales,improv⁃ing the network's multi-scale feature representation ability.Through the collaborative effect of these modules,the network can better handle various mixed noise and image non-uniformity issues.Experimental results show that it outperforms the traditional U-Net network in denoising and image correction tasks.
基金funded by Anhui Province University Key Science and Technology Project(2024AH053415)Anhui Province University Major Science and Technology Project(2024AH040229)+3 种基金Talent Research Initiation Fund Project of Tongling University(2024tlxyrc019)Tongling University School-Level Scientific Research Project(2024tlxyptZD07)TheUniversity Synergy Innovation Programof Anhui Province(GXXT-2023-050)Tongling City Science and Technology Major Special Project(Unveiling and Commanding Model)(200401JB004).
文摘In the image fusion field,fusing infrared images(IRIs)and visible images(VIs)excelled is a key area.The differences between IRIs and VIs make it challenging to fuse both types into a high-quality image.Accordingly,efficiently combining the advantages of both images while overcoming their shortcomings is necessary.To handle this challenge,we developed an end-to-end IRI andVI fusionmethod based on frequency decomposition and enhancement.By applying concepts from frequency domain analysis,we used the layering mechanism to better capture the salient thermal targets from the IRIs and the rich textural information from the VIs,respectively,significantly boosting the image fusion quality and effectiveness.In addition,the backbone network combined Restormer Blocks and Dense Blocks;Restormer blocks utilize global attention to extract shallow features.Meanwhile,Dense Blocks ensure the integration between shallow and deep features,thereby avoiding the loss of shallow attributes.Extensive experiments on TNO and MSRS datasets demonstrated that the suggested method achieved state-of-the-art(SOTA)performance in various metrics:Entropy(EN),Mutual Information(MI),Standard Deviation(SD),The Structural Similarity Index Measure(SSIM),Fusion quality(Qabf),MI of the pixel(FMI_(pixel)),and modified Visual Information Fidelity(VIF_(m)).
基金financially supported by the Open Project Program of Wuhan National Laboratory for Optoelectronics(No.2022WNLOKF009)the National Natural Science Foundation of China(No.62475216)+2 种基金the Key Research and Development Program of Shaanxi(No.2024GH-ZDXM-37)the Fujian Provincial Natural Science Foundation of China(No.2024J01060)the Startup Program of XMU,and the Fundamental Research Funds for the Central Universities.
文摘Microscopy imaging is fundamental in analyzing bacterial morphology and dynamics,offering critical insights into bacterial physiology and pathogenicity.Image segmentation techniques enable quantitative analysis of bacterial structures,facilitating precise measurement of morphological variations and population behaviors at single-cell resolution.This paper reviews advancements in bacterial image segmentation,emphasizing the shift from traditional thresholding and watershed methods to deep learning-driven approaches.Convolutional neural networks(CNNs),U-Net architectures,and three-dimensional(3D)frameworks excel at segmenting dense biofilms and resolving antibiotic-induced morphological changes.These methods combine automated feature extraction with physics-informed postprocessing.Despite progress,challenges persist in computational efficiency,cross-species generalizability,and integration with multimodal experimental workflows.Future progress will depend on improving model robustness across species and imaging modalities,integrating multimodal data for phenotype-function mapping,and developing standard pipelines that link computational tools with clinical diagnostics.These innovations will expand microbial phenotyping beyond structural analysis,enabling deeper insights into bacterial physiology and ecological interactions.