Semantic segmentation plays a foundational role in biomedical image analysis, providing precise information about cellular, tissue, and organ structures in both biological and medical imaging modalities. Traditional approaches often fail in the face of challenges such as low contrast, morphological variability, and densely packed structures. Recent advancements in deep learning have transformed segmentation capabilities through the integration of fine-scale detail preservation, coarse-scale contextual modeling, and multi-scale feature fusion. This work provides a comprehensive analysis of state-of-the-art deep learning models, including U-Net variants, attention-based frameworks, and Transformer-integrated networks, highlighting innovations that improve accuracy, generalizability, and computational efficiency. Key architectural components such as convolution operations, shallow and deep blocks, skip connections, and hybrid encoders are examined for their roles in enhancing spatial representation and semantic consistency. We further discuss the importance of hierarchical and instance-aware segmentation and annotation in interpreting complex biological scenes and multiplexed medical images. By bridging methodological developments with diverse application domains, this paper outlines current trends and future directions for semantic segmentation, emphasizing its critical role in facilitating annotation, diagnosis, and discovery in biomedical research.
Convolutional neural network (CNN)-based technologies have been widely used in medical image segmentation because of their strong representation and generalization abilities. However, because CNNs cannot effectively capture global information from images, they can easily lose contours and textures in segmentation results. The transformer model, by contrast, can effectively capture long-range dependencies in the image, and combining the CNN and the transformer can extract both the local details and the global contextual features of the image. Motivated by this, we propose a multi-branch and multi-scale attention network (M2ANet) for medical image segmentation, whose architecture consists of three components. In the first component, we construct an adaptive multi-branch patch module for parallel extraction of image features to reduce the information loss caused by downsampling. In the second component, we apply a residual block to the well-known convolutional block attention module to enhance the network’s ability to recognize important image features and to alleviate gradient vanishing. In the third component, we design a multi-scale feature fusion module, in which we adopt adaptive average pooling and position encoding to enhance contextual features, and then introduce multi-head attention to further enrich the feature representation. Finally, we validate the effectiveness and feasibility of the proposed M2ANet method through comparative experiments on four benchmark medical image segmentation datasets, particularly in the context of preserving contours and textures.
The fusion of infrared and visible images should emphasize the salient targets in the infrared image while preserving the textural details of the visible image. To meet these requirements, an autoencoder-based method for infrared and visible image fusion is proposed. The encoder, designed according to the optimization objective, consists of a base encoder and a detail encoder, which are used to extract low-frequency and high-frequency information from the image. This extraction may leave some information uncaptured, so a compensation encoder is proposed to supplement the missing information. Multi-scale decomposition is also employed to extract image features more comprehensively. The decoder combines low-frequency, high-frequency, and supplementary information to obtain multi-scale features. Subsequently, an attention strategy and a fusion module are introduced to perform multi-scale fusion for image reconstruction. Experimental results on three datasets show that the fused images generated by this network effectively retain salient targets while being more consistent with human visual perception.
Cemented paste backfill (CPB) is a technology that achieves safe mining by filling the goaf with waste rocks, tailings, and other materials. It is an inevitable choice to deal with the development of deep and highly difficult mines and meet the requirements of environmental protection and safety regulations. It promotes the development of a circular economy in mines through the development of low-grade resources and the resource utilization of waste, and extends the service life of mines. The mass concentration of solid content (abbreviated as “concentration”) is a critical parameter for CPB. However, discrepancies often arise between the on-site measurements and the pre-designed values due to factors such as groundwater inflow and segregation within the goaf, which cannot be evaluated after the solidification of CPB. This paper provides an in-situ, non-destructive approach to identify the real concentration of CPB after curing for a given number of days using hyperspectral imaging (HSI) technology. Initially, the spectral variation patterns under different concentration conditions were investigated through hyperspectral scanning experiments on CPB samples. The results demonstrate that as the CPB concentration increases from 61 wt% to 73 wt%, the overall spectral reflectance gradually increases, with two distinct absorption peaks observed at 1407 and 1917 nm. Notably, the reflectance at 1407 nm exhibited a strong linear relationship with the concentration. Subsequently, the K-nearest neighbors (KNN) and support vector machine (SVM) algorithms were employed to classify and identify different concentrations. The study revealed that, with the KNN algorithm, the highest accuracy was achieved when K (number of nearest neighbors) was 1, although this resulted in overfitting. When K = 3, the model displayed the optimal balance between accuracy and stability, with an accuracy of 95.03%. In the SVM algorithm, the highest accuracy of 98.24% was attained with parameters C (regularization parameter) = 200 and Gamma (kernel coefficient) = 10. A comparative analysis of precision, accuracy, and recall further highlighted that the SVM provided superior stability and precision for identifying CPB concentration. Thus, HSI technology offers an effective solution for the in-situ, non-destructive monitoring of CPB concentration, presenting a promising approach for optimizing and controlling CPB characteristic parameters.
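The effect of K described in the abstract above can be illustrated with a minimal KNN sketch. It is not the authors' pipeline: `knn_predict` is a hypothetical helper, and the single reflectance-like feature and the labels are made-up toy values, with one deliberately noisy label to show why K = 1 tends to follow individual samples while K = 3 smooths them out.

```python
from collections import Counter
import math

def knn_predict(train_x, train_y, query, k=3):
    """Return the majority label among the k training samples closest to query."""
    # Rank training indices by Euclidean distance to the query point.
    order = sorted(range(len(train_x)), key=lambda i: math.dist(train_x[i], query))
    votes = Counter(train_y[i] for i in order[:k])
    return votes.most_common(1)[0][0]

# Hypothetical toy data: one reflectance-like feature per sample; labels are
# concentration classes in wt%. The values are illustrative, not measured.
train_x = [(0.30,), (0.32,), (0.31,), (0.45,), (0.47,), (0.33,)]
train_y = [61, 61, 61, 73, 73, 73]  # the last sample carries a noisy label

query = (0.328,)
pred_k1 = knn_predict(train_x, train_y, query, k=1)  # follows the noisy neighbor
pred_k3 = knn_predict(train_x, train_y, query, k=3)  # majority vote of three
print(pred_k1, pred_k3)
```

With K = 1 the prediction copies the single nearest (mislabeled) sample, whereas with K = 3 the two correctly labeled neighbors outvote it.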
Synaptic pruning is a crucial process in synaptic refinement, eliminating unstable synaptic connections in neural circuits. This process is triggered and regulated primarily by spontaneous neural activity and experience-dependent mechanisms. The pruning process involves multiple molecular signals and a series of regulatory activities governing the “eat me” and “don't eat me” states. Under physiological conditions, the interaction between glial cells and neurons results in the clearance of unnecessary synapses, maintaining normal neural circuit functionality via synaptic pruning. Alterations in genetic and environmental factors can lead to imbalanced synaptic pruning, thus promoting the occurrence and development of autism spectrum disorder, schizophrenia, Alzheimer's disease, and other neurological disorders. In this review, we examine the molecular mechanisms responsible for synaptic pruning during neural development. We focus on how synaptic pruning can regulate neural circuits and on its association with neurological disorders. Furthermore, we discuss the application of emerging optical and imaging technologies to observe synaptic structure and function, as well as their potential for clinical translation. Our aim is to enhance the understanding of synaptic pruning during neural development, including the molecular basis underlying the regulation of synaptic function and the dynamic changes in synaptic density, and to investigate the potential role of these mechanisms in the pathophysiology of neurological diseases, thus providing a theoretical foundation for the treatment of neurological disorders.
This paper aims to develop a nonrigid registration method for preoperative and intraoperative thoracoabdominal CT images in computer-assisted interventional surgeries, for accurate tumor localization and enhanced tissue visualization. However, fine structure registration of complex thoracoabdominal organs and large deformation registration caused by respiratory motion are challenging. To deal with this problem, we propose a 3D multi-scale attention VoxelMorph (MA-VoxelMorph) registration network. To alleviate the large deformation problem, a multi-scale axial attention mechanism is utilized, using residual dilated pyramid pooling for multi-scale feature extraction and position-aware axial attention to capture long-distance dependencies between pixels. To further improve the large deformation and fine structure registration results, a multi-scale context channel attention mechanism is employed, utilizing content information via adjacent encoding layers. Our method was evaluated on four public lung datasets (DIR-Lab dataset, Creatis dataset, Learn2Reg dataset, OASIS dataset) and a local dataset. Results showed that the proposed method achieved better registration performance than current state-of-the-art methods, especially in handling the registration of large deformations and fine structures. It also proved to be fast in 3D image registration, taking about 1.5 s, which is faster than most methods. Qualitative and quantitative assessments showed that the proposed MA-VoxelMorph has the potential to realize precise and fast tumor localization in clinical interventional surgeries.
This paper introduces a novel method for medical image retrieval and classification by integrating a multi-scale encoding mechanism with Vision Transformer (ViT) architectures and a dynamic multi-loss function. The multi-scale encoding significantly enhances the model’s ability to capture both fine-grained and global features, while the dynamic loss function adapts during training to optimize classification accuracy and retrieval performance. Our approach was evaluated on the ISIC-2018 and ChestX-ray14 datasets, yielding notable improvements. Specifically, on the ISIC-2018 dataset, our method achieves an F1-Score improvement of +4.84% compared to the standard ViT, with a precision increase of +5.46% for melanoma (MEL). On the ChestX-ray14 dataset, the method delivers an F1-Score improvement of 5.3% over the conventional ViT, with precision gains of +5.0% for pneumonia (PNEU) and +5.4% for fibrosis (FIB). Experimental results demonstrate that our approach outperforms traditional CNN-based models and existing ViT variants, particularly in retrieving relevant medical cases and enhancing diagnostic accuracy. These findings highlight the potential of the proposed method for large-scale medical image analysis, offering improved tools for clinical decision-making through superior classification and case comparison.
The application of image super-resolution (SR) has brought significant assistance to the medical field, helping doctors make more precise diagnoses. However, relying solely on a convolutional neural network (CNN) for image SR may lead to issues such as blurry details and excessive smoothness. To address these limitations, we propose an algorithm based on the generative adversarial network (GAN) framework. In the generator network, three different sizes of convolutions connected by a residual dense structure are used to extract detailed features, and an attention mechanism combining channel and spatial information is applied to concentrate the computing power on crucial areas. In the discriminator network, using InstanceNorm to normalize tensors speeds up the training process while retaining feature information. The experimental results demonstrate that our algorithm achieves a higher peak signal-to-noise ratio (PSNR) and structural similarity index measure (SSIM) than other methods, resulting in improved visual quality.
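For readers unfamiliar with the PSNR metric reported above, the following is a minimal sketch of its standard definition, PSNR = 10·log10(MAX² / MSE). The helper name `psnr`, the 8-bit peak value of 255, and the tiny 2×2 images are assumptions made for illustration; they are not taken from the paper.

```python
import math

def psnr(reference, restored, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized images.

    Images are given as nested lists of pixel intensities; max_value is the
    assumed peak intensity (255 for 8-bit images).
    """
    ref = [p for row in reference for p in row]
    out = [p for row in restored for p in row]
    if len(ref) != len(out):
        raise ValueError("images must contain the same number of pixels")
    # Mean squared error over all pixels.
    mse = sum((a - b) ** 2 for a, b in zip(ref, out)) / len(ref)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)

# Hypothetical 2x2 images that differ by one intensity level at every pixel.
reference = [[100, 110], [120, 130]]
restored = [[101, 109], [121, 129]]
value = psnr(reference, restored)
print(round(value, 2))
```

A higher PSNR means a smaller mean squared error relative to the peak intensity; identical images give an infinite value.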
Mesenchymal stromal cell transplantation is an effective and promising approach for treating various systemic and diffuse diseases. However, the biological characteristics of transplanted mesenchymal stromal cells in humans remain unclear, including cell viability, distribution, migration, and fate. Conventional cell tracing methods cannot be used in the clinic. The use of superparamagnetic iron oxide nanoparticles as contrast agents allows for the observation of transplanted cells using magnetic resonance imaging. In 2016, the National Medical Products Administration of China approved a new superparamagnetic iron oxide nanoparticle, Ruicun, for use as a contrast agent in clinical trials. In the present study, an acute hemi-transection spinal cord injury model was established in beagle dogs. The injury was then treated by transplantation of Ruicun-labeled mesenchymal stromal cells. The results indicated that Ruicun-labeled mesenchymal stromal cells repaired damaged spinal cord fibers and partially restored neurological function in animals with acute spinal cord injury. T2*-weighted imaging revealed low signal areas on both sides of the injured spinal cord. The results of quantitative susceptibility mapping with ultrashort echo time sequences indicated that Ruicun-labeled mesenchymal stromal cells persisted stably within the injured spinal cord for over 4 weeks. These findings suggest that magnetic resonance imaging has the potential to effectively track the migration of Ruicun-labeled mesenchymal stromal cells and assess their ability to repair spinal cord injury.
Freezing of gait is a significant and debilitating motor symptom often observed in individuals with Parkinson's disease. Resting-state functional magnetic resonance imaging, along with its multi-level feature indices, has provided a fresh perspective and valuable insight into the study of freezing of gait in Parkinson's disease. It has been revealed that Parkinson's disease is accompanied by widespread irregularities in inherent brain network activity. However, the effective integration of the multi-level indices of resting-state functional magnetic resonance imaging into clinical settings for the diagnosis of freezing of gait in Parkinson's disease remains a challenge. Although previous studies have demonstrated that radiomics can extract optimal features as biomarkers to identify or predict diseases, a knowledge gap still exists in the field of freezing of gait in Parkinson's disease. This cross-sectional study aimed to evaluate the ability of radiomics features based on multi-level indices of resting-state functional magnetic resonance imaging, along with clinical features, to distinguish between Parkinson's disease patients with and without freezing of gait. We recruited 28 patients with Parkinson's disease who had freezing of gait (15 men and 13 women, average age 63 years) and 30 patients with Parkinson's disease who had no freezing of gait (16 men and 14 women, average age 64 years). Magnetic resonance imaging scans were obtained using a 3.0T scanner to extract the mean amplitude of low-frequency fluctuations, mean regional homogeneity, and degree centrality. Neurological and clinical characteristics were also evaluated. We used the least absolute shrinkage and selection operator algorithm to extract features and established feedforward neural network models based solely on resting-state functional magnetic resonance imaging indicators. We then performed predictive analysis of three distinct groups based on resting-state functional magnetic resonance imaging indicators combined with clinical features. Subsequently, we conducted 100 additional five-fold cross-validations to determine the most effective model for each classification task and evaluated the performance of the model using the area under the receiver operating characteristic curve. The results showed that when differentiating patients with Parkinson's disease who had freezing of gait from those who did not have freezing of gait, or from healthy controls, the models using only the mean regional homogeneity values achieved the highest area under the receiver operating characteristic curve values of 0.750 (with an accuracy of 70.9%) and 0.759 (with an accuracy of 65.3%), respectively. When classifying patients with Parkinson's disease who had freezing of gait from those who had no freezing of gait, the model using the mean amplitude of low-frequency fluctuation values combined with two clinical features achieved the highest area under the receiver operating characteristic curve of 0.847 (with an accuracy of 74.3%). The most significant features for patients with Parkinson's disease who had freezing of gait were amplitude of low-frequency fluctuation alterations in the left parahippocampal gyrus and two clinical characteristics: Montreal Cognitive Assessment and Hamilton Depression Scale scores. Our findings suggest that radiomics features derived from resting-state functional magnetic resonance imaging indices and clinical information can serve as valuable indices for the identification of freezing of gait in Parkinson's disease.
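The models above are compared by the area under the receiver operating characteristic curve (AUC). As a minimal sketch, AUC can be computed as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case, with ties counted as one half. The helper `roc_auc` and the labels and scores below are hypothetical; they are not the study's data or code.

```python
def roc_auc(labels, scores):
    """AUC via pairwise comparison of positive and negative scores.

    labels: 1 for the positive class, 0 for the negative class.
    scores: model outputs where larger means more likely positive.
    """
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    # Count positive/negative pairs ranked correctly; ties count as half.
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Hypothetical classifier outputs for six subjects.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.7, 0.4, 0.6, 0.3, 0.2]
auc = roc_auc(labels, scores)
print(round(auc, 3))
```

Here eight of the nine positive/negative pairs are ordered correctly, so the AUC is 8/9. In a cross-validated setting such as the one described, this quantity would be computed on each held-out fold and then averaged.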
Multi-label image classification is a challenging task due to the diverse sizes and complex backgrounds of objects in images. Obtaining class-specific precise representations at different scales is a key aspect of feature representation. However, existing methods often rely on the single-scale deep feature, neglecting shallow and deeper layer features, which poses challenges when predicting objects of varying scales within the same image. Although some studies have explored multi-scale features, they rarely address the flow of information between scales or efficiently obtain class-specific precise representations for features at different scales. To address these issues, we propose a two-stage, three-branch Transformer-based framework. The first stage incorporates multi-scale image feature extraction and hierarchical scale attention. This design enables the model to consider objects at various scales while enhancing the flow of information across different feature scales, improving the model’s generalization to diverse object scales. The second stage includes a global feature enhancement module and a region selection module. The global feature enhancement module strengthens interconnections between different image regions, mitigating the issue of incomplete representations, while the region selection module models the cross-modal relationships between image features and labels. Together, these components enable the efficient acquisition of class-specific precise feature representations. Extensive experiments on public datasets, including COCO2014, VOC2007, and VOC2012, demonstrate the effectiveness of our proposed method. Our approach achieves consistent performance gains of 0.3%, 0.4%, and 0.2% over state-of-the-art methods on the three datasets, respectively. These results validate the reliability and superiority of our approach for multi-label image classification.
Multimodal image fusion plays an important role in image analysis and applications. Multimodal medical image fusion helps to combine contrast features from two or more input imaging modalities to represent fused information in a single image. One of the critical clinical applications of medical image fusion is to fuse anatomical and functional modalities for rapid diagnosis of malignant tissues. This paper proposes a multimodal medical image fusion network (MMIF-Net) based on multiscale hybrid attention. The method first decomposes the original image to obtain the low-rank and significant parts. Then, to utilize the features at different scales, we add a multiscale mechanism that uses three filters of different sizes to extract the features in the encoded network. Also, a hybrid attention module is introduced to obtain more image details. Finally, the fused images are reconstructed by decoding the network. We conducted experiments with clinical images from brain computed tomography/magnetic resonance. The experimental results show that the multimodal medical image fusion network method based on multiscale hybrid attention works better than other advanced fusion methods.
The Pressure Sensitive Paint (PSP) technique has gained attention in recent years because of its significant benefits in measuring surface pressure on wind tunnel models. However, in the post-processing of PSP images, issues such as pressure taps, paint peeling, and contamination can lead to the loss of pressure data on the image, which seriously affects the subsequent calculation and analysis of pressure distribution. Therefore, image inpainting is particularly important in the post-processing of PSP images. Deep learning offers new methods for PSP image inpainting, but some basic characteristics of convolutional neural networks (CNNs) may limit their ability to handle restoration tasks. By contrast, the self-attention mechanism in the transformer can efficiently model nonlocal relationships among input features by generating adaptive attention scores. We therefore propose an efficient transformer network model for the PSP image inpainting task, named multi-scale dilated attention transformer (D-former). The model utilizes the redundancy of global dependency modeling in Vision Transformers (ViTs) to introduce multi-scale dilated attention (MDA); this mechanism effectively models the interaction between localized and sparse patches within the shifted window, achieving a better balance between computational complexity and receptive field. As a result, D-former allows efficient modeling of long-range features while using fewer parameters and lower computational costs. The experiments on two public datasets and the PSP dataset indicate that the method in this article performs better than several advanced methods. Through the verification of real wind tunnel tests, this method can accurately restore the luminescent intensity data of holes in PSP images, thereby improving the accuracy of full-field pressure data, and has a promising future in practical applications.
Computer-aided diagnosis (CAD) can detect tuberculosis (TB) cases, providing radiologists with more accurate and efficient diagnostic solutions. Various noise information in TB chest X-ray (CXR) images is a major challenge in this classification task. This study aims to propose a model with high performance in TB CXR image detection named multi-scale input mirror network (MIM-Net) based on CXR image symmetry, which consists of a multi-scale input feature extraction network and mirror loss. The multi-scale image input can enhance feature extraction, while the mirror loss can improve the network performance through self-supervision. We used a publicly available TB CXR image classification dataset to evaluate our proposed method via 5-fold cross-validation, with accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and area under curve (AUC) of 99.67%, 100%, 99.60%, 99.80%, 100%, and 0.9999, respectively. Compared to other models, MIM-Net performed best in all metrics. Therefore, the proposed MIM-Net can effectively help the network learn more features and can be used to detect TB in CXR images, thus assisting doctors in diagnosing.
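The diagnostic metrics listed above all derive from the four cells of a binary confusion matrix. The sketch below shows the standard definitions; the function name `diagnostic_metrics` and the counts are hypothetical and do not reproduce the reported results.

```python
def diagnostic_metrics(tp, fp, tn, fn):
    """Standard binary diagnostic metrics from confusion-matrix counts.

    tp/fn: diseased cases detected/missed; tn/fp: healthy cases cleared/flagged.
    """
    return {
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        "sensitivity": tp / (tp + fn),  # recall on diseased cases
        "specificity": tn / (tn + fp),  # recall on healthy cases
        "ppv": tp / (tp + fp),          # positive predictive value
        "npv": tn / (tn + fn),          # negative predictive value
    }

# Hypothetical counts for 100 diseased and 100 healthy images.
metrics = diagnostic_metrics(tp=90, fp=5, tn=95, fn=10)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Sensitivity and NPV both reach 100% exactly when there are no false negatives, which matches the pattern of the figures quoted in the abstract.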
Despite its remarkable performance on natural images, the segment anything model (SAM) lacks domain-specific information in medical imaging and faces the challenge of losing local multi-scale information in the encoding phase. This paper presents a medical image segmentation model based on SAM with a local multi-scale feature encoder (LMSFE-SAM) to address the issues above. Firstly, based on the SAM, a local multi-scale feature encoder is introduced to improve the representation of features within the local receptive field, thereby supplying the Vision Transformer (ViT) branch in SAM with enriched local multi-scale contextual information. At the same time, a multiaxial Hadamard product module (MHPM) is incorporated into the local multi-scale feature encoder in a lightweight manner to reduce the quadratic complexity and noise interference. Subsequently, a cross-branch balancing adapter is designed to balance the local and global information between the local multi-scale feature encoder and the ViT encoder in SAM. Finally, to obtain a smaller input image size and to mitigate overlapping in patch embeddings, the size of the input image is reduced from 1024×1024 pixels to 256×256 pixels, and a multidimensional information adaptation component is developed, which includes feature adapters, position adapters, and channel-spatial adapters. This component effectively integrates the information from small-sized medical images into SAM, enhancing its suitability for clinical deployment. The proposed model demonstrates an average enhancement ranging from 0.0387 to 0.3191 across six objective evaluation metrics on the BUSI, DDTI, and TN3K datasets compared to eight other representative image segmentation models. This significantly enhances the performance of the SAM on medical images, providing clinicians with a powerful tool in clinical diagnosis.
Imaging quality is a critical component of compressive imaging in real applications. In this study, we propose a compressive imaging method based on multi-scale modulation and reconstruction in the spatial frequency domain. Theoretical analysis and simulation show the relation between the measurement matrix resolution and compressive sensing (CS) imaging quality. The matrix design is improved to provide multi-scale modulations, followed by individual reconstruction of images of different spatial frequencies. Compared with traditional single-scale CS imaging, the multi-scale method provides high quality imaging in both high and low frequencies, and effectively decreases the overall reconstruction error. Experimental results confirm the feasibility of this technique, especially at low sampling rate. The method may thus be helpful in promoting the implementation of compressive imaging in real applications.
To improve image quality under low illumination conditions, a novel low-light image enhancement method is proposed in this paper based on multi-illumination estimation and multi-scale fusion (MIMS). Firstly, the illumination is processed by contrast-limited adaptive histogram equalization (CLAHE), adaptive complementary gamma function (ACG), and adaptive detail preserving S-curve (ADPS), respectively, to obtain three components. Then, the fusion-relevant features, exposure, and color contrast are selected as the weight maps. Subsequently, these components and weight maps are fused through multi-scale to generate enhanced illumination. Finally, the enhanced images are obtained by multiplying the enhanced illumination and reflectance. Compared with existing approaches, this proposed method achieves an average increase of 0.81% and 2.89% in the structural similarity index measurement (SSIM) and peak signal-to-noise ratio (PSNR), and a decrease of 6.17% and 32.61% in the natural image quality evaluator (NIQE) and gradient magnitude similarity deviation (GMSD), respectively.
An accurate and comprehensive understanding of shale pore structure is fundamental and critical for accurate reserves evaluation and efficient hydrocarbon development. Thus, by taking the shale of the Paleogene Eocene Shahejie Formation in the Jiyang Depression, Bohai Bay Basin, as an example, 2D and 3D multi-resolution images of the shale microstructure are obtained by multiple imaging technologies, including X-ray computed tomography, large-field scanning electron microscopy, scanning electron microscopy, and focused ion beam scanning electron microscopy. By integrating image processing and machine learning algorithms, the shale pore structure is characterized at a single scale and at multiple scales. The results are obtained as follows. First, the shale pore space in the study area is mainly composed of microfractures, inorganic pores, organic matters, and organic pores, and exclusively shows multi-scale characteristics. Second, there are various types of inorganic pores, and abundant dissolution pores; organic matters are distributed as strips and patches, and no organic pores are found in some organic matters. Third, pores with radius less than 20 nm account for 25%, those with radius between 20 and 50 nm account for 19%, those with radius between 50 and 100 nm account for 29%, those with radius between 100 and 500 nm account for 14%, those with radius between 500 nm and 20 μm account for 11%, and those with radius between 20 and 50 μm account for 2%. Fourth, the organic pores are less connected than the inorganic pores. The connectivity between organic pores and inorganic pores plays a crucial role in hydrocarbon migration, and microfractures control fluid flow channels. Fifth, pores with radius less than 50 nm are dominantly organic pores, those with radius between 50 and 500 nm are mainly organic and inorganic pores, and microfractures mainly contribute to the pores with radius more than 500 nm. It is concluded that a single imaging experiment cannot accurately and comprehensively reveal the multi-scale micro pore structure of a shale reservoir. Through integration of multiple imaging technologies and machine learning algorithms, the shale pore structure can be recognized and characterized at both a single scale and multiple scales. The proposed new method provides accurate and comprehensive information on multi-scale pore structures.
Although the Convolutional Neural Network (CNN) has shown great potential for land cover classification, the frequently used single-scale convolution kernel limits the scope of information extraction. Therefore, we propose a Multi-Scale Fully Convolutional Network (MSFCN) with a multi-scale convolutional kernel as well as a Channel Attention Block (CAB) and a Global Pooling Module (GPM) in this paper to exploit discriminative representations from two-dimensional (2D) satellite images. Meanwhile, to explore the ability of the proposed MSFCN for spatio-temporal images, we extend our MSFCN to three dimensions using a three-dimensional (3D) CNN, capable of harnessing each land cover category's time series interaction from the reshaped spatio-temporal remote sensing images. To verify the effectiveness of the proposed MSFCN, we conduct experiments on two spatial datasets and two spatio-temporal datasets. The proposed MSFCN achieves 60.366% on the WHDLD dataset and 75.127% on the GID dataset in terms of the mIoU index, while the figures for the two spatio-temporal datasets are 87.753% and 77.156%. Extensive comparative experiments and ablation studies demonstrate the effectiveness of the proposed MSFCN.
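The mIoU index quoted in the abstract above is commonly computed from a per-class confusion matrix. The following is a minimal sketch of that standard definition, not the authors' evaluation code; the function name `mean_iou` and the 3-class matrix are hypothetical and chosen only for illustration.

```python
def mean_iou(confusion):
    """Mean intersection-over-union from a square confusion matrix.

    confusion[i][j] counts pixels whose true class is i and whose
    predicted class is j. Classes with an empty union are skipped.
    """
    n = len(confusion)
    ious = []
    for c in range(n):
        tp = confusion[c][c]
        fn = sum(confusion[c]) - tp                          # true c, predicted other
        fp = sum(confusion[r][c] for r in range(n)) - tp     # predicted c, true other
        union = tp + fp + fn
        if union > 0:
            ious.append(tp / union)
    return sum(ious) / len(ious) if ious else 0.0


# Hypothetical 3-class confusion matrix (not data from the paper).
example = [
    [50, 5, 5],
    [10, 30, 0],
    [0, 10, 40],
]
print(round(mean_iou(example), 4))
```

Averaging per-class IoU gives rare land cover classes the same weight as frequent ones, which is one reason mIoU is often preferred over plain pixel accuracy for this kind of task.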
Because of the challenge of compounding lightweight, high-strength Ti/Al alloys due to their considerable disparity in properties, Al 6063 was proposed as an intermediate layer to fabricate a TC4/Al 6063/Al 7075 three-layer composite plate by explosive welding. The microscopic properties of each bonding interface were elucidated through field emission scanning electron microscopy and electron backscattered diffraction (EBSD). A methodology combining finite element method-smoothed particle hydrodynamics (FEM-SPH) and molecular dynamics (MD) was proposed for the multi-scale analysis of the forming and evolution characteristics of explosive welding interfaces. The results demonstrate that the bonding interface morphologies of TC4/Al 6063 and Al 6063/Al 7075 exhibit a flat and wavy configuration, without discernible defects or cracks. Grain refinement is observed in the vicinity of the two bonding interfaces. Furthermore, the degree of plastic deformation of TC4 and Al 7075 is more pronounced than that of Al 6063 in the intermediate layer. The interface morphology characteristics obtained by FEM-SPH simulation exhibit a high degree of similarity to the experimental results. MD simulations reveal that the diffusion of interfacial elements predominantly occurs during the unloading phase, and the simulated thickness of interfacial diffusion aligns well with experimental outcomes. The introduction of an intermediate layer in the explosive welding process can effectively produce high-quality titanium/aluminum alloy composite plates. Furthermore, this approach offers a multi-scale simulation strategy for the study of explosive welding bonding interfaces.
Funding: Open Access funding provided by the National Institutes of Health (NIH). The funding for this project was provided by the NCATS Intramural Fund.
Abstract: Semantic segmentation plays a foundational role in biomedical image analysis, providing precise information about cellular, tissue, and organ structures in both biological and medical imaging modalities. Traditional approaches often fail in the face of challenges such as low contrast, morphological variability, and densely packed structures. Recent advancements in deep learning have transformed segmentation capabilities through the integration of fine-scale detail preservation, coarse-scale contextual modeling, and multi-scale feature fusion. This work provides a comprehensive analysis of state-of-the-art deep learning models, including U-Net variants, attention-based frameworks, and Transformer-integrated networks, highlighting innovations that improve accuracy, generalizability, and computational efficiency. Key architectural components such as convolution operations, shallow and deep blocks, skip connections, and hybrid encoders are examined for their roles in enhancing spatial representation and semantic consistency. We further discuss the importance of hierarchical and instance-aware segmentation and annotation in interpreting complex biological scenes and multiplexed medical images. By bridging methodological developments with diverse application domains, this paper outlines current trends and future directions for semantic segmentation, emphasizing its critical role in facilitating annotation, diagnosis, and discovery in biomedical research.
Funding: Supported by the Natural Science Foundation of the Anhui Higher Education Institutions of China (Grant Nos. 2023AH040149 and 2024AH051915), the Anhui Provincial Natural Science Foundation (Grant No. 2208085MF168), the Science and Technology Innovation Tackle Plan Project of Maanshan (Grant No. 2024RGZN001), and the Scientific Research Fund Project of Anhui Medical University (Grant No. 2023xkj122).
Abstract: Convolutional neural network (CNN)-based technologies have been widely used in medical image segmentation because of their strong representation and generalization abilities. However, because CNNs cannot effectively capture global information from images, they can easily lose contours and textures in segmentation results. Note that the transformer model can effectively capture long-range dependencies in the image, and furthermore, combining the CNN and the transformer can effectively extract local details and global contextual features of the image. Motivated by this, we propose a multi-branch and multi-scale attention network (M2ANet) for medical image segmentation, whose architecture consists of three components. Specifically, in the first component, we construct an adaptive multi-branch patch module for parallel extraction of image features to reduce information loss caused by downsampling. In the second component, we apply a residual block to the well-known convolutional block attention module to enhance the network's ability to recognize important features of images and alleviate gradient vanishing. In the third component, we design a multi-scale feature fusion module, in which we adopt adaptive average pooling and position encoding to enhance contextual features, and then multi-head attention is introduced to further enrich feature representation. Finally, we validate the effectiveness and feasibility of the proposed M2ANet method through comparative experiments on four benchmark medical image segmentation datasets, particularly in the context of preserving contours and textures.
Funding: Supported by the Henan Province Key Research and Development Project (231111211300), the Central Government of Henan Province Guides Local Science and Technology Development Funds (Z20231811005), the Henan Province Key Research and Development Project (231111110100), the Henan Provincial Outstanding Foreign Scientist Studio (GZS2024006), and the Henan Provincial Joint Fund for Scientific and Technological Research and Development Plan (Application and Overcoming Technical Barriers) (242103810028).
Abstract: The fusion of infrared and visible images should emphasize the salient targets in the infrared image while preserving the textural details of the visible images. To meet these requirements, an autoencoder-based method for infrared and visible image fusion is proposed. The encoder, designed according to the optimization objective, consists of a base encoder and a detail encoder, which are used to extract low-frequency and high-frequency information from the image. This extraction may leave some information uncaptured, so a compensation encoder is proposed to supplement the missing information. Multi-scale decomposition is also employed to extract image features more comprehensively. The decoder combines low-frequency, high-frequency and supplementary information to obtain multi-scale features. Subsequently, the attention strategy and fusion module are introduced to perform multi-scale fusion for image reconstruction. Experimental results on three datasets show that the fused images generated by this network effectively retain salient targets while being more consistent with human visual perception.
Funding: Funded by the National Natural Science Foundation of China (Nos. 52474165 and 52522404).
Abstract: Cemented paste backfill (CPB) is a technology that achieves safe mining by filling the goaf with waste rocks, tailings, and other materials. It is an inevitable choice to deal with the development of deep and highly difficult mines and meet the requirements of environmental protection and safety regulations. It promotes the development of a circular economy in mines through the development of low-grade resources and the resource utilization of waste, and extends the service life of mines. The mass concentration of solid content (abbreviated as “concentration”) is a critical parameter for CPB. However, discrepancies often arise between the on-site measurements and the pre-designed values due to factors such as groundwater inflow and segregation within the goaf, which cannot be evaluated after the solidification of CPB. This paper provides an in-situ non-destructive approach to identify the real concentration of CPB after curing for a certain number of days using hyperspectral imaging (HSI) technology. Initially, the spectral variation patterns under different concentration conditions were investigated through hyperspectral scanning experiments on CPB samples. The results demonstrate that as the CPB concentration increases from 61wt% to 73wt%, the overall spectral reflectance gradually increases, with two distinct absorption peaks observed at 1407 and 1917 nm. Notably, the reflectance at 1407 nm exhibited a strong linear relationship with the concentration. Subsequently, the K-nearest neighbors (KNN) and support vector machine (SVM) algorithms were employed to classify and identify different concentrations. The study revealed that, with the KNN algorithm, the highest accuracy was achieved when K (number of nearest neighbors) was 1, although this resulted in overfitting. When K=3, the model displayed the optimal balance between accuracy and stability, with an accuracy of 95.03%. In the SVM algorithm, the highest accuracy of 98.24% was attained with parameters C (regularization parameter)=200 and Gamma (kernel coefficient)=10. A comparative analysis of precision, accuracy, and recall further highlighted that the SVM provided superior stability and precision for identifying CPB concentration. Thus, HSI technology offers an effective solution for the in-situ, non-destructive monitoring of CPB concentration, presenting a promising approach for optimizing and controlling CPB characteristic parameters.
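The trade-off reported above between K=1 (highest accuracy but overfitting) and K=3 (better stability) can be illustrated with a small K-nearest-neighbors sketch. This is a generic majority-vote KNN written for illustration, not the authors' pipeline; the two-feature "reflectance" samples, the deliberately mislabeled point, and the function name `knn_predict` are all hypothetical.

```python
import math
from collections import Counter


def knn_predict(train_x, train_y, query, k=3):
    """Classify `query` by majority vote among its k nearest training samples."""
    # Pair each training sample's Euclidean distance to the query with its label.
    ranked = sorted(
        ((math.dist(x, query), label) for x, label in zip(train_x, train_y)),
        key=lambda pair: pair[0],
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical reflectance features at two bands; labels are concentration
# classes in wt%. The last sample is a deliberately mislabeled noisy point.
train_x = [
    (0.30, 0.20), (0.31, 0.21), (0.32, 0.22),   # class 61
    (0.40, 0.30), (0.41, 0.31), (0.42, 0.29),   # class 73
    (0.335, 0.235),                             # noisy sample labeled 73
]
train_y = [61, 61, 61, 73, 73, 73, 73]

query = (0.33, 0.23)
print(knn_predict(train_x, train_y, query, k=1))  # follows the single noisy neighbor
print(knn_predict(train_x, train_y, query, k=3))  # noisy neighbor is outvoted
```

With K=1 the prediction depends entirely on the closest sample, so one mislabeled spectrum can flip the result; a slightly larger K averages that noise out at the cost of some sensitivity to fine class boundaries.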
Funding: Supported by the National Natural Science Foundation of China, Nos. 31760290 and 82160688, and the Key Development Areas Project of Ganzhou Science and Technology, No. 2022B-SF9554 (all to XL).
Abstract: Synaptic pruning is a crucial process in synaptic refinement, eliminating unstable synaptic connections in neural circuits. This process is triggered and regulated primarily by spontaneous neural activity and experience-dependent mechanisms. The pruning process involves multiple molecular signals and a series of regulatory activities governing the “eat me” and “don't eat me” states. Under physiological conditions, the interaction between glial cells and neurons results in the clearance of unnecessary synapses, maintaining normal neural circuit functionality via synaptic pruning. Alterations in genetic and environmental factors can lead to imbalanced synaptic pruning, thus promoting the occurrence and development of autism spectrum disorder, schizophrenia, Alzheimer's disease, and other neurological disorders. In this review, we investigate the molecular mechanisms responsible for synaptic pruning during neural development. We focus on how synaptic pruning can regulate neural circuits and its association with neurological disorders. Furthermore, we discuss the application of emerging optical and imaging technologies to observe synaptic structure and function, as well as their potential for clinical translation. Our aim is to enhance our understanding of synaptic pruning during neural development, including the molecular basis underlying the regulation of synaptic function and the dynamic changes in synaptic density, and to investigate the potential role of these mechanisms in the pathophysiology of neurological diseases, thus providing a theoretical foundation for the treatment of neurological disorders.
Funding: Supported in part by the National Natural Science Foundation of China [62301374], the Hubei Provincial Natural Science Foundation of China [2022CFB804], the Hubei Provincial Education Research Project [B2022057], the Youths Science Foundation of Wuhan Institute of Technology [K202240], and the 15th Graduate Education Innovation Fund of Wuhan Institute of Technology [CX2023295].
Abstract: This paper aims to develop a nonrigid registration method for preoperative and intraoperative thoracoabdominal CT images in computer-assisted interventional surgeries for accurate tumor localization and tissue visualization enhancement. However, fine structure registration of complex thoracoabdominal organs and large deformation registration caused by respiratory motion are challenging. To deal with this problem, we propose a 3D multi-scale attention VoxelMorph (MA-VoxelMorph) registration network. To alleviate the large deformation problem, a multi-scale axial attention mechanism is utilized, using residual dilated pyramid pooling for multi-scale feature extraction and position-aware axial attention for capturing long-distance dependencies between pixels. To further improve the large deformation and fine structure registration results, a multi-scale context channel attention mechanism is employed, utilizing content information via adjacent encoding layers. Our method was evaluated on four public lung datasets (DIR-Lab dataset, Creatis dataset, Learn2Reg dataset, OASIS dataset) and a local dataset. Results showed that the proposed method achieved better registration performance than current state-of-the-art methods, especially in handling the registration of large deformations and fine structures. It also proved to be fast in 3D image registration, taking about 1.5 s, which is faster than most methods. Qualitative and quantitative assessments showed that the proposed MA-VoxelMorph has the potential to realize precise and fast tumor localization in clinical interventional surgeries.
Funding: Funded by the Deanship of Research and Graduate Studies at King Khalid University through small group research under grant number RGP1/278/45.
Abstract: This paper introduces a novel method for medical image retrieval and classification by integrating a multi-scale encoding mechanism with Vision Transformer (ViT) architectures and a dynamic multi-loss function. The multi-scale encoding significantly enhances the model's ability to capture both fine-grained and global features, while the dynamic loss function adapts during training to optimize classification accuracy and retrieval performance. Our approach was evaluated on the ISIC-2018 and ChestX-ray14 datasets, yielding notable improvements. Specifically, on the ISIC-2018 dataset, our method achieves an F1-Score improvement of +4.84% compared to the standard ViT, with a precision increase of +5.46% for melanoma (MEL). On the ChestX-ray14 dataset, the method delivers an F1-Score improvement of 5.3% over the conventional ViT, with precision gains of +5.0% for pneumonia (PNEU) and +5.4% for fibrosis (FIB). Experimental results demonstrate that our approach outperforms traditional CNN-based models and existing ViT variants, particularly in retrieving relevant medical cases and enhancing diagnostic accuracy. These findings highlight the potential of the proposed method for large-scale medical image analysis, offering improved tools for clinical decision-making through superior classification and case comparison.
Abstract: The application of image super-resolution (SR) has brought significant assistance in the medical field, helping doctors make more precise diagnoses. However, relying solely on a convolutional neural network (CNN) for image SR may lead to issues such as blurry details and excessive smoothness. To address these limitations, we propose an algorithm based on the generative adversarial network (GAN) framework. In the generator network, three different sizes of convolutions connected by a residual dense structure are used to extract detailed features, and an attention mechanism combining channel and spatial information is applied to concentrate the computing power on crucial areas. In the discriminator network, using InstanceNorm to normalize tensors speeds up the training process while retaining feature information. The experimental results demonstrate that our algorithm achieves higher peak signal-to-noise ratio (PSNR) and structural similarity index measure (SSIM) compared to other methods, resulting in improved visual quality.
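For readers unfamiliar with the PSNR metric cited above, the following minimal sketch computes it from the mean squared error between a reference image and a reconstructed image. It follows the standard definition, PSNR = 10·log10(MAX² / MSE), and assumes 8-bit grayscale images stored as nested lists; the function name `psnr` and the sample pixel values are hypothetical.

```python
import math


def psnr(reference, test, max_value=255.0):
    """Peak signal-to-noise ratio (in dB) between two equally sized images.

    Images are given as lists of rows of pixel intensities.
    """
    flat_ref = [p for row in reference for p in row]
    flat_test = [p for row in test for p in row]
    if len(flat_ref) != len(flat_test) or not flat_ref:
        raise ValueError("images must be non-empty and have the same size")
    mse = sum((a - b) ** 2 for a, b in zip(flat_ref, flat_test)) / len(flat_ref)
    if mse == 0:
        return math.inf  # identical images
    return 10 * math.log10(max_value ** 2 / mse)


# Hypothetical 2x2 images (not data from the paper).
reference = [[100, 100], [100, 100]]
reconstructed = [[110, 90], [100, 100]]
print(round(psnr(reference, reconstructed), 2))
```

Because PSNR depends only on pixel-wise error, it is usually reported together with a structural measure such as SSIM, as the abstract does.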
Funding: Supported by the National Key R&D Program of China, Nos. 2017YFA0104302 (to NG and XM) and 2017YFA0104304 (to BW and ZZ).
Abstract: Mesenchymal stromal cell transplantation is an effective and promising approach for treating various systemic and diffuse diseases. However, the biological characteristics of transplanted mesenchymal stromal cells in humans remain unclear, including cell viability, distribution, migration, and fate. Conventional cell tracing methods cannot be used in the clinic. The use of superparamagnetic iron oxide nanoparticles as contrast agents allows for the observation of transplanted cells using magnetic resonance imaging. In 2016, the National Medical Products Administration of China approved a new superparamagnetic iron oxide nanoparticle, Ruicun, for use as a contrast agent in clinical trials. In the present study, an acute hemi-transection spinal cord injury model was established in beagle dogs. The injury was then treated by transplantation of Ruicun-labeled mesenchymal stromal cells. The results indicated that Ruicun-labeled mesenchymal stromal cells repaired damaged spinal cord fibers and partially restored neurological function in animals with acute spinal cord injury. T2*-weighted imaging revealed low signal areas on both sides of the injured spinal cord. The results of quantitative susceptibility mapping with ultrashort echo time sequences indicated that Ruicun-labeled mesenchymal stromal cells persisted stably within the injured spinal cord for over 4 weeks. These findings suggest that magnetic resonance imaging has the potential to effectively track the migration of Ruicun-labeled mesenchymal stromal cells and assess their ability to repair spinal cord injury.
Funding: Supported by the National Natural Science Foundation of China, No. 82071909 (to GF), and the Natural Science Foundation of Liaoning Province, No. 2023-MS-07 (to HL).
Abstract: Freezing of gait is a significant and debilitating motor symptom often observed in individuals with Parkinson's disease. Resting-state functional magnetic resonance imaging, along with its multi-level feature indices, has provided a fresh perspective and valuable insight into the study of freezing of gait in Parkinson's disease. It has been revealed that Parkinson's disease is accompanied by widespread irregularities in inherent brain network activity. However, the effective integration of the multi-level indices of resting-state functional magnetic resonance imaging into clinical settings for the diagnosis of freezing of gait in Parkinson's disease remains a challenge. Although previous studies have demonstrated that radiomics can extract optimal features as biomarkers to identify or predict diseases, a knowledge gap still exists in the field of freezing of gait in Parkinson's disease. This cross-sectional study aimed to evaluate the ability of radiomics features based on multi-level indices of resting-state functional magnetic resonance imaging, along with clinical features, to distinguish between Parkinson's disease patients with and without freezing of gait. We recruited 28 patients with Parkinson's disease who had freezing of gait (15 men and 13 women, average age 63 years) and 30 patients with Parkinson's disease who had no freezing of gait (16 men and 14 women, average age 64 years). Magnetic resonance imaging scans were obtained using a 3.0T scanner to extract the mean amplitude of low-frequency fluctuations, mean regional homogeneity, and degree centrality. Neurological and clinical characteristics were also evaluated. We used the least absolute shrinkage and selection operator algorithm to extract features and established feedforward neural network models based solely on resting-state functional magnetic resonance imaging indicators. We then performed predictive analysis of three distinct groups based on resting-state functional magnetic resonance imaging indicators combined with clinical features. Subsequently, we conducted 100 additional five-fold cross-validations to determine the most effective model for each classification task and evaluated the performance of the model using the area under the receiver operating characteristic curve. The results showed that when differentiating patients with Parkinson's disease who had freezing of gait from those who did not have freezing of gait, or from healthy controls, the models using only the mean regional homogeneity values achieved the highest area under the receiver operating characteristic curve values of 0.750 (with an accuracy of 70.9%) and 0.759 (with an accuracy of 65.3%), respectively. When classifying patients with Parkinson's disease who had freezing of gait from those who had no freezing of gait, the model using the mean amplitude of low-frequency fluctuation values combined with two clinical features achieved the highest area under the receiver operating characteristic curve of 0.847 (with an accuracy of 74.3%). The most significant features for patients with Parkinson's disease who had freezing of gait were amplitude of low-frequency fluctuation alterations in the left parahippocampal gyrus and two clinical characteristics: Montreal Cognitive Assessment and Hamilton Depression Scale scores. Our findings suggest that radiomics features derived from resting-state functional magnetic resonance imaging indices and clinical information can serve as valuable indices for the identification of freezing of gait in Parkinson's disease.
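The area under the receiver operating characteristic curve used above to compare models can be read as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case. The sketch below computes it directly from that pairwise definition; it is a generic illustration, not the study's analysis code, and the labels, scores, and function name `roc_auc` are hypothetical.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison of scores.

    labels: 1 for the positive class, 0 for the negative class.
    scores: model outputs, higher meaning "more likely positive".
    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical classifier outputs (not data from the study).
labels = [1, 1, 0, 0]
scores = [0.9, 0.4, 0.5, 0.1]
print(roc_auc(labels, scores))
```

Unlike accuracy, this measure does not depend on a single decision threshold, which helps explain why a model can have the higher AUC but a lower accuracy than another, as in the figures quoted above.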
Funding: Supported by the National Natural Science Foundation of China (62302167, 62477013), the Natural Science Foundation of Shanghai (No. 24ZR1456100), the Science and Technology Commission of Shanghai Municipality (No. 24DZ2305900), and the Shanghai Municipal Special Fund for Promoting High-Quality Development of Industries (2211106).
Abstract: Multi-label image classification is a challenging task due to the diverse sizes and complex backgrounds of objects in images. Obtaining class-specific precise representations at different scales is a key aspect of feature representation. However, existing methods often rely on a single-scale deep feature, neglecting shallow and deeper layer features, which poses challenges when predicting objects of varying scales within the same image. Although some studies have explored multi-scale features, they rarely address the flow of information between scales or efficiently obtain class-specific precise representations for features at different scales. To address these issues, we propose a two-stage, three-branch Transformer-based framework. The first stage incorporates multi-scale image feature extraction and hierarchical scale attention. This design enables the model to consider objects at various scales while enhancing the flow of information across different feature scales, improving the model's generalization to diverse object scales. The second stage includes a global feature enhancement module and a region selection module. The global feature enhancement module strengthens interconnections between different image regions, mitigating the issue of incomplete representations, while the region selection module models the cross-modal relationships between image features and labels. Together, these components enable the efficient acquisition of class-specific precise feature representations. Extensive experiments on public datasets, including COCO2014, VOC2007, and VOC2012, demonstrate the effectiveness of our proposed method. Our approach achieves consistent performance gains of 0.3%, 0.4%, and 0.2% over state-of-the-art methods on the three datasets, respectively. These results validate the reliability and superiority of our approach for multi-label image classification.
Funding: Supported by the Qingdao Huanghai University School-Level Scientific Research Project (2023KJ14), the Undergraduate Teaching Reform Research Project of Shandong Provincial Department of Education (M2022328), the National Natural Science Foundation of China under Grant (42472324), and the Qingdao Postdoctoral Foundation under Grant (QDBSH202402049).
Abstract: Multimodal image fusion plays an important role in image analysis and applications. Multimodal medical image fusion helps to combine contrast features from two or more input imaging modalities to represent fused information in a single image. One of the critical clinical applications of medical image fusion is to fuse anatomical and functional modalities for rapid diagnosis of malignant tissues. This paper proposes a multimodal medical image fusion network (MMIF-Net) based on multiscale hybrid attention. The method first decomposes the original image to obtain the low-rank and significant parts. Then, to utilize the features at different scales, we add a multiscale mechanism that uses three filters of different sizes to extract the features in the encoder network. Also, a hybrid attention module is introduced to obtain more image details. Finally, the fused images are reconstructed by the decoder network. We conducted experiments with clinical brain computed tomography/magnetic resonance images. The experimental results show that the multimodal medical image fusion network method based on multiscale hybrid attention works better than other advanced fusion methods.
Funding: Partly supported by the National Natural Science Foundation of China under Grant 12202476, author Chunhua Wei, https://www.nsfc.gov.cn/.
Abstract: The Pressure Sensitive Paint Technique (PSP) has gained attention in recent years because of its significant benefits in measuring surface pressure on wind tunnel models. However, in the post-processing of PSP images, issues such as pressure taps, paint peeling, and contamination can lead to the loss of pressure data in the image, which seriously affects the subsequent calculation and analysis of pressure distribution. Therefore, image inpainting is particularly important in the post-processing of PSP images. Deep learning offers new methods for PSP image inpainting, but some basic characteristics of convolutional neural networks (CNNs) may limit their ability to handle restoration tasks. By contrast, the self-attention mechanism in the transformer can efficiently model nonlocal relationships among input features by generating adaptive attention scores. We therefore propose an efficient transformer network model for the PSP image inpainting task, named multi-scale dilated attention transformer (D-former). The model exploits the redundancy of global dependency modeling in Vision Transformers (ViTs) to introduce multi-scale dilated attention (MDA). This mechanism effectively models the interaction between localized and sparse patches within the shifted window, achieving a better balance between computational complexity and receptive field. As a result, D-former allows efficient modeling of long-range features while using fewer parameters and lower computational costs. The experiments on two public datasets and the PSP dataset indicate that the method in this article performs better than several advanced methods. As verified in real wind tunnel tests, this method can accurately restore the luminescent intensity data of holes in PSP images, thereby improving the accuracy of full-field pressure data, and has a promising future in practical applications.
Funding: Supported by the Joint Fund of the Ministry of Education for Equipment Pre-research (No. 8091B0203) and the National Key Research and Development Program of China (No. 2020YFC2008700).
Abstract: Computer-aided diagnosis (CAD) can detect tuberculosis (TB) cases, providing radiologists with more accurate and efficient diagnostic solutions. The various kinds of noise in TB chest X-ray (CXR) images are a major challenge in this classification task. This study proposes a high-performance model for TB CXR image detection, named multi-scale input mirror network (MIM-Net), based on CXR image symmetry, which consists of a multi-scale input feature extraction network and mirror loss. The multi-scale image input can enhance feature extraction, while the mirror loss can improve the network performance through self-supervision. We used a publicly available TB CXR image classification dataset to evaluate our proposed method via 5-fold cross-validation, with accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and area under curve (AUC) of 99.67%, 100%, 99.60%, 99.80%, 100%, and 0.9999, respectively. Compared to other models, MIM-Net performed best in all metrics. Therefore, the proposed MIM-Net can effectively help the network learn more features and can be used to detect TB in CXR images, thus assisting doctors in diagnosis.
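The five threshold-based metrics listed in the abstract above all derive from the four cells of a binary confusion matrix. The sketch below shows those standard definitions; the counts are hypothetical and the function name `binary_metrics` is an illustrative choice, not something taken from the paper. It assumes every denominator is non-zero.

```python
def binary_metrics(tp, fp, tn, fn):
    """Standard diagnostic metrics from binary confusion-matrix counts.

    tp/fn: diseased cases predicted positive/negative.
    tn/fp: healthy cases predicted negative/positive.
    Assumes each denominator below is non-zero.
    """
    return {
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        "sensitivity": tp / (tp + fn),   # recall on diseased cases
        "specificity": tn / (tn + fp),   # recall on healthy cases
        "ppv": tp / (tp + fp),           # positive predictive value
        "npv": tn / (tn + fn),           # negative predictive value
    }


# Hypothetical counts (not data from the paper).
for name, value in binary_metrics(tp=90, fp=5, tn=95, fn=10).items():
    print(f"{name}: {value:.4f}")
```

One consequence of these definitions is that a sensitivity of 100% means there are no false negatives, which in turn forces the negative predictive value to 100% as well; the two 100% figures reported above are consistent with that relationship.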
Funding: Supported by the Natural Science Foundation Programme of Gansu Province (No. 24JRRA231), the National Natural Science Foundation of China (No. 62061023), and the Gansu Provincial Science and Technology Plan Key Research and Development Program Project (No. 24YFFA024).
Abstract: Despite its remarkable performance on natural images, the segment anything model (SAM) lacks domain-specific information in medical imaging and faces the challenge of losing local multi-scale information in the encoding phase. This paper presents a medical image segmentation model based on SAM with a local multi-scale feature encoder (LMSFE-SAM) to address these issues. Firstly, based on the SAM, a local multi-scale feature encoder is introduced to improve the representation of features within the local receptive field, thereby supplying the Vision Transformer (ViT) branch in SAM with enriched local multi-scale contextual information. At the same time, a multiaxial Hadamard product module (MHPM) is incorporated into the local multi-scale feature encoder in a lightweight manner to reduce the quadratic complexity and noise interference. Subsequently, a cross-branch balancing adapter is designed to balance the local and global information between the local multi-scale feature encoder and the ViT encoder in SAM. Finally, to obtain a smaller input image size and to mitigate overlapping in patch embeddings, the size of the input image is reduced from 1024×1024 pixels to 256×256 pixels, and a multidimensional information adaptation component is developed, which includes feature adapters, position adapters, and channel-spatial adapters. This component effectively integrates the information from small-sized medical images into SAM, enhancing its suitability for clinical deployment. The proposed model demonstrates an average improvement ranging from 0.0387 to 0.3191 across six objective evaluation metrics on the BUSI, DDTI, and TN3K datasets compared to eight other representative image segmentation models. This significantly enhances the performance of the SAM on medical images, providing clinicians with a powerful tool for clinical diagnosis.
Funding: Project supported by the National Natural Science Foundation of China (Grant Nos. 61601442, 61605218, and 61575207), the National Key Research and Development Program of China (Grant No. 2018YFB0504302), and the Youth Innovation Promotion Association of the Chinese Academy of Sciences (Grant Nos. 2015124 and 2019154).
Abstract: Imaging quality is a critical component of compressive imaging in real applications. In this study, we propose a compressive imaging method based on multi-scale modulation and reconstruction in the spatial frequency domain. Theoretical analysis and simulation show the relation between the measurement matrix resolution and compressive sensing (CS) imaging quality. The matrix design is improved to provide multi-scale modulations, followed by individual reconstruction of images of different spatial frequencies. Compared with traditional single-scale CS imaging, the multi-scale method provides high-quality imaging at both high and low frequencies and effectively decreases the overall reconstruction error. Experimental results confirm the feasibility of this technique, especially at low sampling rates. The method may thus help promote the implementation of compressive imaging in real applications.
Funding: Supported by the National Key R&D Program of China (No. 2022YFB3205101) and NSAF (No. U2230116).
Abstract: To improve image quality under low-illumination conditions, this paper proposes a novel low-light image enhancement method based on multi-illumination estimation and multi-scale fusion (MIMS). First, the illumination is processed by contrast-limited adaptive histogram equalization (CLAHE), an adaptive complementary gamma function (ACG), and an adaptive detail-preserving S-curve (ADPS), respectively, to obtain three components. Then, the fusion-relevant features, exposure, and color contrast are selected as the weight maps. Subsequently, these components and weight maps are combined through multi-scale fusion to generate the enhanced illumination. Finally, the enhanced images are obtained by multiplying the enhanced illumination and the reflectance. Compared with existing approaches, the proposed method achieves an average increase of 0.81% and 2.89% in the structural similarity index measurement (SSIM) and peak signal-to-noise ratio (PSNR), and a decrease of 6.17% and 32.61% in the natural image quality evaluator (NIQE) and gradient magnitude similarity deviation (GMSD), respectively.
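The pipeline described above (split into illumination and reflectance, build several candidate illuminations, fuse them with weights, multiply back) can be sketched for a single pixel as follows. This is a simplified stand-in, not the MIMS method: plain gamma curves replace CLAHE, ACG, and ADPS; the illumination is taken as the maximum RGB channel; only an exposure weight is used; and the fusion is single-scale rather than pyramid-based. The function name `enhance_pixel` and all parameter values are hypothetical.

```python
import math


def enhance_pixel(rgb, gammas=(0.4, 0.6, 0.8), eps=1e-6):
    """Brighten one RGB pixel (floats in [0, 1]) with a Retinex-style split."""
    # Illumination estimate: max channel (a common choice, assumed here).
    illum = max(max(rgb), eps)
    # Reflectance is what remains after dividing out the illumination.
    reflectance = [c / illum for c in rgb]
    # Three candidate illuminations; gamma curves stand in for CLAHE/ACG/ADPS.
    candidates = [illum ** g for g in gammas]
    # Exposure weight: favour candidates close to mid-gray (0.5).
    weights = [math.exp(-((c - 0.5) ** 2) / (2 * 0.25 ** 2)) for c in candidates]
    fused = sum(w * c for w, c in zip(weights, candidates)) / sum(weights)
    # Final step from the abstract: enhanced illumination times reflectance.
    return [min(1.0, fused * r) for r in reflectance]


dark = [0.05, 0.10, 0.02]
bright = enhance_pixel(dark)
print(bright[1] > dark[1])  # True: the pixel is brightened
```

Because every channel is scaled by the same fused illumination, the ratios between channels are unchanged, which is why this kind of split tends to preserve color while lifting brightness.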
Funding: Project supported by the China Outstanding Youth Science Fund Project of the National Natural Science Foundation, "Multi-scale Oil and Gas Seepage Mechanics" (No. 52122402); the Key Project of the National Natural Science Foundation, "Scientific Issues on Efficient Production of Gas Reservoirs with Ultra-deep and Ultra-high Pressure" (No. 52034010); and the Outstanding Youth Fund Project of the Shandong Natural Science Foundation, "Multi-scale Seepage Theory for Unconventional Reservoirs" (No. ZR2022JQ23).
Abstract: An accurate and comprehensive understanding of shale pore structure is fundamental to accurate reserves evaluation and efficient hydrocarbon development. Taking the shale of the Paleogene Eocene Shahejie Formation in the Jiyang Depression, Bohai Bay Basin, as an example, 2D and 3D multi-resolution images of the shale microstructure are obtained by multiple imaging technologies, including X-ray computed tomography, large-field scanning electron microscopy, scanning electron microscopy, and focused ion beam scanning electron microscopy. By integrating image processing and machine learning algorithms, the shale pore structure is characterized at single and multiple scales. The results are as follows. First, the shale pore space in the study area is mainly composed of microfractures, inorganic pores, organic matter, and organic pores, and shows multi-scale characteristics. Second, there are various types of inorganic pores and abundant dissolution pores; organic matter is distributed as strips and patches, and some organic matter contains no organic pores. Third, pores with radius less than 20 nm account for 25%, those with radius between 20 and 50 nm account for 19%, those with radius between 50 and 100 nm account for 29%, those with radius between 100 and 500 nm account for 14%, those with radius between 500 nm and 20 μm account for 11%, and those with radius between 20 and 50 μm account for 2%. Fourth, the organic pores are less connected than the inorganic pores. The connectivity between organic pores and inorganic pores plays a crucial role in hydrocarbon migration, and microfractures control fluid flow channels. Fifth, pores with radius less than 50 nm are dominantly organic pores, those with radius between 50 and 500 nm are mainly organic and inorganic pores, and microfractures mainly contribute to the pores with radius more than 500 nm. It is concluded that a single imaging experiment cannot accurately and comprehensively reveal the multi-scale micropore structure of a shale reservoir. Through integration of multiple imaging technologies and machine learning algorithms, the shale pore structure can be recognized and characterized at both single and multiple scales. The proposed method provides accurate and comprehensive information on multi-scale pore structures.
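As a quick consistency check on the pore-size figures in the third finding, the short sketch below sums the six reported shares and regroups them into the three radius ranges used in the fifth finding (below 50 nm, 50 to 500 nm, above 500 nm). The numbers are taken from the abstract; the bin labels and the helper `group_shares` are illustrative only.

```python
# Pore-radius shares (percent) as reported in the abstract, smallest bin first.
shares = [
    ("< 20 nm", 25),
    ("20-50 nm", 19),
    ("50-100 nm", 29),
    ("100-500 nm", 14),
    ("> 500 nm, lower bin", 11),
    ("> 500 nm, upper bin", 2),
]


def group_shares(bins):
    """Sum the six bins into the three radius ranges of the fifth finding."""
    values = [pct for _, pct in bins]
    return {
        "< 50 nm": sum(values[0:2]),
        "50-500 nm": sum(values[2:4]),
        "> 500 nm": sum(values[4:6]),
    }


grouped = group_shares(shares)
print(sum(pct for _, pct in shares))  # 100
print(grouped)  # {'< 50 nm': 44, '50-500 nm': 43, '> 500 nm': 13}
```

The shares add up to 100%, and the regrouping suggests that the organic-pore-dominated range (below 50 nm) and the mixed range (50 to 500 nm) each hold a little over two fifths of the pores, with the microfracture-dominated range holding the remaining 13%.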
Funding: Supported by the National Natural Science Foundation of China (grant number 41671452).
Abstract: Although the Convolutional Neural Network (CNN) has shown great potential for land cover classification, the frequently used single-scale convolution kernel limits the scope of information extraction. Therefore, we propose a Multi-Scale Fully Convolutional Network (MSFCN) with a multi-scale convolutional kernel, a Channel Attention Block (CAB), and a Global Pooling Module (GPM) to exploit discriminative representations from two-dimensional (2D) satellite images. To explore the ability of the proposed MSFCN on spatio-temporal images, we extend it to three dimensions using a three-dimensional (3D) CNN, which can harness each land cover category's time-series interaction from the reshaped spatio-temporal remote sensing images. To verify the effectiveness of the proposed MSFCN, we conduct experiments on two spatial datasets and two spatio-temporal datasets. The proposed MSFCN achieves an mIoU of 60.366% on the WHDLD dataset and 75.127% on the GID dataset, while the figures for the two spatio-temporal datasets are 87.753% and 77.156%. Extensive comparative experiments and ablation studies demonstrate the effectiveness of the proposed MSFCN.
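The results above are reported as mean intersection-over-union (mIoU). For readers unfamiliar with the metric, the following minimal sketch computes it from flat lists of per-pixel class labels: for each class, IoU is the number of pixels where both truth and prediction carry that class, divided by the number where either does, and mIoU is the average over classes. The function name `mean_iou` and the choice to skip classes absent from both inputs are assumptions; the abstract does not say how the authors handle absent classes.

```python
def mean_iou(y_true, y_pred, num_classes):
    """Mean IoU over classes that appear in the truth or the prediction."""
    ious = []
    for c in range(num_classes):
        inter = sum(1 for t, p in zip(y_true, y_pred) if t == c and p == c)
        union = sum(1 for t, p in zip(y_true, y_pred) if t == c or p == c)
        if union:  # skip classes absent from both (an assumed convention)
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


# Six pixels, three land cover classes (toy labels, not from any dataset).
truth = [0, 0, 1, 1, 2, 2]
pred = [0, 1, 1, 1, 2, 0]
print(round(mean_iou(truth, pred, num_classes=3), 3))  # 0.5
```

In the toy example the per-class IoUs are 1/3, 2/3, and 1/2, so the mean is 0.5; four of six pixels are correct, which shows that mIoU penalizes confusions more heavily than plain pixel accuracy.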
Funding: Opening Foundation of Key Laboratory of Explosive Energy Utilization and Control, Anhui Province (BP20240104); Graduate Innovation Program of China University of Mining and Technology (2024WLJCRCZL049); Postgraduate Research & Practice Innovation Program of Jiangsu Province (KYCX24_2701).
Abstract: Because lightweight, high-strength Ti/Al alloys are difficult to join owing to the considerable disparity in their properties, Al 6063 was proposed as an intermediate layer to fabricate a TC4/Al 6063/Al 7075 three-layer composite plate by explosive welding. The microscopic properties of each bonding interface were elucidated through field emission scanning electron microscopy and electron backscattered diffraction (EBSD). A methodology combining finite element method-smoothed particle hydrodynamics (FEM-SPH) and molecular dynamics (MD) was proposed to analyze the forming and evolution characteristics of explosive welding interfaces at multiple scales. The results demonstrate that the bonding interface morphologies of TC4/Al 6063 and Al 6063/Al 7075 exhibit a flat and wavy configuration, without discernible defects or cracks. Grain refinement is observed in the vicinity of the two bonding interfaces. Furthermore, the degree of plastic deformation of TC4 and Al 7075 is more pronounced than that of Al 6063 in the intermediate layer. The interface morphology characteristics obtained by FEM-SPH simulation are highly similar to the experimental results. MD simulations reveal that the diffusion of interfacial elements predominantly occurs during the unloading phase, and the simulated thickness of interfacial diffusion aligns well with experimental outcomes. Introducing an intermediate layer in the explosive welding process can effectively produce high-quality titanium/aluminum alloy composite plates. Furthermore, this approach offers a multi-scale simulation strategy for the study of explosive welding bonding interfaces.