With the rapid expansion of drone applications, accurate detection of objects in aerial imagery has become crucial for intelligent transportation, urban management, and emergency rescue missions. However, existing methods face numerous challenges in practical deployment, including scale variation handling, feature degradation, and complex backgrounds. To address these issues, we propose Edge-enhanced and Detail-Capturing You Only Look Once (EHDC-YOLO), a novel framework for object detection in Unmanned Aerial Vehicle (UAV) imagery. Based on the You Only Look Once version 11 nano (YOLOv11n) baseline, EHDC-YOLO systematically introduces several architectural enhancements: (1) a Multi-Scale Edge Enhancement (MSEE) module that leverages multi-scale pooling and edge information to enhance boundary feature extraction; (2) an Enhanced Feature Pyramid Network (EFPN) that integrates P2-level features with Cross Stage Partial (CSP) structures and OmniKernel convolutions for better fine-grained representation; and (3) Dynamic Head (DyHead) with multi-dimensional attention mechanisms for enhanced cross-scale modeling and perspective adaptability. Comprehensive experiments on the Vision meets Drones for Detection (VisDrone-DET) 2019 dataset demonstrate that EHDC-YOLO achieves significant improvements, increasing mean Average Precision (mAP)@0.5 from 33.2% to 46.1% (an absolute improvement of 12.9 percentage points) and mAP@0.5:0.95 from 19.5% to 28.0% (an absolute improvement of 8.5 percentage points) compared with the YOLOv11n baseline, while maintaining a reasonable parameter count (2.81 M vs the baseline’s 2.58 M). Further ablation studies confirm the effectiveness of each proposed component, while visualization results highlight EHDC-YOLO’s superior performance in detecting objects and handling occlusions in complex drone scenarios.
Accurate and efficient detection of building changes in remote sensing imagery is crucial for urban planning, disaster emergency response, and resource management. However, existing methods face challenges such as spectral similarity between buildings and backgrounds, sensor variations, and insufficient computational efficiency. To address these challenges, this paper proposes a novel Multi-scale Efficient Wavelet-based Change Detection Network (MewCDNet), which integrates the advantages of Convolutional Neural Networks and Transformers, balances computational costs, and achieves high-performance building change detection. The network employs EfficientNet-B4 as the backbone for hierarchical feature extraction, integrates multi-level feature maps through a multi-scale fusion strategy, and incorporates two key modules: Cross-temporal Difference Detection (CTDD) and Cross-scale Wavelet Refinement (CSWR). CTDD adopts a dual-branch architecture that combines pixel-wise differencing with semantic-aware Euclidean distance weighting to enhance the distinction between true changes and background noise. CSWR integrates Haar-based Discrete Wavelet Transform with multi-head cross-attention mechanisms, enabling cross-scale feature fusion while significantly improving edge localization and suppressing spurious changes. Extensive experiments on four benchmark datasets demonstrate MewCDNet’s superiority over comparison methods: achieving F1 scores of 91.54% on LEVIR, 93.70% on WHUCD, and 64.96% on S2Looking for building change detection. Furthermore, MewCDNet exhibits optimal performance on the multi-class SYSU dataset (F1: 82.71%), highlighting its exceptional generalization capability.
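The Haar-based Discrete Wavelet Transform named in the MewCDNet abstract can be illustrated with a single-level 2D decomposition. The following is a minimal sketch in plain Python, not the authors' CSWR module: the function name `haar_dwt2`, the orthonormal scaling, and the sub-band names are assumptions made for illustration.

```python
def haar_dwt2(img):
    """Single-level 2D Haar DWT of a list-of-lists image.

    Hypothetical helper for illustration only. Height and width must be even.
    Returns four half-resolution sub-bands:
      approx    - local average (low-pass in both directions)
      col_diff  - difference between left and right columns of each 2x2 block
      row_diff  - difference between top and bottom rows of each 2x2 block
      diag_diff - diagonal difference
    """
    h, w = len(img), len(img[0])
    if h % 2 or w % 2:
        raise ValueError("height and width must be even")
    approx, col_diff, row_diff, diag_diff = [], [], [], []
    for i in range(0, h, 2):
        ra, rc, rr, rd = [], [], [], []
        for j in range(0, w, 2):
            a, b = img[i][j], img[i][j + 1]
            c, d = img[i + 1][j], img[i + 1][j + 1]
            # Orthonormal Haar: each coefficient is a signed sum divided by 2.
            ra.append((a + b + c + d) / 2)
            rc.append((a - b + c - d) / 2)
            rr.append((a + b - c - d) / 2)
            rd.append((a - b - c + d) / 2)
        approx.append(ra)
        col_diff.append(rc)
        row_diff.append(rr)
        diag_diff.append(rd)
    return approx, col_diff, row_diff, diag_diff


# A 2x2 patch containing a vertical edge (dark left column, bright right column).
bands = haar_dwt2([[0.0, 1.0], [0.0, 1.0]])
print(bands)
```

In this sketch a vertical edge shows up only in the approximation and column-difference bands, which hints at why wavelet sub-bands are a natural input for edge localization.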
Impact craters are important for understanding the evolution of lunar geology and surface erosion rates, among other functions. However, the morphological characteristics of micro impact craters are not obvious and the craters are numerous, resulting in low detection accuracy by deep learning models. Therefore, we proposed a new multi-scale fusion crater detection algorithm (MSF-CDA) based on YOLO11 to improve the accuracy of lunar impact crater detection, especially for small craters with a diameter of <1 km. Using the images taken by the LROC (Lunar Reconnaissance Orbiter Camera) at the Chang’e-4 (CE-4) landing area, we constructed three separate datasets for craters with diameters of 0-70 m, 70-140 m, and >140 m. We then trained three submodels separately with these three datasets. Additionally, we designed a slicing-amplifying-slicing strategy to enhance the ability to extract features from small craters. To handle redundant predictions, we proposed a new Non-Maximum Suppression with Area Filtering method to fuse the results for overlapping targets across the multi-scale submodels. Finally, our new MSF-CDA method achieved high detection performance, with Precision, Recall, and F1 score values of 0.991, 0.987, and 0.989, respectively, effectively addressing the problems induced by the lesser features and sample imbalance of small craters. Our MSF-CDA can provide strong data support for more in-depth study of the geological evolution of the lunar surface and finer geological age estimations. This strategy can also be used to detect other small objects with lesser features and sample imbalance problems. We detected approximately 500,000 impact craters in an area of approximately 214 km² around the CE-4 landing area. By statistically analyzing the new data, we updated the distribution function of the number and diameter of impact craters. Finally, we identified the most suitable lighting conditions for detecting impact crater targets by analyzing the effect of different lighting conditions on the detection accuracy.
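The reported Precision, Recall, and F1 values for MSF-CDA can be cross-checked with the standard definition of F1 as the harmonic mean of precision and recall. The sketch below uses only numbers quoted in the abstract; the function name `f1_score` and the derived crater density are illustrative, and the density assumes the two "approximately" figures are taken at face value.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall (standard F1 definition)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values quoted in the abstract.
f1 = f1_score(0.991, 0.987)
print(round(f1, 3))  # 0.989, matching the reported F1 score

# Rough mean crater density implied by ~500,000 craters over ~214 km^2.
density_per_km2 = 500_000 / 214
print(round(density_per_km2))
```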
Images taken in dim environments frequently exhibit issues like insufficient brightness, noise, color shifts, and loss of detail. These problems pose significant challenges to dark image enhancement tasks. Current approaches, while effective in global illumination modeling, often struggle to simultaneously suppress noise and preserve structural details, especially under heterogeneous lighting. Furthermore, misalignment between luminance and color channels introduces additional challenges to accurate enhancement. In response to these difficulties, we introduce a single-stage framework, M2ATNet, using multi-scale multi-attention and the Transformer architecture. First, to address the problems of texture blurring and residual noise, we design a multi-scale multi-attention denoising module (MMAD), which is applied separately to the luminance and color channels to enhance the structural and texture modeling capabilities. Second, to solve the non-alignment problem of the luminance and color channels, we introduce the multi-channel feature fusion Transformer (CFFT) module, which effectively recovers the dark details and corrects the color shifts through cross-channel alignment and deep feature interaction. To guide the model to learn more stably and efficiently, we also fuse multiple types of loss functions to form a hybrid loss term. We extensively evaluate the proposed method on various standard datasets, including LOL-v1, LOL-v2, DICM, LIME, and NPE. Evaluation in terms of numerical metrics and visual quality demonstrates that M2ATNet consistently outperforms existing advanced approaches. Ablation studies further confirm the critical roles played by the MMAD and CFFT modules in detail preservation and visual fidelity under challenging illumination-deficient environments.
Because of the challenge of compounding lightweight, high-strength Ti/Al alloys due to their considerable disparity in properties, Al 6063 was proposed as an intermediate layer to fabricate a TC4/Al 6063/Al 7075 three-layer composite plate by explosive welding. The microscopic properties of each bonding interface were elucidated through field emission scanning electron microscopy and electron backscattered diffraction (EBSD). A methodology combining finite element method-smoothed particle hydrodynamics (FEM-SPH) and molecular dynamics (MD) was proposed for multi-scale analysis of the forming and evolution characteristics of explosive welding interfaces. The results demonstrate that the bonding interface morphologies of TC4/Al 6063 and Al 6063/Al 7075 exhibit flat and wavy configurations, without discernible defects or cracks. Grain refinement is observed in the vicinity of the two bonding interfaces. Furthermore, the degree of plastic deformation of TC4 and Al 7075 is more pronounced than that of Al 6063 in the intermediate layer. The interface morphology characteristics obtained by FEM-SPH simulation exhibit a high degree of similarity to the experimental results. MD simulations reveal that the diffusion of interfacial elements predominantly occurs during the unloading phase, and the simulated thickness of interfacial diffusion aligns well with experimental outcomes. The introduction of an intermediate layer in the explosive welding process can effectively produce high-quality titanium/aluminum alloy composite plates. Furthermore, this approach offers a multi-scale simulation strategy for the study of explosive welding bonding interfaces.
Improving the volumetric energy density of supercapacitors is essential for practical applications and relies heavily on the dense storage of ions in carbon-based electrodes. The functional units of carbon-based electrodes exhibit multi-scale structural characteristics, including macroscopic electrode morphologies, mesoscopic microcrystals and pores, and microscopic defects and dopants in the carbon basal plane. Therefore, the ordered combination of multi-scale structures of the carbon electrode is crucial for achieving dense energy storage and high volumetric performance by leveraging the functions of structures at each scale. Considering that previous reviews have focused more on structures of carbon electrodes at a specific scale, this review takes a multi-scale perspective and systematically discusses recent progress regarding the structure-performance relationship, underlying mechanism, and directional design of carbon-based multi-scale structures, including carbon morphology, pore structure, carbon basal plane micro-environment, and electrode technology, with respect to dense energy storage and the volumetric properties of supercapacitors. We analyzed in detail the effects of the morphology, pores, and micro-environment of carbon electrode materials on dense ion storage, summarized the specific effects of structures at different scales on volumetric properties together with recent research progress, and proposed the mutual influence and trade-off relationship between structures at various scales. In addition, the challenges and outlooks for improving the dense storage and volumetric performance of carbon-based supercapacitors are analyzed, which can provide feasible technical reference and guidance for the design and manufacture of dense carbon-based electrode materials.
Prediction of production decline and evaluation of the adsorbed/free gas ratio are critical for determining the lifespan and production status of shale gas wells. Traditional production prediction methods have some shortcomings because of the low permeability and tightness of shale, complex gas flow behavior of multi-scale gas transport regions and multiple gas transport mechanism superpositions, and complex and variable production regimes of shale gas wells. Recent research has demonstrated the existence of a multi-stage isotope fractionation phenomenon during shale gas production, with the fractionation characteristics of each stage associated with the pore structure, gas in place (GIP), adsorption/desorption, and gas production process. This study presents a new approach for estimating shale gas well production and evaluating the adsorbed/free gas ratio throughout production using isotope fractionation techniques. A reservoir-scale carbon isotope fractionation (CIF) model applicable to the production process of shale gas wells was developed for the first time in this research. In contrast to the traditional model, this model improves production prediction accuracy by simultaneously fitting the gas production rate and δ¹³C₁ data and provides a new evaluation method of the adsorbed/free gas ratio during shale gas production. The results indicate that the diffusion and adsorption/desorption properties of rock, the bottom-hole flowing pressure (BHP) of the gas well, and the multi-scale gas transport regions of the reservoir all affect isotope fractionation; the rock diffusion and adsorption/desorption parameters with the greatest effect on isotope fractionation are, in order, D*/D, PL, VL, α, and others. We tested the universality of the four-stage isotope fractionation feature and revealed a unique isotope fractionation mechanism caused by the superimposed coupling of multi-scale gas transport regions during shale gas well production. Finally, we applied the established CIF model to a shale gas well in the Sichuan Basin, China, and calculated the estimated ultimate recovery (EUR) of the well to be 3.33×10⁸ m³; the adsorbed gas ratio during shale gas production was 1.65%, 10.03%, and 23.44% in the first, fifth, and tenth years, respectively. The findings are significant for understanding the isotope fractionation mechanism during natural gas transport in complex systems and for formulating and optimizing unconventional natural gas development strategies.
Coral reef limestone (CRL) constitutes a distinctive marine carbonate formation with complex mechanical properties. This study investigates the multiscale damage and fracture mechanisms of CRL through integrated experimental testing, digital core technology, and theoretical modelling. Two CRL types with contrasting mesostructures were characterized across three scales. Macroscopically, CRL-I and CRL-II exhibited mean compressive strengths of 8.46 and 5.17 MPa, respectively. Mesoscopically, CRL-I featured small-scale highly interconnected pores, whilst CRL-II developed larger stratified pores with diminished connectivity. Microscopically, both CRL matrices demonstrated remarkable similarity in mineral composition and mechanical properties. A novel voxel average-based digital core scaling methodology was developed to facilitate numerical simulation of cross-scale damage processes, revealing network-progressive failure in CRL-I versus directional-brittle failure in CRL-II. Furthermore, a damage statistical constitutive model based on digital core technology and mesoscopic homogenisation theory established quantitative relationships between microelement strength distribution and macroscopic mechanical behavior. These findings illuminate the fundamental mechanisms through which mesoscopic structure governs the macroscopic mechanical properties of CRL.
Convolutional neural network (CNN)-based technologies have been widely used in medical image segmentation because of their strong representation and generalization abilities. However, because CNNs cannot effectively capture global information from images, they can easily lose contours and textures in segmentation results. Notably, the transformer model can effectively capture long-range dependencies in the image, and combining the CNN and the transformer can effectively extract both local details and global contextual features of the image. Motivated by this, we propose a multi-branch and multi-scale attention network (M2ANet) for medical image segmentation, whose architecture consists of three components. Specifically, in the first component, we construct an adaptive multi-branch patch module for parallel extraction of image features to reduce information loss caused by downsampling. In the second component, we apply a residual block to the well-known convolutional block attention module to enhance the network’s ability to recognize important features of images and alleviate gradient vanishing. In the third component, we design a multi-scale feature fusion module, in which we adopt adaptive average pooling and position encoding to enhance contextual features, and then introduce multi-head attention to further enrich feature representation. Finally, we validate the effectiveness and feasibility of the proposed M2ANet method through comparative experiments on four benchmark medical image segmentation datasets, particularly in the context of preserving contours and textures.
A nonlinear multi-scale interaction (NMI) model has been proposed and developed by the first author over nearly 30 years to represent the evolution of atmospheric blocking. In this review paper, we first review the creation and development of the NMI model and then emphasize that the NMI model represents a new tool for identifying the basic physics of how climate change influences mid-to-high latitude weather extremes. The building of the NMI model took place over three main periods. In the 1990s, a nonlinear Schrödinger (NLS) equation model was presented to describe atmospheric blocking as a wave packet; however, it could not depict the lifetime (10-20 days) of atmospheric blocking. In the 2000s, we proposed an NMI model of atmospheric blocking in a uniform basic flow by making a scale-separation assumption and deriving an eddy-forced NLS equation. This model succeeded in describing the life cycle of atmospheric blocking. In the 2020s, the NMI model was extended to include the impact of a changing climate mainly by altering the basic zonal winds and the magnitude of the meridional background potential vorticity gradient (PVy). Model results show that when PVy is smaller, blocking has a weaker dispersion and a stronger nonlinearity, so blocking can be more persistent and have a larger zonal scale and weaker eastward movement, thus favoring stronger weather extremes. However, when PVy is much smaller and below a critical threshold under the much stronger winter Arctic warming associated with global warming, atmospheric blocking becomes locally less persistent and shows a much stronger westward movement, which acts to inhibit local cold extremes. Such a case does not happen in summer under global warming because PVy fails to fall below the critical threshold. Thus, our theory indicates that global warming can render summer-blocking anticyclones and mid-to-high latitude heatwaves more persistent, intense, and widespread.
In recent years, gait-based emotion recognition has been widely applied in the field of computer vision. However, existing gait emotion recognition methods typically rely on complete human skeleton data, and their accuracy significantly declines when the data is occluded. To enhance the accuracy of gait emotion recognition under occlusion, this paper proposes a Multi-scale Suppression Graph Convolutional Network (MS-GCN). The MS-GCN consists of three main components: Joint Interpolation Module (JI Module), Multi-scale Temporal Convolution Network (MS-TCN), and Suppression Graph Convolutional Network (SGCN). The JI Module completes the spatially occluded skeletal joints using the K-Nearest Neighbors (KNN) interpolation method. The MS-TCN employs convolutional kernels of various sizes to comprehensively capture the emotional information embedded in the gait, compensating for the temporal occlusion of gait information. The SGCN extracts more non-prominent human gait features by suppressing the extraction of key body part features, thereby reducing the negative impact of occlusion on emotion recognition results. The proposed method is evaluated on two comprehensive datasets: Emotion-Gait, containing 4227 real gaits from sources like BML, ICT-Pollick, and ELMD, and 1000 synthetic gaits generated using STEP-Gen technology, and ELMB, consisting of 3924 gaits, with 1835 labeled with emotions such as “Happy,” “Sad,” “Angry,” and “Neutral.” On the standard datasets Emotion-Gait and ELMB, the proposed method achieved accuracies of 0.900 and 0.896, respectively, attaining performance comparable to other state-of-the-art methods. Furthermore, on occlusion datasets, the proposed method significantly mitigates the performance degradation caused by occlusion, achieving significantly higher accuracy than other methods.
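The KNN interpolation step attributed to the JI Module can be sketched as follows. The abstract does not say how neighbors are defined, so this example makes a loud assumption: a missing joint position is filled with the mean of that joint's positions in the k temporally nearest frames where it is visible. The function name `knn_fill` and the data layout are hypothetical and are not taken from the paper.

```python
def knn_fill(track, k=2):
    """Fill occluded samples of one joint's 2D trajectory.

    track: list of (x, y) tuples, with None where the joint is occluded.
    Assumption (not from the paper): neighbors are the k frames closest
    in time in which the joint is visible; the fill value is their mean.
    """
    known = [(t, p) for t, p in enumerate(track) if p is not None]
    if not known:
        raise ValueError("joint is never visible, nothing to interpolate from")
    filled = []
    for t, p in enumerate(track):
        if p is not None:
            filled.append(p)
            continue
        # k nearest visible frames by temporal distance (stable on ties).
        nearest = sorted(known, key=lambda tp: abs(tp[0] - t))[:k]
        x = sum(q[0] for _, q in nearest) / len(nearest)
        y = sum(q[1] for _, q in nearest) / len(nearest)
        filled.append((x, y))
    return filled


# The middle frame is occluded and is filled from its two temporal neighbors.
print(knn_fill([(0.0, 0.0), None, (2.0, 2.0)], k=2))
```

A spatial variant (neighbors chosen among other joints or among similar poses) would follow the same pattern with a different distance function.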
The fusion of infrared and visible images should emphasize the salient targets in the infrared image while preserving the textural details of the visible image. To meet these requirements, an autoencoder-based method for infrared and visible image fusion is proposed. The encoder, designed according to the optimization objective, consists of a base encoder and a detail encoder, which are used to extract low-frequency and high-frequency information from the image. This extraction may leave some information uncaptured, so a compensation encoder is proposed to supplement the missing information. Multi-scale decomposition is also employed to extract image features more comprehensively. The decoder combines low-frequency, high-frequency, and supplementary information to obtain multi-scale features. Subsequently, the attention strategy and fusion module are introduced to perform multi-scale fusion for image reconstruction. Experimental results on three datasets show that the fused images generated by this network effectively retain salient targets while being more consistent with human visual perception.
Advanced chemical engineering enables simultaneous modulation of nanomaterial morphology, defects, interfaces, and structure to enhance electromagnetic and microwave absorption (MA) performance. However, accurately distinguishing the MA contributions of different scale factors and tuning the optimal combined effects remains a formidable challenge. This study employs a synergistic approach combining template protection etching and vacuum annealing to construct a controlled system of micrometer-sized cavities and amorphous carbon matrices in metal-organic framework (MOF) derivatives. The results demonstrate that the spatial effects introduced by the hollow structure enhance dielectric loss but significantly weaken impedance matching. By increasing the proportion of amorphous carbon, the balance between electromagnetic loss and impedance matching can be effectively maintained. Importantly, in a suitable graphitization environment, the presence of oxygen vacancies in amorphous carbon can induce significant polarization to compensate for the reduced conductivity loss due to the absence of sp² carbon. Through the synergistic effects of morphology and composition, the samples exhibit a broader absorption bandwidth (6.28 GHz) and stronger reflection loss (−61.64 dB) compared to the original MOF. In conclusion, this study aims to elucidate the multiscale impacts of macroscopic micro-nano structure and microscopic defect engineering, providing valuable insights for future research in this field.
Water electrolyzers play a crucial role in green hydrogen production. However, their efficiency and scalability are often compromised by bubble dynamics across various scales, from nanoscale to macroscale components. This review explores multi-scale modeling as a tool to visualize multi-phase flow and improve mass transport in water electrolyzers. At the nanoscale, molecular dynamics (MD) simulations reveal how electrode surface features and wettability influence nanobubble nucleation and stability. Moving to the mesoscale, models such as volume of fluid (VOF) and lattice Boltzmann method (LBM) shed light on bubble transport in porous transport layers (PTLs). These insights inform innovative designs, including gradient porosity and hydrophilic-hydrophobic patterning, aimed at minimizing gas saturation. At the macroscale, VOF simulations elucidate two-phase flow regimes within channels, showing how flow field geometry and wettability affect bubble discharging. Moreover, artificial intelligence (AI)-driven surrogate models expedite the optimization process, allowing for rapid exploration of structural parameters in channel-rib flow fields and porous flow field designs. By integrating these approaches, we can bridge theoretical insights with experimental validation, ultimately enhancing water electrolyzer performance, reducing costs, and advancing affordable, high-efficiency hydrogen production.
Self-vibrating systems comprised of active materials have great potential for application in the fields of energy harvesting, actuation, bionic instrumentation, and autonomous robotics. However, it is challenging to obtain analytical solutions describing these systems, which hinders analysis and design. In this work, we propose a self-vibrating liquid crystal elastomer (LCE) fiber-spring system exposed to spatially-constant gradient light, and determine analytical solutions for its amplitude and period. First, using a dynamic model of LCE, we obtain the equations governing the self-vibration. Then, we analyze two different motion states and elucidate the mechanism of self-vibration. Subsequently, we derive analytical solutions for the amplitude and frequency using the multi-scale method, and compare the solutions with numerical results. The analytical outcomes are shown to be consistent with the numerical calculations, while taking far less computational time. Our findings reveal the utility of the multi-scale method in describing self-vibration, which may contribute to more efficient and accurate analyses of self-vibrating systems.
Oil and gas pipelines, as linear infrastructure spanning multiple regions, are highly susceptible to geological hazards. Previous research has focused on discrete hazard points, yet failure at a single point can compromise the integrity of the entire system, underscoring a gap in quantitative assessment of systemic risk. This study examines the Guangdong Dapeng liquefied natural gas (LNG) pipeline and proposes a multiscale, coupled risk assessment framework based on a hierarchical “region–section–point” approach. At the regional scale, an information-entropy model maps risk distributions and rapidly flags high-risk sections. At the section scale, UAV-based inspection is combined with in situ monitoring to conduct dynamic slope-stability analysis. At the point scale, monitoring data drive a slope–pipeline coupled finite element model that simulates the pipeline's mechanical response under prospective slope-failure scenarios, enabling early warning of buried-pipeline hazards. Results indicate that the Dapeng pipeline is generally stable. High-risk sections cluster in the Nanshan–Pingshan, Dapeng–Xiasha, and Qingxi–Zhangmutou areas, accounting for 18.9% of the total length. The regional model achieves an area under the ROC curve (AUC) of 0.89, supporting its predictive reliability. UAV inspections detected no visible pipeline damage, and the maximum change in displacement-tangent angle at the Pingshan and Dapeng monitoring sites was −1.83°, consistent with stable slopes. Under current conditions, the Pingshan site remains safe; simulations indicate that damage would initiate near the slope toe when ground displacement reaches about 4 m. Monitoring and simulation exhibit consistent pore-water-pressure trends, with maximum changes of 1.7 Pa and 0.8 Pa during the study period, mutually validating monitoring data quality and simulation credibility. The framework shifts the focus from isolated hazard points to continuous linear segments and provides a methodological basis for systematic, fine-grained geohazard-risk management of linear infrastructure.
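The AUC of 0.89 quoted for the regional model has a simple interpretation: it is the probability that a randomly chosen hazard location receives a higher risk score than a randomly chosen non-hazard location. The sketch below computes AUC from that pairwise definition. The scores and labels are made-up toy values, not data from the study, and `roc_auc` is an illustrative name.

```python
def roc_auc(scores, labels):
    """Area under the ROC curve via pairwise comparison.

    scores: predicted risk scores; labels: 1 for hazard, 0 for non-hazard.
    Ties between a positive and a negative count as half a win.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy example (hypothetical scores): three of the four positive/negative
# pairs are ranked correctly.
print(roc_auc([0.9, 0.8, 0.3, 0.1], [1, 0, 1, 0]))  # 0.75
```

The quadratic pairwise loop is fine for a sketch; for large rasters a rank-based formulation gives the same value more efficiently.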
Segmenting skin lesions is critical for early skin cancer detection. Existing CNN and Transformer-based methods face challenges such as high computational complexity and limited adaptability to variations in lesion sizes. To overcome these limitations, we introduce MSAMamba-UNet, a lightweight model that integrates two novel architectures: Multi-Scale Mamba (MSMamba) and Adaptive Dynamic Gating Block (ADGB). MSMamba utilizes multi-scale decomposition and a parallel hierarchical structure to enhance the delineation of irregular lesion boundaries and sensitivity to small targets. ADGB dynamically selects convolutional kernels with varying receptive fields based on input features, improving the model’s capacity to accommodate diverse lesion textures and scales. Additionally, we introduce a Mix Attention Fusion Block (MAF) to enhance shallow feature representation by integrating parallel channel and pixel attention mechanisms. Extensive evaluation of MSAMamba-UNet on the ISIC 2016, ISIC 2017, and ISIC 2018 datasets demonstrates competitive segmentation accuracy with only 0.056 M parameters and 0.069 GFLOPs. Our experiments revealed that MSAMamba-UNet achieved IoU scores of 85.53%, 85.47%, and 82.22%, as well as DSC scores of 92.20%, 92.17%, and 90.24%, respectively. These results underscore the lightweight design and effectiveness of MSAMamba-UNet.
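The IoU and DSC figures reported for MSAMamba-UNet can be related through the identity DSC = 2·IoU / (1 + IoU), which holds exactly for a single pair of masks. Dataset-level averages need not satisfy it exactly, so the check below is a consistency sketch rather than a derivation of the paper's numbers; `dice_from_iou` is an illustrative name.

```python
def dice_from_iou(iou):
    """Convert IoU (Jaccard index) to the Dice similarity coefficient.

    Exact for one prediction/ground-truth mask pair; only approximate
    when applied to metrics averaged over a dataset.
    """
    return 2 * iou / (1 + iou)


# IoU values quoted in the abstract for ISIC 2016, 2017, and 2018.
for iou in (0.8553, 0.8547, 0.8222):
    print(f"IoU {iou:.4f} -> DSC {100 * dice_from_iou(iou):.2f}%")
# Prints 92.20%, 92.17%, and 90.24%, in line with the reported DSC scores.
```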
Deep learning has been widely used to model soft sensors in modern industrial processes with nonlinear variables and uncertainty. Due to its outstanding ability for high-level feature extraction, the stacked autoencoder (SAE) has been widely used to improve the model accuracy of soft sensors. However, as the number of network layers increases, SAE may encounter serious information loss, which affects the modeling performance of soft sensors. Besides, there are typically very few labeled samples in the data set, which poses challenges for traditional neural networks. In this paper, a multi-scale feature fused stacked autoencoder (MFF-SAE) is suggested for feature representation related to hierarchical output, where stacked autoencoder, mutual information (MI) and multi-scale feature fusion (MFF) strategies are integrated. Based on correlation analysis between output and input variables, critical hidden variables are extracted from the original variables in each autoencoder's input layer and are correspondingly given varying weights. Besides, an integration strategy based on multi-scale feature fusion is adopted to mitigate the impact of information loss as the network layers deepen. Then, the MFF-SAE method is designed and stacked to form deep networks. Two practical industrial processes are utilized to evaluate the performance of MFF-SAE. Results from simulations indicate that, in comparison to other cutting-edge techniques, the proposed method may considerably enhance the accuracy of soft sensor modeling, where the suggested method reduces the root mean square error (RMSE) by 71.8%, 17.1% and 64.7%, 15.1%, respectively.
Globally, diabetic retinopathy (DR) is the primary cause of blindness, affecting millions of people worldwide. This widespread impact underscores the critical need for reliable and precise diagnostic techniques to ensure prompt diagnosis and effective treatment. Deep learning-based automated diagnosis for diabetic retinopathy can facilitate early detection and treatment. However, traditional deep learning models that focus on local views often learn feature representations that are less discriminative at the semantic level. On the other hand, models that focus on global semantic-level information might overlook critical, subtle local pathological features. To address this issue, we propose an adaptive multi-scale feature fusion network called AMSFuse, which can adaptively combine multi-scale global and local features without compromising their individual representation. Specifically, our model incorporates global features for extracting high-level contextual information from retinal images. Concurrently, local features capture fine-grained details, such as microaneurysms, hemorrhages, and exudates, which are critical for DR diagnosis. These global and local features are adaptively fused using a fusion block, followed by an Integrated Attention Mechanism (IAM) that refines the fused features by emphasizing relevant regions, thereby enhancing DR classification accuracy. Our model achieves 86.3% accuracy on the APTOS dataset and 96.6% on RFMiD, both of which are comparable to state-of-the-art methods.
Semantic segmentation plays a foundational role in biomedical image analysis, providing precise information about cellular, tissue, and organ structures in both biological and medical imaging modalities. Traditional approaches often fail in the face of challenges such as low contrast, morphological variability, and densely packed structures. Recent advancements in deep learning have transformed segmentation capabilities through the integration of fine-scale detail preservation, coarse-scale contextual modeling, and multi-scale feature fusion. This work provides a comprehensive analysis of state-of-the-art deep learning models, including U-Net variants, attention-based frameworks, and Transformer-integrated networks, highlighting innovations that improve accuracy, generalizability, and computational efficiency. Key architectural components such as convolution operations, shallow and deep blocks, skip connections, and hybrid encoders are examined for their roles in enhancing spatial representation and semantic consistency. We further discuss the importance of hierarchical and instance-aware segmentation and annotation in interpreting complex biological scenes and multiplexed medical images. By bridging methodological developments with diverse application domains, this paper outlines current trends and future directions for semantic segmentation, emphasizing its critical role in facilitating annotation, diagnosis, and discovery in biomedical research.
Abstract: With the rapid expansion of drone applications, accurate detection of objects in aerial imagery has become crucial for intelligent transportation, urban management, and emergency rescue missions. However, existing methods face numerous challenges in practical deployment, including scale variation handling, feature degradation, and complex backgrounds. To address these issues, we propose Edge-enhanced and Detail-Capturing You Only Look Once (EHDC-YOLO), a novel framework for object detection in Unmanned Aerial Vehicle (UAV) imagery. Based on the You Only Look Once version 11 nano (YOLOv11n) baseline, EHDC-YOLO systematically introduces several architectural enhancements: (1) a Multi-Scale Edge Enhancement (MSEE) module that leverages multi-scale pooling and edge information to enhance boundary feature extraction; (2) an Enhanced Feature Pyramid Network (EFPN) that integrates P2-level features with Cross Stage Partial (CSP) structures and OmniKernel convolutions for better fine-grained representation; and (3) Dynamic Head (DyHead) with multi-dimensional attention mechanisms for enhanced cross-scale modeling and perspective adaptability. Comprehensive experiments on the Vision meets Drones for Detection (VisDrone-DET) 2019 dataset demonstrate that EHDC-YOLO achieves significant improvements, increasing mean Average Precision (mAP)@0.5 from 33.2% to 46.1% (an absolute improvement of 12.9 percentage points) and mAP@0.5:0.95 from 19.5% to 28.0% (an absolute improvement of 8.5 percentage points) compared with the YOLOv11n baseline, while maintaining a reasonable parameter count (2.81 M vs the baseline's 2.58 M). Further ablation studies confirm the effectiveness of each proposed component, while visualization results highlight EHDC-YOLO's superior performance in detecting objects and handling occlusions in complex drone scenarios.
Funding: Supported by the Henan Province Key R&D Project under Grant 241111210400; the Henan Provincial Science and Technology Research Project under Grants 252102211047, 252102211062, 252102211055 and 232102210069; the Jiangsu Provincial Scheme Double Initiative Plan JSS-CBS20230474; the XJTLU RDF-21-02-008; the Science and Technology Innovation Project of Zhengzhou University of Light Industry under Grant 23XNKJTD0205; and the Higher Education Teaching Reform Research and Practice Project of Henan Province under Grant 2024SJGLX0126.
Abstract: Accurate and efficient detection of building changes in remote sensing imagery is crucial for urban planning, disaster emergency response, and resource management. However, existing methods face challenges such as spectral similarity between buildings and backgrounds, sensor variations, and insufficient computational efficiency. To address these challenges, this paper proposes a novel Multi-scale Efficient Wavelet-based Change Detection Network (MewCDNet), which integrates the advantages of Convolutional Neural Networks and Transformers, balances computational costs, and achieves high-performance building change detection. The network employs EfficientNet-B4 as the backbone for hierarchical feature extraction, integrates multi-level feature maps through a multi-scale fusion strategy, and incorporates two key modules: Cross-temporal Difference Detection (CTDD) and Cross-scale Wavelet Refinement (CSWR). CTDD adopts a dual-branch architecture that combines pixel-wise differencing with semantic-aware Euclidean distance weighting to enhance the distinction between true changes and background noise. CSWR integrates Haar-based Discrete Wavelet Transform with multi-head cross-attention mechanisms, enabling cross-scale feature fusion while significantly improving edge localization and suppressing spurious changes. Extensive experiments on four benchmark datasets demonstrate MewCDNet's superiority over comparison methods: achieving F1 scores of 91.54% on LEVIR, 93.70% on WHUCD, and 64.96% on S2Looking for building change detection. Furthermore, MewCDNet exhibits optimal performance on the multi-class SYSU dataset (F1: 82.71%), highlighting its exceptional generalization capability.
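As background for the Haar-based Discrete Wavelet Transform mentioned in the CSWR module, here is a minimal sketch of a one-level orthonormal 2D Haar decomposition on a small array. It illustrates only the generic transform, not the CSWR module or its cross-attention; the function name `haar_dwt2` and the nested-list input are assumptions for illustration, and sub-band naming conventions (LH versus HL) differ between libraries.

```python
def haar_dwt2(img):
    """One-level orthonormal 2D Haar transform of a 2D list with even height and width.

    Returns four half-resolution sub-bands (ll, lh, hl, hh).
    """
    h, w = len(img), len(img[0])
    if h % 2 or w % 2:
        raise ValueError("height and width must be even")
    ll, lh, hl, hh = [], [], [], []
    for i in range(0, h, 2):
        row_ll, row_lh, row_hl, row_hh = [], [], [], []
        for j in range(0, w, 2):
            a, b = img[i][j], img[i][j + 1]          # top-left, top-right
            c, d = img[i + 1][j], img[i + 1][j + 1]  # bottom-left, bottom-right
            row_ll.append((a + b + c + d) / 2)  # local average (low-pass in both directions)
            row_lh.append((a - b + c - d) / 2)  # left-minus-right difference (responds to vertical edges)
            row_hl.append((a + b - c - d) / 2)  # top-minus-bottom difference (responds to horizontal edges)
            row_hh.append((a - b - c + d) / 2)  # diagonal difference
        ll.append(row_ll)
        lh.append(row_lh)
        hl.append(row_hl)
        hh.append(row_hh)
    return ll, lh, hl, hh


if __name__ == "__main__":
    # A 2x2 patch with a vertical edge: energy ends up in ll and lh only.
    print(haar_dwt2([[0, 4], [0, 4]]))
```

Because the transform is orthonormal, the sum of squared coefficients equals the sum of squared pixel values, which is one reason the detail sub-bands are a convenient place to look for edge information.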
Funding: The National Key Research and Development Program of China (Grant No. 2022YFF0711400), which provided valuable financial support and resources for my research and made it possible for me to deeply explore the unknown mysteries in the field of lunar geology; and the National Space Science Data Center Youth Open Project (Grant No. NSSDC2302001), which has not only facilitated the smooth progress of my research, but has also built a platform for me to communicate and cooperate with experts in the field.
Abstract: Impact craters are important for understanding lunar geologic evolution and surface erosion rates, among other functions. However, the morphological characteristics of micro impact craters are not obvious and the craters are numerous, resulting in low detection accuracy by deep learning models. Therefore, we proposed a new multi-scale fusion crater detection algorithm (MSF-CDA) based on YOLO11 to improve the accuracy of lunar impact crater detection, especially for small craters with a diameter of <1 km. Using the images taken by the LROC (Lunar Reconnaissance Orbiter Camera) at the Chang'e-4 (CE-4) landing area, we constructed three separate datasets for craters with diameters of 0-70 m, 70-140 m, and >140 m. We then trained three submodels separately with these three datasets. Additionally, we designed a slicing-amplifying-slicing strategy to enhance the ability to extract features from small craters. To handle redundant predictions, we proposed a new Non-Maximum Suppression with Area Filtering method to fuse the results for overlapping targets across the multi-scale submodels. Finally, our new MSF-CDA method achieved high detection performance, with Precision, Recall, and F1 score values of 0.991, 0.987, and 0.989, respectively, addressing the problems induced by the sparse features and sample imbalance of small craters. Our MSF-CDA can provide strong data support for more in-depth study of the geological evolution of the lunar surface and finer geological age estimations. This strategy can also be used to detect other small objects with sparse features and sample imbalance problems. We detected approximately 500,000 impact craters in an area of approximately 214 km² around the CE-4 landing area. By statistically analyzing the new data, we updated the distribution function of the number and diameter of impact craters. Finally, we identified the most suitable lighting conditions for detecting impact crater targets by analyzing the effect of different lighting conditions on the detection accuracy.
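The abstract does not spell out how Non-Maximum Suppression with Area Filtering works, so the following is only a rough sketch of one plausible reading: detections from a submodel are first restricted to an area range, and the survivors then go through standard greedy NMS. The function names, the `(x1, y1, x2, y2, score)` tuple layout, and the default thresholds are all assumptions for illustration, not the authors' algorithm.

```python
def box_area(b):
    """Area of a box given as (x1, y1, x2, y2, ...)."""
    return max(0.0, b[2] - b[0]) * max(0.0, b[3] - b[1])


def box_iou(a, b):
    """Intersection over union of two axis-aligned boxes."""
    ix = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    union = box_area(a) + box_area(b) - inter
    return inter / union if union > 0 else 0.0


def nms_with_area_filter(dets, iou_thr=0.5, min_area=0.0, max_area=float("inf")):
    """Hypothetical sketch: drop boxes outside [min_area, max_area], then greedy NMS.

    dets is a list of (x1, y1, x2, y2, score) tuples pooled from the submodels.
    """
    candidates = sorted(
        (d for d in dets if min_area <= box_area(d) <= max_area),
        key=lambda d: d[4],
        reverse=True,
    )
    kept = []
    for d in candidates:
        # Keep a box only if it does not overlap strongly with a higher-scoring kept box.
        if all(box_iou(d, k) < iou_thr for k in kept):
            kept.append(d)
    return kept


if __name__ == "__main__":
    dets = [(0, 0, 10, 10, 0.9), (1, 1, 11, 11, 0.8), (20, 20, 22, 22, 0.7), (50, 50, 60, 60, 0.6)]
    print(nms_with_area_filter(dets, iou_thr=0.5, min_area=5))
```

In a multi-scale setup, an area range per submodel would let each submodel contribute only the crater sizes it was trained on before overlapping predictions are merged.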
Funding: Funded by the National Natural Science Foundation of China, grant numbers 52374156 and 62476005.
Abstract: Images taken in dim environments frequently exhibit issues like insufficient brightness, noise, color shifts, and loss of detail. These problems pose significant challenges to dark image enhancement tasks. Current approaches, while effective in global illumination modeling, often struggle to simultaneously suppress noise and preserve structural details, especially under heterogeneous lighting. Furthermore, misalignment between luminance and color channels introduces additional challenges to accurate enhancement. In response to these difficulties, we introduce a single-stage framework, M2ATNet, using multi-scale multi-attention and a Transformer architecture. First, to address the problems of texture blurring and residual noise, we design a multi-scale multi-attention denoising module (MMAD), which is applied separately to the luminance and color channels to enhance the structural and texture modeling capabilities. Second, to solve the misalignment of the luminance and color channels, we introduce the multi-channel feature fusion Transformer (CFFT) module, which effectively recovers dark details and corrects color shifts through cross-channel alignment and deep feature interaction. To guide the model to learn more stably and efficiently, we also fuse multiple types of loss functions to form a hybrid loss term. We extensively evaluate the proposed method on various standard datasets, including LOL-v1, LOL-v2, DICM, LIME, and NPE. Evaluations in terms of numerical metrics and visual quality demonstrate that M2ATNet consistently outperforms existing advanced approaches. Ablation studies further confirm the critical roles played by the MMAD and CFFT modules in detail preservation and visual fidelity under challenging illumination-deficient environments.
Funding: Opening Foundation of Key Laboratory of Explosive Energy Utilization and Control, Anhui Province (BP20240104); Graduate Innovation Program of China University of Mining and Technology (2024WLJCRCZL049); Postgraduate Research & Practice Innovation Program of Jiangsu Province (KYCX24_2701).
Abstract: Because of the challenge of compounding lightweight, high-strength Ti/Al alloys due to their considerable disparity in properties, Al 6063 as an intermediate layer was proposed to fabricate a TC4/Al 6063/Al 7075 three-layer composite plate by explosive welding. The microscopic properties of each bonding interface were elucidated through field emission scanning electron microscopy and electron backscattered diffraction (EBSD). A methodology combining finite element method-smoothed particle hydrodynamics (FEM-SPH) and molecular dynamics (MD) was proposed for the analysis of the forming and evolution characteristics of explosive welding interfaces at multiple scales. The results demonstrate that the bonding interface morphologies of TC4/Al 6063 and Al 6063/Al 7075 exhibit a flat and wavy configuration, without discernible defects or cracks. Grain refinement is observed in the vicinity of the two bonding interfaces. Furthermore, the degree of plastic deformation of TC4 and Al 7075 is more pronounced than that of Al 6063 in the intermediate layer. The interface morphology characteristics obtained by FEM-SPH simulation exhibit a high degree of similarity to the experimental results. MD simulations reveal that the diffusion of interfacial elements predominantly occurs during the unloading phase, and the simulated thickness of interfacial diffusion aligns well with experimental outcomes. The introduction of an intermediate layer in the explosive welding process can effectively produce high-quality titanium/aluminum alloy composite plates. Furthermore, this approach offers a multi-scale simulation strategy for the study of explosive welding bonding interfaces.
Funding: Funded by the Joint Fund for Regional Innovation and Development of the National Natural Science Foundation of China (U21A20143); the National Science Fund for Excellent Young Scholars (52322607); and the Excellent Youth Foundation of Heilongjiang Scientific Committee (YQ2022E028).
Abstract: Improving the volumetric energy density of supercapacitors is essential for practical applications and relies heavily on the dense storage of ions in carbon-based electrodes. The functional units of carbon-based electrodes exhibit multi-scale structural characteristics, including macroscopic electrode morphologies, mesoscopic microcrystals and pores, and microscopic defects and dopants in the carbon basal plane. Therefore, the ordered combination of the multi-scale structures of carbon electrodes is crucial for achieving dense energy storage and high volumetric performance by leveraging the functions of structures at each scale. Considering that previous reviews have focused more on structures of carbon electrodes at a specific scale, this review takes a multi-scale perspective in which recent progress regarding the structure-performance relationship, underlying mechanism and directional design of carbon-based multi-scale structures, including carbon morphology, pore structure, carbon basal plane micro-environment and electrode technology, for dense energy storage and the volumetric properties of supercapacitors is systematically discussed. We analyze in detail the effects of the morphology, pores, and micro-environment of carbon electrode materials on dense ion storage, summarize the specific effects of structures at different scales on volumetric properties together with recent research progress, and propose the mutual influence and trade-off relationships between structures at various scales. In addition, the challenges and outlook for improving the dense storage and volumetric performance of carbon-based supercapacitors are analyzed, which can provide feasible technical reference and guidance for the design and manufacture of dense carbon-based electrode materials.
Funding: Supported by the Natural Science Foundation of China (Grant No. 42302170); National Postdoctoral Innovative Talent Support Program (Grant No. BX20220062); CNPC Innovation Fund (Grant No. 2022DQ02-0104); National Science Foundation of Heilongjiang Province of China (Grant No. YQ2023D001); Postdoctoral Science Foundation of Heilongjiang Province of China (Grant No. LBH-Z22091); and the Natural Science Foundation of Shandong Province (Grant No. ZR2022YQ30).
Abstract: Prediction of production decline and evaluation of the adsorbed/free gas ratio are critical for determining the lifespan and production status of shale gas wells. Traditional production prediction methods have shortcomings because of the low permeability and tightness of shale, the complex gas flow behavior of multi-scale gas transport regions with multiple superimposed gas transport mechanisms, and the complex and variable production regimes of shale gas wells. Recent research has demonstrated the existence of a multi-stage isotope fractionation phenomenon during shale gas production, with the fractionation characteristics of each stage associated with the pore structure, gas in place (GIP), adsorption/desorption, and gas production process. This study presents a new approach for estimating shale gas well production and evaluating the adsorbed/free gas ratio throughout production using isotope fractionation techniques. A reservoir-scale carbon isotope fractionation (CIF) model applicable to the production process of shale gas wells was developed for the first time in this research. In contrast to the traditional model, this model improves production prediction accuracy by simultaneously fitting the gas production rate and δ¹³C₁ data, and provides a new evaluation method for the adsorbed/free gas ratio during shale gas production. The results indicate that the diffusion and adsorption/desorption properties of rock, the bottom-hole flowing pressure (BHP) of the gas well, and the multi-scale gas transport regions of the reservoir all affect isotope fractionation. The diffusion and adsorption/desorption parameters of rock have the greatest effect, ranked in order of influence as D*/D, P_L, V_L, α, and others. We effectively tested the universality of the four-stage isotope fractionation feature and revealed a unique isotope fractionation mechanism caused by the superimposed coupling of multi-scale gas transport regions during shale gas well production. Finally, we applied the established CIF model to a shale gas well in the Sichuan Basin, China, and calculated the estimated ultimate recovery (EUR) of the well to be 3.33×10⁸ m³; the adsorbed gas ratio during shale gas production was 1.65%, 10.03%, and 23.44% in the first, fifth, and tenth years, respectively. The findings are significant for understanding the isotope fractionation mechanism during natural gas transport in complex systems and for formulating and optimizing unconventional natural gas development strategies.
Funding: National Key Research and Development Program of China (No. 2021YFC3100800); the National Natural Science Foundation of China (Nos. 42407235 and 42271026); the Project of Sanya Yazhou Bay Science and Technology City (No. SCKJ-JYRC-2023-54); supported by the Hefei advanced computing center.
Abstract: Coral reef limestone (CRL) constitutes a distinctive marine carbonate formation with complex mechanical properties. This study investigates the multiscale damage and fracture mechanisms of CRL through integrated experimental testing, digital core technology, and theoretical modelling. Two CRL types with contrasting mesostructures were characterized across three scales. Macroscopically, CRL-I and CRL-II exhibited mean compressive strengths of 8.46 and 5.17 MPa, respectively. Mesoscopically, CRL-I featured small-scale highly interconnected pores, whilst CRL-II developed larger stratified pores with diminished connectivity. Microscopically, both CRL matrices demonstrated remarkable similarity in mineral composition and mechanical properties. A novel voxel average-based digital core scaling methodology was developed to facilitate numerical simulation of cross-scale damage processes, revealing network-progressive failure in CRL-I versus directional-brittle failure in CRL-II. Furthermore, a damage statistical constitutive model based on digital core technology and mesoscopic homogenisation theory established quantitative relationships between microelement strength distribution and macroscopic mechanical behavior. These findings illuminate the fundamental mechanisms through which mesoscopic structure governs the macroscopic mechanical properties of CRL.
Funding: Supported by the Natural Science Foundation of the Anhui Higher Education Institutions of China (Grant Nos. 2023AH040149 and 2024AH051915); the Anhui Provincial Natural Science Foundation (Grant No. 2208085MF168); the Science and Technology Innovation Tackle Plan Project of Maanshan (Grant No. 2024RGZN001); and the Scientific Research Fund Project of Anhui Medical University (Grant No. 2023xkj122).
Abstract: Convolutional neural network (CNN)-based technologies have been widely used in medical image segmentation because of their strong representation and generalization abilities. However, because CNNs cannot effectively capture global information from images, they can easily lose contours and textures in segmentation results. Notably, the transformer model can effectively capture long-range dependencies in the image, and combining the CNN and the transformer can effectively extract both local details and global contextual features of the image. Motivated by this, we propose a multi-branch and multi-scale attention network (M2ANet) for medical image segmentation, whose architecture consists of three components. Specifically, in the first component, we construct an adaptive multi-branch patch module for parallel extraction of image features to reduce information loss caused by downsampling. In the second component, we apply a residual block to the well-known convolutional block attention module to enhance the network's ability to recognize important features of images and alleviate gradient vanishing. In the third component, we design a multi-scale feature fusion module, in which we adopt adaptive average pooling and position encoding to enhance contextual features, and then introduce multi-head attention to further enrich feature representation. Finally, we validate the effectiveness and feasibility of the proposed M2ANet method through comparative experiments on four benchmark medical image segmentation datasets, particularly in the context of preserving contours and textures.
Funding: Supported by the National Natural Science Foundation of China (Grant Nos. 42150204 and 2288101); the China National Postdoctoral Program for Innovative Talents (BX20230045); and the China Postdoctoral Science Foundation (2023M730279).
Abstract: A nonlinear multi-scale interaction (NMI) model has been proposed and developed by the first author over nearly 30 years to represent the evolution of atmospheric blocking. In this review paper, we first review the creation and development of the NMI model and then emphasize that the NMI model represents a new tool for identifying the basic physics of how climate change influences mid-to-high latitude weather extremes. The building of the NMI model took place over three main periods. In the 1990s, a nonlinear Schrödinger (NLS) equation model was presented to describe atmospheric blocking as a wave packet; however, it could not depict the lifetime (10-20 days) of atmospheric blocking. In the 2000s, we proposed an NMI model of atmospheric blocking in a uniform basic flow by making a scale-separation assumption and deriving an eddy-forced NLS equation. This model succeeded in describing the life cycle of atmospheric blocking. In the 2020s, the NMI model was extended to include the impact of a changing climate, mainly by altering the basic zonal winds and the magnitude of the meridional background potential vorticity gradient (PVy). Model results show that when PVy is smaller, blocking has a weaker dispersion and a stronger nonlinearity, so blocking can be more persistent and have a larger zonal scale and weaker eastward movement, thus favoring stronger weather extremes. However, when PVy is much smaller and falls below a critical threshold under the much stronger winter Arctic warming associated with global warming, atmospheric blocking becomes locally less persistent and shows a much stronger westward movement, which acts to inhibit local cold extremes. Such a case does not happen in summer under global warming because PVy fails to fall below the critical threshold. Thus, our theory indicates that global warming can render summer blocking anticyclones and mid-to-high latitude heatwaves more persistent, intense, and widespread.
Funding: Supported by the National Natural Science Foundation of China (62272049, 62236006, 62172045) and the Key Projects of Beijing Union University (ZKZD202301).
Abstract: In recent years, gait-based emotion recognition has been widely applied in the field of computer vision. However, existing gait emotion recognition methods typically rely on complete human skeleton data, and their accuracy significantly declines when the data is occluded. To enhance the accuracy of gait emotion recognition under occlusion, this paper proposes a Multi-scale Suppression Graph Convolutional Network (MS-GCN). The MS-GCN consists of three main components: the Joint Interpolation Module (JI Module), the Multi-scale Temporal Convolution Network (MS-TCN), and the Suppression Graph Convolutional Network (SGCN). The JI Module completes the spatially occluded skeletal joints using the K-Nearest Neighbors (KNN) interpolation method. The MS-TCN employs convolutional kernels of various sizes to comprehensively capture the emotional information embedded in the gait, compensating for the temporal occlusion of gait information. The SGCN extracts more non-prominent human gait features by suppressing the extraction of key body part features, thereby reducing the negative impact of occlusion on emotion recognition results. The proposed method is evaluated on two comprehensive datasets: Emotion-Gait, containing 4227 real gaits from sources like BML, ICT-Pollick, and ELMD, and 1000 synthetic gaits generated using STEP-Gen technology, and ELMB, consisting of 3924 gaits, with 1835 labeled with emotions such as "Happy," "Sad," "Angry," and "Neutral." On the standard datasets Emotion-Gait and ELMB, the proposed method achieved accuracies of 0.900 and 0.896, respectively, attaining performance comparable to other state-of-the-art methods. Furthermore, on occlusion datasets, the proposed method significantly mitigates the performance degradation caused by occlusion, achieving significantly higher accuracy than other methods.
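The abstract names KNN interpolation for completing occluded joints but gives no details, so the sketch below shows just one simple way such a fill could work: for each missing joint, average that joint's coordinates over the k frames closest in time in which it is visible. The choice of temporal neighbors, the function name `knn_fill_joints`, the use of `None` for occluded joints, and the default `k` are all assumptions for illustration and may differ from the JI Module.

```python
def knn_fill_joints(frames, k=2):
    """Fill missing joints (None) with the mean of the k temporally nearest visible values.

    frames: list of frames, each a list of joints; a joint is a coordinate tuple or None.
    Returns a new list; joints that are never visible stay None.
    """
    filled = [list(frame) for frame in frames]
    n_joints = len(frames[0]) if frames else 0
    for j in range(n_joints):
        # Frame indices in which joint j was observed.
        known = [t for t, frame in enumerate(frames) if frame[j] is not None]
        if not known:
            continue
        for t, frame in enumerate(frames):
            if frame[j] is not None:
                continue
            # k nearest neighbours in time (ties keep the earlier frame first).
            nearest = sorted(known, key=lambda s: abs(s - t))[:k]
            dim = len(frames[nearest[0]][j])
            filled[t][j] = tuple(
                sum(frames[s][j][d] for s in nearest) / len(nearest) for d in range(dim)
            )
    return filled


if __name__ == "__main__":
    seq = [[(0.0, 0.0)], [None], [(2.0, 4.0)]]
    print(knn_fill_joints(seq, k=2))
```

A spatial variant would instead pick neighbors among similar poses or adjacent joints; which notion of "nearest" the paper uses is not stated in the abstract.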
Funding: Supported by the Henan Province Key Research and Development Project (231111211300); the Central Government of Henan Province Guides Local Science and Technology Development Funds (Z20231811005); Henan Province Key Research and Development Project (231111110100); Henan Provincial Outstanding Foreign Scientist Studio (GZS2024006); and Henan Provincial Joint Fund for Scientific and Technological Research and Development Plan (Application and Overcoming Technical Barriers) (242103810028).
Abstract: The fusion of infrared and visible images should emphasize the salient targets in the infrared image while preserving the textural details of the visible images. To meet these requirements, an autoencoder-based method for infrared and visible image fusion is proposed. The encoder, designed according to the optimization objective, consists of a base encoder and a detail encoder, which are used to extract low-frequency and high-frequency information from the image. This extraction may lead to some information not being captured, so a compensation encoder is proposed to supplement the missing information. Multi-scale decomposition is also employed to extract image features more comprehensively. The decoder combines low-frequency, high-frequency and supplementary information to obtain multi-scale features. Subsequently, the attention strategy and fusion module are introduced to perform multi-scale fusion for image reconstruction. Experimental results on three datasets show that the fused images generated by this network effectively retain salient targets while being more consistent with human visual perception.
Funding: Supported by the National Natural Science Foundation of China (52172091, 52172295); Defense Industrial Technology Development Program (JCKY2023605C002); Frontier Leading Technology Basic Research Major Project of Jiangsu Province (SBK2023050110); the National Key Laboratory on Electromagnetic Environmental Effects and Electro-optical Engineering (No. 61422062301); the Opening Project of Science and Technology on Reliability Physics and Application Technology of Electronic Component Laboratory (ZHD202305); the Opening Project of Jiangsu Key Laboratory of Advanced Structural Materials and Application Technology (ASMA202303); and the Postgraduate Research & Practice Innovation Program of Jiangsu Province (KYCX23_0371).
Abstract: Advanced chemical engineering allows simultaneous modulation of nanomaterial morphology, defects, interfaces, and structure to enhance electromagnetic and microwave absorption (MA) performance. However, accurately distinguishing the MA contributions of different scale factors and tuning the optimal combined effects remains a formidable challenge. This study employs a synergistic approach combining template protection etching and vacuum annealing to construct a controlled system of micrometer-sized cavities and amorphous carbon matrices in metal-organic framework (MOF) derivatives. The results demonstrate that the spatial effects introduced by the hollow structure enhance dielectric loss but significantly weaken impedance matching. By increasing the proportion of amorphous carbon, the balance between electromagnetic loss and impedance matching can be effectively maintained. Importantly, in a suitable graphitization environment, the presence of oxygen vacancies in amorphous carbon can induce significant polarization to compensate for the reduced conductivity loss due to the absence of sp² carbon. Through the synergistic effects of morphology and composition, the samples exhibit a broader absorption bandwidth (6.28 GHz) and stronger reflection loss (−61.64 dB) compared to the original MOF. In conclusion, this study aims to elucidate the multiscale impacts of macroscopic micro-nano structure and microscopic defect engineering, providing valuable insights for future research in this field.
Funding: Supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China (Project No. 15308024) and a grant from the Research Centre for Carbon-Strategic Catalysis, The Hong Kong Polytechnic University (CE2X).
Abstract: Water electrolyzers play a crucial role in green hydrogen production. However, their efficiency and scalability are often compromised by bubble dynamics across various scales, from nanoscale to macroscale components. This review explores multi-scale modeling as a tool to visualize multi-phase flow and improve mass transport in water electrolyzers. At the nanoscale, molecular dynamics (MD) simulations reveal how electrode surface features and wettability influence nanobubble nucleation and stability. Moving to the mesoscale, models such as volume of fluid (VOF) and lattice Boltzmann method (LBM) shed light on bubble transport in porous transport layers (PTLs). These insights inform innovative designs, including gradient porosity and hydrophilic-hydrophobic patterning, aimed at minimizing gas saturation. At the macroscale, VOF simulations elucidate two-phase flow regimes within channels, showing how flow field geometry and wettability affect bubble discharging. Moreover, artificial intelligence (AI)-driven surrogate models expedite the optimization process, allowing for rapid exploration of structural parameters in channel-rib flow fields and porous flow field designs. By integrating these approaches, we can bridge theoretical insights with experimental validation, ultimately enhancing water electrolyzer performance, reducing costs, and advancing affordable, high-efficiency hydrogen production.
Funding: Supported by the National Natural Science Foundation of China (No. 12172001); the University Natural Science Research Project of Anhui Province (No. 2022AH020029); the Anhui Provincial Natural Science Foundation (Nos. 2208085Y01 and 2008085QA23); and the Housing and Urban-Rural Development Science and Technology Project of Anhui Province (No. 2023-YF129), China.
Abstract: Self-vibrating systems comprised of active materials have great potential for application in the fields of energy harvesting, actuation, bionic instrumentation, and autonomous robotics. However, it is challenging to obtain analytical solutions describing these systems, which hinders analysis and design. In this work, we propose a self-vibrating liquid crystal elastomer (LCE) fiber-spring system exposed to spatially-constant gradient light, and determine analytical solutions for its amplitude and period. First, using a dynamic model of LCE, we obtain the equations governing the self-vibration. Then, we analyze two different motion states and elucidate the mechanism of self-vibration. Subsequently, we derive analytical solutions for the amplitude and frequency using the multi-scale method, and compare the solutions with numerical results. The analytical outcomes are shown to be consistent with the numerical calculations, while taking far less computational time. Our findings reveal the utility of the multi-scale method in describing self-vibration, which may contribute to more efficient and accurate analyses of self-vibrating systems.
Funding: Financially supported by the National Natural Science Foundation of China (Grant Nos. U21A2008, 42271075 and 42120104002) and the Research Project of China Railway Eryuan Engineering Group Co. Ltd. (Grant No. KSNQ213011).
Abstract: Oil and gas pipelines, as linear infrastructure spanning multiple regions, are highly susceptible to geological hazards. Previous research has focused on discrete hazard points, yet failure at a single point can compromise the integrity of the entire system, underscoring a gap in quantitative assessment of systemic risk. This study examines the Guangdong Dapeng liquefied natural gas (LNG) pipeline and proposes a multiscale, coupled risk assessment framework based on a hierarchical “region–section–point” approach. At the regional scale, an information-entropy model maps risk distributions and rapidly flags high-risk sections. At the section scale, UAV-based inspection is combined with in situ monitoring to conduct dynamic slope-stability analysis. At the point scale, monitoring data drive a slope–pipeline coupled finite element model that simulates the pipeline's mechanical response under prospective slope-failure scenarios, enabling early warning of buried-pipeline hazards. Results indicate that the Dapeng pipeline is generally stable. High-risk sections cluster in the Nanshan–Pingshan, Dapeng–Xiasha, and Qingxi–Zhangmutou areas, accounting for 18.9% of the total length. The regional model achieves an area under the ROC curve (AUC) of 0.89, supporting its predictive reliability. UAV inspections detected no visible pipeline damage, and the maximum change in displacement-tangent angle at the Pingshan and Dapeng monitoring sites was −1.83°, consistent with stable slopes. Under current conditions, the Pingshan site remains safe; simulations indicate that damage would initiate near the slope toe when ground displacement reaches about 4 m. Monitoring and simulation exhibit consistent pore-water-pressure trends, with maximum changes of 1.7 Pa and 0.8 Pa during the study period, mutually validating monitoring data quality and simulation credibility. The framework shifts the focus from isolated hazard points to continuous linear segments and provides a methodological basis for systematic, fine-grained geohazard-risk management of linear infrastructure.
Funding: Supported in part by the National Natural Science Foundation of China under Grant 62201201 and the Foundation of Henan Educational Committee under Grant 242102211042.
Abstract: Segmenting skin lesions is critical for early skin cancer detection. Existing CNN and Transformer-based methods face challenges such as high computational complexity and limited adaptability to variations in lesion sizes. To overcome these limitations, we introduce MSAMamba-UNet, a lightweight model that integrates two novel architectures: Multi-Scale Mamba (MSMamba) and Adaptive Dynamic Gating Block (ADGB). MSMamba utilizes multi-scale decomposition and a parallel hierarchical structure to enhance the delineation of irregular lesion boundaries and sensitivity to small targets. ADGB dynamically selects convolutional kernels with varying receptive fields based on input features, improving the model’s capacity to accommodate diverse lesion textures and scales. Additionally, we introduce a Mix Attention Fusion Block (MAF) to enhance shallow feature representation by integrating parallel channel and pixel attention mechanisms. Extensive evaluation of MSAMamba-UNet on the ISIC 2016, ISIC 2017, and ISIC 2018 datasets demonstrates competitive segmentation accuracy with only 0.056 M parameters and 0.069 GFLOPs. Our experiments revealed that MSAMamba-UNet achieved IoU scores of 85.53%, 85.47%, and 82.22%, as well as DSC scores of 92.20%, 92.17%, and 90.24%, respectively. These results underscore the lightweight design and effectiveness of MSAMamba-UNet.
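For readers comparing the two reported metrics: on a single pair of binary masks, IoU and the Dice similarity coefficient (DSC) are linked by DSC = 2·IoU / (1 + IoU). The sketch below is illustrative only and is not the authors' evaluation code; the function names and the flat 0/1 mask representation are assumptions made here. The identity holds exactly per mask pair and only approximately for scores averaged over a dataset, although the reported pairs happen to be consistent with it.

```python
def iou_and_dice(pred, target):
    """Compute IoU and Dice for two equal-length flat sequences of 0/1 labels.

    Hypothetical helper for illustration; not taken from the paper.
    """
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of pixels")
    inter = sum(1 for p, t in zip(pred, target) if p and t)
    union = sum(1 for p, t in zip(pred, target) if p or t)
    total = sum(1 for p in pred if p) + sum(1 for t in target if t)
    # Convention assumed here: two empty masks count as a perfect match.
    iou = inter / union if union else 1.0
    dice = 2 * inter / total if total else 1.0
    return iou, dice


def dice_from_iou(iou):
    """Convert an IoU value in [0, 1] to the corresponding Dice value."""
    return 2 * iou / (1 + iou)


if __name__ == "__main__":
    # Reported IoU scores (as fractions) for ISIC 2016, 2017, 2018.
    for iou in (0.8553, 0.8547, 0.8222):
        print(f"IoU {iou:.4f} -> DSC {dice_from_iou(iou):.4f}")
```

Applying `dice_from_iou` to the three reported IoU values reproduces the reported DSC values to two decimal places, which is a quick sanity check rather than a substitute for the per-image evaluation.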
Funding: Supported by the National Key Research and Development Program of China (2023YFB3307800); the National Natural Science Foundation of China (62394343, 62373155); the Major Science and Technology Project of Xinjiang (No. 2022A01006-4); the State Key Laboratory of Industrial Control Technology, China (Grant No. ICT2024A26); and the Fundamental Research Funds for the Central Universities.
Abstract: Deep learning has been widely used to model soft sensors in modern industrial processes with nonlinear variables and uncertainty. Owing to its outstanding ability for high-level feature extraction, the stacked autoencoder (SAE) has been widely used to improve the model accuracy of soft sensors. However, as the number of network layers increases, SAE may suffer serious information loss, which degrades the modeling performance of soft sensors. In addition, the data set typically contains very few labeled samples, which poses a challenge for traditional neural networks. In this paper, a multi-scale feature fused stacked autoencoder (MFF-SAE) is proposed for feature representation related to hierarchical output, in which stacked autoencoder, mutual information (MI), and multi-scale feature fusion (MFF) strategies are integrated. Based on correlation analysis between output and input variables, critical hidden variables are extracted from the original variables in each autoencoder's input layer and are given correspondingly varying weights. In addition, an integration strategy based on multi-scale feature fusion is adopted to mitigate the information loss that accompanies deeper network layers. Then, the MFF-SAE method is designed and stacked to form deep networks. Two practical industrial processes are used to evaluate the performance of MFF-SAE. Simulation results indicate that, in comparison with other cutting-edge techniques, the proposed method may considerably enhance the accuracy of soft sensor modeling, reducing the root mean square error (RMSE) by 71.8%, 17.1% and 64.7%, 15.1%, respectively.
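As a reading aid for the RMSE figures: a percentage reduction of this kind is conventionally computed relative to the baseline model's RMSE. The sketch below shows that arithmetic under this assumption; the function names and the baseline and improved RMSE values are hypothetical and are not taken from the paper, which does not state the underlying RMSE values in the abstract.

```python
import math


def rmse(y_true, y_pred):
    """Root mean square error between two equal-length sequences."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(y_true, y_pred)) / len(y_true))


def relative_reduction(baseline_rmse, new_rmse):
    """Percentage by which new_rmse is lower than baseline_rmse."""
    return 100.0 * (baseline_rmse - new_rmse) / baseline_rmse


if __name__ == "__main__":
    # Hypothetical values chosen only to illustrate a 71.8% reduction.
    baseline, improved = 1.000, 0.282
    print(f"{relative_reduction(baseline, improved):.1f}% lower RMSE")
```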
Funding: Supported by the National Natural Science Foundation of China (No. 62376287); the International Science and Technology Innovation Joint Base of Machine Vision and Medical Image Processing in Hunan Province (2021CB1013); and the Natural Science Foundation of Hunan Province (Nos. 2022JJ30762, 2023JJ70016).
Abstract: Globally, diabetic retinopathy (DR) is the primary cause of blindness, affecting millions of people. This widespread impact underscores the critical need for reliable and precise diagnostic techniques to ensure prompt diagnosis and effective treatment. Deep learning-based automated diagnosis for diabetic retinopathy can facilitate early detection and treatment. However, traditional deep learning models that focus on local views often learn feature representations that are less discriminative at the semantic level. On the other hand, models that focus on global semantic-level information might overlook critical, subtle local pathological features. To address this issue, we propose an adaptive multi-scale feature fusion network called AMSFuse, which can adaptively combine multi-scale global and local features without compromising their individual representation. Specifically, our model incorporates global features for extracting high-level contextual information from retinal images. Concurrently, local features capture fine-grained details, such as microaneurysms, hemorrhages, and exudates, which are critical for DR diagnosis. These global and local features are adaptively fused using a fusion block, followed by an Integrated Attention Mechanism (IAM) that refines the fused features by emphasizing relevant regions, thereby enhancing DR classification accuracy. Our model achieves 86.3% accuracy on the APTOS dataset and 96.6% on RFMiD, both of which are comparable to state-of-the-art methods.
Funding: Open Access funding provided by the National Institutes of Health (NIH). The funding for this project was provided by the NCATS Intramural Fund.
Abstract: Semantic segmentation plays a foundational role in biomedical image analysis, providing precise information about cellular, tissue, and organ structures in both biological and medical imaging modalities. Traditional approaches often fail in the face of challenges such as low contrast, morphological variability, and densely packed structures. Recent advancements in deep learning have transformed segmentation capabilities through the integration of fine-scale detail preservation, coarse-scale contextual modeling, and multi-scale feature fusion. This work provides a comprehensive analysis of state-of-the-art deep learning models, including U-Net variants, attention-based frameworks, and Transformer-integrated networks, highlighting innovations that improve accuracy, generalizability, and computational efficiency. Key architectural components such as convolution operations, shallow and deep blocks, skip connections, and hybrid encoders are examined for their roles in enhancing spatial representation and semantic consistency. We further discuss the importance of hierarchical and instance-aware segmentation and annotation in interpreting complex biological scenes and multiplexed medical images. By bridging methodological developments with diverse application domains, this paper outlines current trends and future directions for semantic segmentation, emphasizing its critical role in facilitating annotation, diagnosis, and discovery in biomedical research.