Infrared-visible image fusion plays an important role in multi-source data fusion,which has the advantage of integrating useful information from multi-source sensors.However,there are still challenges in target enhanc...Infrared-visible image fusion plays an important role in multi-source data fusion,which has the advantage of integrating useful information from multi-source sensors.However,there are still challenges in target enhancement and visual improvement.To deal with these problems,a sub-regional infrared-visible image fusion method(SRF)is proposed.First,morphology and threshold segmentation is applied to extract targets interested in infrared images.Second,the infrared back-ground is reconstructed based on extracted targets and the visible image.Finally,target and back-ground regions are fused using a multi-scale transform.Experimental results are obtained using public data for comparison and evaluation,which demonstrate that the proposed SRF has poten-tial benefits over other methods.展开更多
Deformable retinal image registration is crucial in clinical diagnosis and longitudinal studies of retinal diseases.Most existing deep deformable retinal image registration methods focus on fully convolutional network...Deformable retinal image registration is crucial in clinical diagnosis and longitudinal studies of retinal diseases.Most existing deep deformable retinal image registration methods focus on fully convolutional network(FCN)architecture design,which fails to model long-range dependencies among pixels-a significant factor in deformable retinal image registration.Transformers based on the self-attention mechanism,can capture global context dependencies,complementing local convolution.However,multi-scale spatial feature fusion and pixel-wise position selection are also crucial for the deformable retinal image registration,are often ignored by both FCNs and transformers.To fully leverage the merits of FCNs,multi-scale spatial attention and transformers,we propose a hierarchical hybrid architecture,reparameterized multi-scale transformer(RMFormer),for deformable retinal image registration.In RMFormer,we specifically develop a reparameterized multi-scale spatial attention to adaptively fuse multi-scale spatial features,with the assistance of the reparameterizing technique,thereby highlighting informative pixel-wise positions in a lightweight manner.The experimental results on two publicly available datasets demonstrate the superiority of our RMFormer over state-of-the-art methods and show that it is data-efficient in a limited medical image regime.Additionally,we are the first to provide a visualization analysis to explain how our proposed method affects the deformable retinal image registration process.The source code of our work is available at https://github.com/Tloops/RMFormer.展开更多
This paper introduces a novel method for medical image retrieval and classification by integrating a multi-scale encoding mechanism with Vision Transformer(ViT)architectures and a dynamic multi-loss function.The multi...This paper introduces a novel method for medical image retrieval and classification by integrating a multi-scale encoding mechanism with Vision Transformer(ViT)architectures and a dynamic multi-loss function.The multi-scale encoding significantly enhances the model’s ability to capture both fine-grained and global features,while the dynamic loss function adapts during training to optimize classification accuracy and retrieval performance.Our approach was evaluated on the ISIC-2018 and ChestX-ray14 datasets,yielding notable improvements.Specifically,on the ISIC-2018 dataset,our method achieves an F1-Score improvement of+4.84% compared to the standard ViT,with a precision increase of+5.46% for melanoma(MEL).On the ChestX-ray14 dataset,the method delivers an F1-Score improvement of 5.3%over the conventional ViT,with precision gains of+5.0% for pneumonia(PNEU)and+5.4%for fibrosis(FIB).Experimental results demonstrate that our approach outperforms traditional CNN-based models and existing ViT variants,particularly in retrieving relevant medical cases and enhancing diagnostic accuracy.These findings highlight the potential of the proposedmethod for large-scalemedical image analysis,offering improved tools for clinical decision-making through superior classification and case comparison.展开更多
The capacity to diagnose faults in rolling bearings is of significant practical importance to ensure the normal operation of the equipment.Frequency-domain features can effectively enhance the identification of fault ...The capacity to diagnose faults in rolling bearings is of significant practical importance to ensure the normal operation of the equipment.Frequency-domain features can effectively enhance the identification of fault modes.However,existing methods often suffer from insufficient frequency-domain representation in practical applications,which greatly affects diagnostic performance.Therefore,this paper proposes a rolling bearing fault diagnosismethod based on aMulti-Scale FusionNetwork(MSFN)using the Time-Division Fourier Transform(TDFT).The method constructs multi-scale channels to extract time-domain and frequency-domain features of the signal in parallel.A multi-level,multi-scale filter-based approach is designed to extract frequency-domain features in a segmented manner.A cross-attention mechanism is introduced to facilitate the fusion of the extracted time-frequency domain features.The performance of the proposed method is validated using the CWRU and Ottawa datasets.The results show that the average accuracy of MSFN under complex noisy signals is 97.75%and 94.41%.The average accuracy under variable load conditions is 98.68%.This demonstrates its significant application potential compared to existing methods.展开更多
Segmentation of the retinal vessels in the fundus is crucial for diagnosing ocular diseases.Retinal vessel images often suffer from category imbalance and large scale variations.This ultimately results in incomplete v...Segmentation of the retinal vessels in the fundus is crucial for diagnosing ocular diseases.Retinal vessel images often suffer from category imbalance and large scale variations.This ultimately results in incomplete vessel segmentation and poor continuity.In this study,we propose CT-MFENet to address the aforementioned issues.First,the use of context transformer(CT)allows for the integration of contextual feature information,which helps establish the connection between pixels and solve the problem of incomplete vessel continuity.Second,multi-scale dense residual networks are used instead of traditional CNN to address the issue of inadequate local feature extraction when the model encounters vessels at multiple scales.In the decoding stage,we introduce a local-global fusion module.It enhances the localization of vascular information and reduces the semantic gap between high-and low-level features.To address the class imbalance in retinal images,we propose a hybrid loss function that enhances the segmentation ability of the model for topological structures.We conducted experiments on the publicly available DRIVE,CHASEDB1,STARE,and IOSTAR datasets.The experimental results show that our CT-MFENet performs better than most existing methods,including the baseline U-Net.展开更多
Remote sensing plays a pivotal role in environmental monitoring,disaster relief,and urban planning,where accurate scene classification of aerial images is essential.However,conventional convolutional neural networks(C...Remote sensing plays a pivotal role in environmental monitoring,disaster relief,and urban planning,where accurate scene classification of aerial images is essential.However,conventional convolutional neural networks(CNNs)struggle with long-range dependencies and preserving high-resolution features,limiting their effectiveness in complex aerial image analysis.To address these challenges,we propose a Hybrid HRNet-Swin Transformer model that synergizes the strengths of HRNet-W48 for high-resolution segmentation and the Swin Transformer for global feature extraction.This hybrid architecture ensures robust multi-scale feature fusion,capturing fine-grained details and broader contextual relationships in aerial imagery.Our methodology begins with preprocessing steps,including normalization,histogram equalization,and noise reduction,to enhance input data quality.The HRNet-W48 backbone maintains high-resolution feature maps throughout the network,enabling precise segmentation,while the Swin Transformer leverages hierarchical self-attention to model long-range dependencies efficiently.By integrating these components,our model achieves superior performance in segmentation and classification tasks compared to traditional CNNs and standalone transformer models.We evaluate our approach on two benchmark datasets:UC Merced and WHU-RS19.Experimental results demonstrate that the proposed hybrid model outperforms existing methods,achieving state-of-the-art accuracy while maintaining computational efficiency.Specifically,it excels in preserving fine spatial details and contextual understanding,critical for applications like land-use classification and disaster assessment.展开更多
The Pressure Sensitive Paint Technique(PSP)has gained attention in recent years because of its significant benefits in measuring surface pressure on wind tunnel models.However,in the post-processing process of PSP ima...The Pressure Sensitive Paint Technique(PSP)has gained attention in recent years because of its significant benefits in measuring surface pressure on wind tunnel models.However,in the post-processing process of PSP images,issues such as pressure taps,paint peeling,and contamination can lead to the loss of pressure data on the image,which seriously affects the subsequent calculation and analysis of pressure distribution.Therefore,image inpainting is particularly important in the post-processing process of PSP images.Deep learning offers new methods for PSP image inpainting,but some basic characteristics of convolutional neural networks(CNNs)may limit their ability to handle restoration tasks.By contrast,the self-attention mechanism in the transformer can efficiently model nonlocal relationships among input features by generating adaptive attention scores.As a result,we propose an efficient transformer network model for the PSP image inpainting task,named multi-scale dilated attention transformer(D-former).The model utilizes the redundancy of global dependencies modeling in Vision Transformers(ViTs)to introducemulti-scale dilated attention(MDA),thismechanism effectivelymodels the interaction between localized and sparse patches within the shifted window,achieving a better balance between computational complexity and receptive field.As a result,D-former allows efficient modeling of long-range features while using fewer parameters and lower computational costs.The experiments on two public datasets and the PSP dataset indicate that the method in this article performs better compared to several advancedmethods.Through the verification of real wind tunnel tests,thismethod can accurately restore the luminescent intensity data of holes in PSP images,thereby improving the accuracy of full field pressure data,and has a promising future in practical applications.展开更多
Because of the challenge of compounding lightweight,high-strength Ti/Al alloys due to their considerable disparity in properties,Al 6063 as intermediate layer was proposed to fabricate TC4/Al 6063/Al 7075 three-layer ...Because of the challenge of compounding lightweight,high-strength Ti/Al alloys due to their considerable disparity in properties,Al 6063 as intermediate layer was proposed to fabricate TC4/Al 6063/Al 7075 three-layer composite plate by explosive welding.The microscopic properties of each bonding interface were elucidated through field emission scanning electron microscope and electron backscattered diffraction(EBSD).A methodology combining finite element method-smoothed particle hydrodynamics(FEM-SPH)and molecular dynamics(MD)was proposed for the analysis of the forming and evolution characteristics of explosive welding interfaces at multi-scale.The results demonstrate that the bonding interface morphologies of TC4/Al 6063 and Al 6063/Al 7075 exhibit a flat and wavy configuration,without discernible defects or cracks.The phenomenon of grain refinement is observed in the vicinity of the two bonding interfaces.Furthermore,the degree of plastic deformation of TC4 and Al 7075 is more pronounced than that of Al 6063 in the intermediate layer.The interface morphology characteristics obtained by FEM-SPH simulation exhibit a high degree of similarity to the experimental results.MD simulations reveal that the diffusion of interfacial elements predominantly occurs during the unloading phase,and the simulated thickness of interfacial diffusion aligns well with experimental outcomes.The introduction of intermediate layer in the explosive welding process can effectively produce high-quality titanium/aluminum alloy composite plates.Furthermore,this approach offers a multi-scale simulation strategy for the study of explosive welding bonding interfaces.展开更多
Prediction of production decline and evaluation of the adsorbed/free gas ratio are critical for determining the lifespan and production status of shale gas wells.Traditional production prediction methods have some sho...Prediction of production decline and evaluation of the adsorbed/free gas ratio are critical for determining the lifespan and production status of shale gas wells.Traditional production prediction methods have some shortcomings because of the low permeability and tightness of shale,complex gas flow behavior of multi-scale gas transport regions and multiple gas transport mechanism superpositions,and complex and variable production regimes of shale gas wells.Recent research has demonstrated the existence of a multi-stage isotope fractionation phenomenon during shale gas production,with the fractionation characteristics of each stage associated with the pore structure,gas in place(GIP),adsorption/desorption,and gas production process.This study presents a new approach for estimating shale gas well production and evaluating the adsorbed/free gas ratio throughout production using isotope fractionation techniques.A reservoir-scale carbon isotope fractionation(CIF)model applicable to the production process of shale gas wells was developed for the first time in this research.In contrast to the traditional model,this model improves production prediction accuracy by simultaneously fitting the gas production rate and δ^(13)C_(1) data and provides a new evaluation method of the adsorbed/free gas ratio during shale gas production.The results indicate that the diffusion and adsorption/desorption properties of rock,bottom-hole flowing pressure(BHP)of gas well,and multi-scale gas transport regions of the reservoir all affect isotope fractionation,with the diffusion and adsorption/desorption parameters of rock having the greatest effect on isotope fractionation being D∗/D,PL,VL,α,and others in that order.We effectively tested the universality of the four-stage isotope fractionation feature and revealed a unique isotope fractionation mechanism caused by the superimposed coupling of multi-scale gas transport regions during shale gas well production.Finally,we applied the established CIF model to a shale gas well in the Sichuan Basin,China,and calculated the estimated ultimate recovery(EUR)of the well to be 3.33×10^(8) m^(3);the adsorbed gas ratio during shale gas production was 1.65%,10.03%,and 23.44%in the first,fifth,and tenth years,respectively.The findings are significant for understanding the isotope fractionation mechanism during natural gas transport in complex systems and for formulating and optimizing unconventional natural gas development strategies.展开更多
In recent years,gait-based emotion recognition has been widely applied in the field of computer vision.However,existing gait emotion recognition methods typically rely on complete human skeleton data,and their accurac...In recent years,gait-based emotion recognition has been widely applied in the field of computer vision.However,existing gait emotion recognition methods typically rely on complete human skeleton data,and their accuracy significantly declines when the data is occluded.To enhance the accuracy of gait emotion recognition under occlusion,this paper proposes a Multi-scale Suppression Graph ConvolutionalNetwork(MS-GCN).TheMS-GCN consists of three main components:Joint Interpolation Module(JI Moudle),Multi-scale Temporal Convolution Network(MS-TCN),and Suppression Graph Convolutional Network(SGCN).The JI Module completes the spatially occluded skeletal joints using the(K-Nearest Neighbors)KNN interpolation method.The MS-TCN employs convolutional kernels of various sizes to comprehensively capture the emotional information embedded in the gait,compensating for the temporal occlusion of gait information.The SGCN extracts more non-prominent human gait features by suppressing the extraction of key body part features,thereby reducing the negative impact of occlusion on emotion recognition results.The proposed method is evaluated on two comprehensive datasets:Emotion-Gait,containing 4227 real gaits from sources like BML,ICT-Pollick,and ELMD,and 1000 synthetic gaits generated using STEP-Gen technology,and ELMB,consisting of 3924 gaits,with 1835 labeled with emotions such as“Happy,”“Sad,”“Angry,”and“Neutral.”On the standard datasets Emotion-Gait and ELMB,the proposed method achieved accuracies of 0.900 and 0.896,respectively,attaining performance comparable to other state-ofthe-artmethods.Furthermore,on occlusion datasets,the proposedmethod significantly mitigates the performance degradation caused by occlusion compared to other methods,the accuracy is significantly higher than that of other methods.展开更多
The Ti17(a+β)-Ti17(β)dual alloy-dual property blisk produced using Linear Friction Welding(LFW)is considered as high-performance component in advanced aeroengine.However,up to now,microstructure evolution and relati...The Ti17(a+β)-Ti17(β)dual alloy-dual property blisk produced using Linear Friction Welding(LFW)is considered as high-performance component in advanced aeroengine.However,up to now,microstructure evolution and relationship between microstructure and micro mechanical properties of LFWed Ti17(a+β)/Ti17(β)dissimilar joint have not been thoroughly revealed.In this work,complex analyses of the phase transformation mechanisms of the joint are conducted,and phase transformations in individual zones are correlated to their microhardness and nanohardness.Results reveal that a dissolution occurs under high temperatures encountered during LFW,which reduces microhardness of the joint to that of Ti17(a+β)and Ti17(β).In ThermoMechanically Affected Zone of Ti17(a+β)(TMAZ-(a+β))side joint,a large number of nanocrystalline a phases form with different orientations.This microstructure strengthens significantly by fine grains which balances partial softening effect of a dissolution,and increases nanohardness of a phase and microhardness of TMAZ-(a+β).Superlattice metastableβphase precipitates from metastableβin Weld Zone(WZ)during quick cooling following welding,because of short-range diffusion migration of solute atoms,especiallyβstabilizing elements Mo and Cr.The precipitation of the superlattice metastableβphase results in precipitation strengthening,which in turn increases nanohardness of metastableβand microhardness in WZ.展开更多
The fusion of infrared and visible images should emphasize the salient targets in the infrared image while preserving the textural details of the visible images.To meet these requirements,an autoencoder-based method f...The fusion of infrared and visible images should emphasize the salient targets in the infrared image while preserving the textural details of the visible images.To meet these requirements,an autoencoder-based method for infrared and visible image fusion is proposed.The encoder designed according to the optimization objective consists of a base encoder and a detail encoder,which is used to extract low-frequency and high-frequency information from the image.This extraction may lead to some information not being captured,so a compensation encoder is proposed to supplement the missing information.Multi-scale decomposition is also employed to extract image features more comprehensively.The decoder combines low-frequency,high-frequency and supplementary information to obtain multi-scale features.Subsequently,the attention strategy and fusion module are introduced to perform multi-scale fusion for image reconstruction.Experimental results on three datasets show that the fused images generated by this network effectively retain salient targets while being more consistent with human visual perception.展开更多
A nonlinear multi-scale interaction(NMI)model was proposed and developed by the first author for nearly 30 years to represent the evolution of atmospheric blocking.In this review paper,we first review the creation and...A nonlinear multi-scale interaction(NMI)model was proposed and developed by the first author for nearly 30 years to represent the evolution of atmospheric blocking.In this review paper,we first review the creation and development of the NMI model and then emphasize that the NMI model represents a new tool for identifying the basic physics of how climate change influences mid-to-high latitude weather extremes.The building of the NMI model took place over three main periods.In the 1990s,a nonlinear Schr?dinger(NLS)equation model was presented to describe atmospheric blocking as a wave packet;however,it could not depict the lifetime(10-20 days)of atmospheric blocking.In the 2000s,we proposed an NMI model of atmospheric blocking in a uniform basic flow by making a scale-separation assumption and deriving an eddyforced NLS equation.This model succeeded in describing the life cycle of atmospheric blocking.In the 2020s,the NMI model was extended to include the impact of a changing climate mainly by altering the basic zonal winds and the magnitude of the meridional background potential vorticity gradient(PVy).Model results show that when PVy is smaller,blocking has a weaker dispersion and a stronger nonlinearity,so blocking can be more persistent and have a larger zonal scale and weaker eastward movement,thus favoring stronger weather extremes.However,when PVy is much smaller and below a critical threshold under much stronger winter Arctic warming of global warming,atmospheric blocking becomes locally less persistent and shows a much stronger westward movement,which acts to inhibit local cold extremes.Such a case does not happen in summer under global warming because PVy fails to fall below the critical threshold.Thus,our theory indicates that global warming can render summer-blocking anticyclones and mid-to-high latitude heatwaves more persistent,intense,and widespread.展开更多
Mental health problems and potential psychological crises affect the healthy growth and learning performance of college students.Effective and suitable prevention of psychological crises among college students is a co...Mental health problems and potential psychological crises affect the healthy growth and learning performance of college students.Effective and suitable prevention of psychological crises among college students is a continuous challenge university managers face.To explore a method of preventing psychological crises among college students,we measured 38661 students by using SCL-90(symptom check list-90)and screened out 5790 students with positive results.Then,we measured 33188 students by using PHQ-9(patient health questionnaire-9)and screened out 603 students with suicidal ideation or behavior;we interviewed 392 students by using GAQ(growth adversity questionnaire).The number of students who had positive results at both phases is 155.As a result,we obtained a data set(N=76)by integrating the students who tested positive on the PHQ-9(i.e.total score≥20)with those who completed the PHQ-9 and GAQ.In addition,we obtained a data set(N=50)by excluding the cases in which the GAQ score is 0.With regard to QCA(qualitative comparative analysis)results,the data set(N=76)exhibits 5 constellations of solutions with a coverage rate greater than 0.7,and the first eight indicators of the PHQ-9 constitute the explanatory variables in the combined solutions.About the data set(N=50),the combined solutions are extremely complicated and the explanatory variables encompass indicators from both the PHQ-9 and GAQ.All these mean that the multi-scale could more comprehensively reflect mental health states of college students,thus enhance the accuracy and effectiveness of the corresponding hierarchical intervention,and finally provide support for preventing psychological crises in universities.展开更多
Water electrolyzers play a crucial role in green hydrogen production.However,their efficiency and scalability are often compromised by bubble dynamics across various scales,from nanoscale to macroscale components.This...Water electrolyzers play a crucial role in green hydrogen production.However,their efficiency and scalability are often compromised by bubble dynamics across various scales,from nanoscale to macroscale components.This review explores multi-scale modeling as a tool to visualize multi-phase flow and improve mass transport in water electrolyzers.At the nanoscale,molecular dynamics(MD)simulations reveal how electrode surface features and wettability influence nanobubble nucleation and stability.Moving to the mesoscale,models such as volume of fluid(VOF)and lattice Boltzmann method(LBM)shed light on bubble transport in porous transport layers(PTLs).These insights inform innovative designs,including gradient porosity and hydrophilic-hydrophobic patterning,aimed at minimizing gas saturation.At the macroscale,VOF simulations elucidate two-phase flow regimes within channels,showing how flow field geometry and wettability affect bubble discharging.Moreover,artificial intelligence(AI)-driven surrogate models expedite the optimization process,allowing for rapid exploration of structural parameters in channel-rib flow fields and porous flow field designs.By integrating these approaches,we can bridge theoretical insights with experimental validation,ultimately enhancing water electrolyzer performance,reducing costs,and advancing affordable,high-efficiency hydrogen production.展开更多
Detecting abnormal cervical cells is crucial for early identification and timely treatment of cervical cancer.However,this task is challenging due to the morphological similarities between abnormal and normal cells an...Detecting abnormal cervical cells is crucial for early identification and timely treatment of cervical cancer.However,this task is challenging due to the morphological similarities between abnormal and normal cells and the significant variations in cell size.Pathologists often refer to surrounding cells to identify abnormalities.To emulate this slide examination behavior,this study proposes a Multi-Scale Feature Fusion Network(MSFF-Net)for detecting cervical abnormal cells.MSFF-Net employs a Cross-Scale Pooling Model(CSPM)to effectively capture diverse features and contextual information,ranging from local details to the overall structure.Additionally,a Multi-Scale Fusion Attention(MSFA)module is introduced to mitigate the impact of cell size variations by adaptively fusing local and global information at different scales.To handle the complex environment of cervical cell images,such as cell adhesion and overlapping,the Inner-CIoU loss function is utilized to more precisely measure the overlap between bounding boxes,thereby improving detection accuracy in such scenarios.Experimental results on the Comparison detector dataset demonstrate that MSFF-Net achieves a mean average precision(mAP)of 63.2%,outperforming state-of-the-art methods while maintaining a relatively small number of parameters(26.8 M).This study highlights the effectiveness of multi-scale feature fusion in enhancing the detection of cervical abnormal cells,contributing to more accurate and efficient cervical cancer screening.展开更多
Advanced chemical engineering for simultaneous modulation of nanomaterial morphology, defects, interfaces, and structure to enhance electromagnetic and microwave absorption (MA) performance. However, accurately distin...Advanced chemical engineering for simultaneous modulation of nanomaterial morphology, defects, interfaces, and structure to enhance electromagnetic and microwave absorption (MA) performance. However, accurately distinguishing the MA contributions of different scale factors and tuning the optimal combined effects remains a formidable challenge. This study employs a synergistic approach combining template protection etching and vacuum annealing to construct a controlled system of micrometer-sized cavities and amorphous carbon matrices in metal-organic framework (MOF) derivatives. The results demonstrate that the spatial effects introduced by the hollow structure enhance dielectric loss but significantly weaken impedance matching. By increasing the proportion of amorphous carbon, the balance between electromagnetic loss and impedance matching can be effectively maintained. Importantly, in a suitable graphitization environment, the presence of oxygen vacancies in amorphous carbon can induce significant polarization to compensate for the reduced conductivity loss due to the absence of sp2 carbon. Through the synergistic effects of morphology and composition, the samples exhibit a broader absorption bandwidth (6.28 GHz) and stronger reflection loss (−61.64 dB) compared to the original MOF. In conclusion, this study aims to elucidate the multiscale impacts of macroscopic micro-nano structure and microscopic defect engineering, providing valuable insights for future research in this field.展开更多
With the rapid growth of socialmedia,the spread of fake news has become a growing problem,misleading the public and causing significant harm.As social media content is often composed of both images and text,the use of...With the rapid growth of socialmedia,the spread of fake news has become a growing problem,misleading the public and causing significant harm.As social media content is often composed of both images and text,the use of multimodal approaches for fake news detection has gained significant attention.To solve the problems existing in previous multi-modal fake news detection algorithms,such as insufficient feature extraction and insufficient use of semantic relations between modes,this paper proposes the MFFFND-Co(Multimodal Feature Fusion Fake News Detection with Co-Attention Block)model.First,the model deeply explores the textual content,image content,and frequency domain features.Then,it employs a Co-Attention mechanism for cross-modal fusion.Additionally,a semantic consistency detectionmodule is designed to quantify semantic deviations,thereby enhancing the performance of fake news detection.Experimentally verified on two commonly used datasets,Twitter and Weibo,the model achieved F1 scores of 90.0% and 94.0%,respectively,significantly outperforming the pre-modified MFFFND(Multimodal Feature Fusion Fake News Detection with Attention Block)model and surpassing other baseline models.This improves the accuracy of detecting fake information in artificial intelligence detection and engineering software detection.展开更多
This paper aims to develop a nonrigid registration method of preoperative and intraoperative thoracoabdominal CT images in computer-assisted interventional surgeries for accurate tumor localization and tissue visualiz...This paper aims to develop a nonrigid registration method of preoperative and intraoperative thoracoabdominal CT images in computer-assisted interventional surgeries for accurate tumor localization and tissue visualization enhancement.However,fine structure registration of complex thoracoabdominal organs and large deformation registration caused by respiratory motion is challenging.To deal with this problem,we propose a 3D multi-scale attention VoxelMorph(MAVoxelMorph)registration network.To alleviate the large deformation problem,a multi-scale axial attention mechanism is utilized by using a residual dilated pyramid pooling for multi-scale feature extraction,and position-aware axial attention for long-distance dependencies between pixels capture.To further improve the large deformation and fine structure registration results,a multi-scale context channel attention mechanism is employed utilizing content information via adjacent encoding layers.Our method was evaluated on four public lung datasets(DIR-Lab dataset,Creatis dataset,Learn2Reg dataset,OASIS dataset)and a local dataset.Results proved that the proposed method achieved better registration performance than current state-of-the-art methods,especially in handling the registration of large deformations and fine structures.It also proved to be fast in 3D image registration,using about 1.5 s,and faster than most methods.Qualitative and quantitative assessments proved that the proposed MA-VoxelMorph has the potential to realize precise and fast tumor localization in clinical interventional surgeries.展开更多
Improving the volumetric energy density of supercapacitors is essential for practical applications,which highly relies on the dense storage of ions in carbon-based electrodes.The functional units of carbon-based elect...Improving the volumetric energy density of supercapacitors is essential for practical applications,which highly relies on the dense storage of ions in carbon-based electrodes.The functional units of carbon-based electrode exhibit multi-scale structural characteristics including macroscopic electrode morphologies,mesoscopic microcrystals and pores,and microscopic defects and dopants in the carbon basal plane.Therefore,the ordered combination of multi-scale structures of carbon electrode is crucial for achieving dense energy storage and high volumetric performance by leveraging the functions of various scale structu re.Considering that previous reviews have focused more on the discussion of specific scale structu re of carbon electrodes,this review takes a multi-scale perspective in which recent progresses regarding the structureperformance relationship,underlying mechanism and directional design of carbon-based multi-scale structures including carbon morphology,pore structure,carbon basal plane micro-environment and electrode technology on dense energy storage and volumetric property of supercapacitors are systematically discussed.We analyzed in detail the effects of the morphology,pore,and micro-environment of carbon electrode materials on ion dense storage,summarized the specific effects of different scale structures on volumetric property and recent research progress,and proposed the mutual influence and trade-off relationship between various scale structures.In addition,the challenges and outlooks for improving the dense storage and volumetric performance of carbon-based supercapacitors are analyzed,which can provide feasible technical reference and guidance for the design and manufacture of dense carbon-based electrode materials.展开更多
基金supported by the China Postdoctoral Science Foundation Funded Project(No.2021M690385)the National Natural Science Foundation of China(No.62101045).
文摘Infrared-visible image fusion plays an important role in multi-source data fusion,which has the advantage of integrating useful information from multi-source sensors.However,there are still challenges in target enhancement and visual improvement.To deal with these problems,a sub-regional infrared-visible image fusion method(SRF)is proposed.First,morphology and threshold segmentation is applied to extract targets interested in infrared images.Second,the infrared back-ground is reconstructed based on extracted targets and the visible image.Finally,target and back-ground regions are fused using a multi-scale transform.Experimental results are obtained using public data for comparison and evaluation,which demonstrate that the proposed SRF has poten-tial benefits over other methods.
基金supported in part by the National Natural Science Foundation of China(No.82272086)the Leading Goose Program of Zhejiang,China(No.2023C03079)the Shenzhen Natural Science Fund,China(No.JCYJ20200109140820699).
文摘Deformable retinal image registration is crucial in clinical diagnosis and longitudinal studies of retinal diseases.Most existing deep deformable retinal image registration methods focus on fully convolutional network(FCN)architecture design,which fails to model long-range dependencies among pixels-a significant factor in deformable retinal image registration.Transformers based on the self-attention mechanism,can capture global context dependencies,complementing local convolution.However,multi-scale spatial feature fusion and pixel-wise position selection are also crucial for the deformable retinal image registration,are often ignored by both FCNs and transformers.To fully leverage the merits of FCNs,multi-scale spatial attention and transformers,we propose a hierarchical hybrid architecture,reparameterized multi-scale transformer(RMFormer),for deformable retinal image registration.In RMFormer,we specifically develop a reparameterized multi-scale spatial attention to adaptively fuse multi-scale spatial features,with the assistance of the reparameterizing technique,thereby highlighting informative pixel-wise positions in a lightweight manner.The experimental results on two publicly available datasets demonstrate the superiority of our RMFormer over state-of-the-art methods and show that it is data-efficient in a limited medical image regime.Additionally,we are the first to provide a visualization analysis to explain how our proposed method affects the deformable retinal image registration process.The source code of our work is available at https://github.com/Tloops/RMFormer.
基金funded by the Deanship of Research and Graduate Studies at King Khalid University through small group research under grant number RGP1/278/45.
文摘This paper introduces a novel method for medical image retrieval and classification by integrating a multi-scale encoding mechanism with Vision Transformer(ViT)architectures and a dynamic multi-loss function.The multi-scale encoding significantly enhances the model’s ability to capture both fine-grained and global features,while the dynamic loss function adapts during training to optimize classification accuracy and retrieval performance.Our approach was evaluated on the ISIC-2018 and ChestX-ray14 datasets,yielding notable improvements.Specifically,on the ISIC-2018 dataset,our method achieves an F1-Score improvement of+4.84% compared to the standard ViT,with a precision increase of+5.46% for melanoma(MEL).On the ChestX-ray14 dataset,the method delivers an F1-Score improvement of 5.3%over the conventional ViT,with precision gains of+5.0% for pneumonia(PNEU)and+5.4%for fibrosis(FIB).Experimental results demonstrate that our approach outperforms traditional CNN-based models and existing ViT variants,particularly in retrieving relevant medical cases and enhancing diagnostic accuracy.These findings highlight the potential of the proposedmethod for large-scalemedical image analysis,offering improved tools for clinical decision-making through superior classification and case comparison.
基金fully supported by the Frontier Exploration Projects of Longmen Laboratory(No.LMQYTSKT034)Key Research and Development and Promotion of Special(Science and Technology)Project of Henan Province,China(No.252102210158)。
文摘The capacity to diagnose faults in rolling bearings is of significant practical importance to ensure the normal operation of the equipment.Frequency-domain features can effectively enhance the identification of fault modes.However,existing methods often suffer from insufficient frequency-domain representation in practical applications,which greatly affects diagnostic performance.Therefore,this paper proposes a rolling bearing fault diagnosismethod based on aMulti-Scale FusionNetwork(MSFN)using the Time-Division Fourier Transform(TDFT).The method constructs multi-scale channels to extract time-domain and frequency-domain features of the signal in parallel.A multi-level,multi-scale filter-based approach is designed to extract frequency-domain features in a segmented manner.A cross-attention mechanism is introduced to facilitate the fusion of the extracted time-frequency domain features.The performance of the proposed method is validated using the CWRU and Ottawa datasets.The results show that the average accuracy of MSFN under complex noisy signals is 97.75%and 94.41%.The average accuracy under variable load conditions is 98.68%.This demonstrates its significant application potential compared to existing methods.
基金the National Natural Science Foundation of China(No.62266025)。
文摘Segmentation of the retinal vessels in the fundus is crucial for diagnosing ocular diseases.Retinal vessel images often suffer from category imbalance and large scale variations.This ultimately results in incomplete vessel segmentation and poor continuity.In this study,we propose CT-MFENet to address the aforementioned issues.First,the use of context transformer(CT)allows for the integration of contextual feature information,which helps establish the connection between pixels and solve the problem of incomplete vessel continuity.Second,multi-scale dense residual networks are used instead of traditional CNN to address the issue of inadequate local feature extraction when the model encounters vessels at multiple scales.In the decoding stage,we introduce a local-global fusion module.It enhances the localization of vascular information and reduces the semantic gap between high-and low-level features.To address the class imbalance in retinal images,we propose a hybrid loss function that enhances the segmentation ability of the model for topological structures.We conducted experiments on the publicly available DRIVE,CHASEDB1,STARE,and IOSTAR datasets.The experimental results show that our CT-MFENet performs better than most existing methods,including the baseline U-Net.
基金supported by the ITP(Institute of Information&Communications Technology Planning&Evaluation)-ICAN(ICT Challenge and Advanced Network of HRD)(ITP-2025-RS-2022-00156326,33)grant funded by the Korea government(Ministry of Science and ICT)the Deanship of Research and Graduate Studies at King Khalid University for funding this work through the Large Group Project under grant number(RGP2/568/45)the Deanship of Scientific Research at Northern Border University,Arar,Saudi Arabia,for funding this research work through the Project Number"NBU-FFR-2025-231-03".
文摘Remote sensing plays a pivotal role in environmental monitoring,disaster relief,and urban planning,where accurate scene classification of aerial images is essential.However,conventional convolutional neural networks(CNNs)struggle with long-range dependencies and preserving high-resolution features,limiting their effectiveness in complex aerial image analysis.To address these challenges,we propose a Hybrid HRNet-Swin Transformer model that synergizes the strengths of HRNet-W48 for high-resolution segmentation and the Swin Transformer for global feature extraction.This hybrid architecture ensures robust multi-scale feature fusion,capturing fine-grained details and broader contextual relationships in aerial imagery.Our methodology begins with preprocessing steps,including normalization,histogram equalization,and noise reduction,to enhance input data quality.The HRNet-W48 backbone maintains high-resolution feature maps throughout the network,enabling precise segmentation,while the Swin Transformer leverages hierarchical self-attention to model long-range dependencies efficiently.By integrating these components,our model achieves superior performance in segmentation and classification tasks compared to traditional CNNs and standalone transformer models.We evaluate our approach on two benchmark datasets:UC Merced and WHU-RS19.Experimental results demonstrate that the proposed hybrid model outperforms existing methods,achieving state-of-the-art accuracy while maintaining computational efficiency.Specifically,it excels in preserving fine spatial details and contextual understanding,critical for applications like land-use classification and disaster assessment.
基金partly supported by the National Natural Science Foundation of China under Grant 12202476,author Chunhua Wei,https://www.nsfc.gov.cn/.
文摘The Pressure Sensitive Paint Technique(PSP)has gained attention in recent years because of its significant benefits in measuring surface pressure on wind tunnel models.However,in the post-processing process of PSP images,issues such as pressure taps,paint peeling,and contamination can lead to the loss of pressure data on the image,which seriously affects the subsequent calculation and analysis of pressure distribution.Therefore,image inpainting is particularly important in the post-processing process of PSP images.Deep learning offers new methods for PSP image inpainting,but some basic characteristics of convolutional neural networks(CNNs)may limit their ability to handle restoration tasks.By contrast,the self-attention mechanism in the transformer can efficiently model nonlocal relationships among input features by generating adaptive attention scores.As a result,we propose an efficient transformer network model for the PSP image inpainting task,named multi-scale dilated attention transformer(D-former).The model utilizes the redundancy of global dependencies modeling in Vision Transformers(ViTs)to introducemulti-scale dilated attention(MDA),thismechanism effectivelymodels the interaction between localized and sparse patches within the shifted window,achieving a better balance between computational complexity and receptive field.As a result,D-former allows efficient modeling of long-range features while using fewer parameters and lower computational costs.The experiments on two public datasets and the PSP dataset indicate that the method in this article performs better compared to several advancedmethods.Through the verification of real wind tunnel tests,thismethod can accurately restore the luminescent intensity data of holes in PSP images,thereby improving the accuracy of full field pressure data,and has a promising future in practical applications.
基金Opening Foundation of Key Laboratory of Explosive Energy Utilization and Control,Anhui Province(BP20240104)Graduate Innovation Program of China University of Mining and Technology(2024WLJCRCZL049)Postgraduate Research&Practice Innovation Program of Jiangsu Province(KYCX24_2701)。
文摘Because of the challenge of compounding lightweight,high-strength Ti/Al alloys due to their considerable disparity in properties,Al 6063 as intermediate layer was proposed to fabricate TC4/Al 6063/Al 7075 three-layer composite plate by explosive welding.The microscopic properties of each bonding interface were elucidated through field emission scanning electron microscope and electron backscattered diffraction(EBSD).A methodology combining finite element method-smoothed particle hydrodynamics(FEM-SPH)and molecular dynamics(MD)was proposed for the analysis of the forming and evolution characteristics of explosive welding interfaces at multi-scale.The results demonstrate that the bonding interface morphologies of TC4/Al 6063 and Al 6063/Al 7075 exhibit a flat and wavy configuration,without discernible defects or cracks.The phenomenon of grain refinement is observed in the vicinity of the two bonding interfaces.Furthermore,the degree of plastic deformation of TC4 and Al 7075 is more pronounced than that of Al 6063 in the intermediate layer.The interface morphology characteristics obtained by FEM-SPH simulation exhibit a high degree of similarity to the experimental results.MD simulations reveal that the diffusion of interfacial elements predominantly occurs during the unloading phase,and the simulated thickness of interfacial diffusion aligns well with experimental outcomes.The introduction of intermediate layer in the explosive welding process can effectively produce high-quality titanium/aluminum alloy composite plates.Furthermore,this approach offers a multi-scale simulation strategy for the study of explosive welding bonding interfaces.
基金supported by the Natural Science Foundation of China(Grant No.42302170)National Postdoctoral Innovative Talent Support Program(Grant No.BX20220062)+3 种基金CNPC Innovation Found(Grant No.2022DQ02-0104)National Science Foundation of Heilongjiang Province of China(Grant No.YQ2023D001)Postdoctoral Science Foundation of Heilongjiang Province of China(Grant No.LBH-Z22091)the Natural Science Foundation of Shandong Province(Grant No.ZR2022YQ30).
文摘Prediction of production decline and evaluation of the adsorbed/free gas ratio are critical for determining the lifespan and production status of shale gas wells.Traditional production prediction methods have some shortcomings because of the low permeability and tightness of shale,complex gas flow behavior of multi-scale gas transport regions and multiple gas transport mechanism superpositions,and complex and variable production regimes of shale gas wells.Recent research has demonstrated the existence of a multi-stage isotope fractionation phenomenon during shale gas production,with the fractionation characteristics of each stage associated with the pore structure,gas in place(GIP),adsorption/desorption,and gas production process.This study presents a new approach for estimating shale gas well production and evaluating the adsorbed/free gas ratio throughout production using isotope fractionation techniques.A reservoir-scale carbon isotope fractionation(CIF)model applicable to the production process of shale gas wells was developed for the first time in this research.In contrast to the traditional model,this model improves production prediction accuracy by simultaneously fitting the gas production rate and δ^(13)C_(1) data and provides a new evaluation method of the adsorbed/free gas ratio during shale gas production.The results indicate that the diffusion and adsorption/desorption properties of rock,bottom-hole flowing pressure(BHP)of gas well,and multi-scale gas transport regions of the reservoir all affect isotope fractionation,with the diffusion and adsorption/desorption parameters of rock having the greatest effect on isotope fractionation being D∗/D,PL,VL,α,and others in that order.We effectively tested the universality of the four-stage isotope fractionation feature and revealed a unique isotope fractionation mechanism caused by the superimposed coupling of multi-scale gas transport regions during shale gas well production.Finally,we applied the established CIF model to a shale gas well in the Sichuan Basin,China,and calculated the estimated ultimate recovery(EUR)of the well to be 3.33×10^(8) m^(3);the adsorbed gas ratio during shale gas production was 1.65%,10.03%,and 23.44%in the first,fifth,and tenth years,respectively.The findings are significant for understanding the isotope fractionation mechanism during natural gas transport in complex systems and for formulating and optimizing unconventional natural gas development strategies.
基金supported by the National Natural Science Foundation of China(62272049,62236006,62172045)the Key Projects of Beijing Union University(ZKZD202301).
文摘In recent years,gait-based emotion recognition has been widely applied in the field of computer vision.However,existing gait emotion recognition methods typically rely on complete human skeleton data,and their accuracy significantly declines when the data is occluded.To enhance the accuracy of gait emotion recognition under occlusion,this paper proposes a Multi-scale Suppression Graph ConvolutionalNetwork(MS-GCN).TheMS-GCN consists of three main components:Joint Interpolation Module(JI Moudle),Multi-scale Temporal Convolution Network(MS-TCN),and Suppression Graph Convolutional Network(SGCN).The JI Module completes the spatially occluded skeletal joints using the(K-Nearest Neighbors)KNN interpolation method.The MS-TCN employs convolutional kernels of various sizes to comprehensively capture the emotional information embedded in the gait,compensating for the temporal occlusion of gait information.The SGCN extracts more non-prominent human gait features by suppressing the extraction of key body part features,thereby reducing the negative impact of occlusion on emotion recognition results.The proposed method is evaluated on two comprehensive datasets:Emotion-Gait,containing 4227 real gaits from sources like BML,ICT-Pollick,and ELMD,and 1000 synthetic gaits generated using STEP-Gen technology,and ELMB,consisting of 3924 gaits,with 1835 labeled with emotions such as“Happy,”“Sad,”“Angry,”and“Neutral.”On the standard datasets Emotion-Gait and ELMB,the proposed method achieved accuracies of 0.900 and 0.896,respectively,attaining performance comparable to other state-ofthe-artmethods.Furthermore,on occlusion datasets,the proposedmethod significantly mitigates the performance degradation caused by occlusion compared to other methods,the accuracy is significantly higher than that of other methods.
基金supported by the National Science and Technology Major Project,China(No.2017-VII-0005-0098)the National Natural Science Foundation of China(No.52105400)+1 种基金the State Key Laboratory of Solidification Processing,China(No.2021-TS-07)the Innovation Foundation for Doctor Dissertation of Northwestern Polytechnical University,China(No.CX2023008)。
文摘The Ti17(a+β)-Ti17(β)dual alloy-dual property blisk produced using Linear Friction Welding(LFW)is considered as high-performance component in advanced aeroengine.However,up to now,microstructure evolution and relationship between microstructure and micro mechanical properties of LFWed Ti17(a+β)/Ti17(β)dissimilar joint have not been thoroughly revealed.In this work,complex analyses of the phase transformation mechanisms of the joint are conducted,and phase transformations in individual zones are correlated to their microhardness and nanohardness.Results reveal that a dissolution occurs under high temperatures encountered during LFW,which reduces microhardness of the joint to that of Ti17(a+β)and Ti17(β).In ThermoMechanically Affected Zone of Ti17(a+β)(TMAZ-(a+β))side joint,a large number of nanocrystalline a phases form with different orientations.This microstructure strengthens significantly by fine grains which balances partial softening effect of a dissolution,and increases nanohardness of a phase and microhardness of TMAZ-(a+β).Superlattice metastableβphase precipitates from metastableβin Weld Zone(WZ)during quick cooling following welding,because of short-range diffusion migration of solute atoms,especiallyβstabilizing elements Mo and Cr.The precipitation of the superlattice metastableβphase results in precipitation strengthening,which in turn increases nanohardness of metastableβand microhardness in WZ.
基金Supported by the Henan Province Key Research and Development Project(231111211300)the Central Government of Henan Province Guides Local Science and Technology Development Funds(Z20231811005)+2 种基金Henan Province Key Research and Development Project(231111110100)Henan Provincial Outstanding Foreign Scientist Studio(GZS2024006)Henan Provincial Joint Fund for Scientific and Technological Research and Development Plan(Application and Overcoming Technical Barriers)(242103810028)。
文摘The fusion of infrared and visible images should emphasize the salient targets in the infrared image while preserving the textural details of the visible images.To meet these requirements,an autoencoder-based method for infrared and visible image fusion is proposed.The encoder designed according to the optimization objective consists of a base encoder and a detail encoder,which is used to extract low-frequency and high-frequency information from the image.This extraction may lead to some information not being captured,so a compensation encoder is proposed to supplement the missing information.Multi-scale decomposition is also employed to extract image features more comprehensively.The decoder combines low-frequency,high-frequency and supplementary information to obtain multi-scale features.Subsequently,the attention strategy and fusion module are introduced to perform multi-scale fusion for image reconstruction.Experimental results on three datasets show that the fused images generated by this network effectively retain salient targets while being more consistent with human visual perception.
基金supported by the National Natural Science Foundation of China(Grant Nos.42150204 and 2288101)supported by the China National Postdoctoral Program for Innovative Talents(BX20230045)the China Postdoctoral Science Foundation(2023M730279)。
文摘A nonlinear multi-scale interaction(NMI)model was proposed and developed by the first author for nearly 30 years to represent the evolution of atmospheric blocking.In this review paper,we first review the creation and development of the NMI model and then emphasize that the NMI model represents a new tool for identifying the basic physics of how climate change influences mid-to-high latitude weather extremes.The building of the NMI model took place over three main periods.In the 1990s,a nonlinear Schr?dinger(NLS)equation model was presented to describe atmospheric blocking as a wave packet;however,it could not depict the lifetime(10-20 days)of atmospheric blocking.In the 2000s,we proposed an NMI model of atmospheric blocking in a uniform basic flow by making a scale-separation assumption and deriving an eddyforced NLS equation.This model succeeded in describing the life cycle of atmospheric blocking.In the 2020s,the NMI model was extended to include the impact of a changing climate mainly by altering the basic zonal winds and the magnitude of the meridional background potential vorticity gradient(PVy).Model results show that when PVy is smaller,blocking has a weaker dispersion and a stronger nonlinearity,so blocking can be more persistent and have a larger zonal scale and weaker eastward movement,thus favoring stronger weather extremes.However,when PVy is much smaller and below a critical threshold under much stronger winter Arctic warming of global warming,atmospheric blocking becomes locally less persistent and shows a much stronger westward movement,which acts to inhibit local cold extremes.Such a case does not happen in summer under global warming because PVy fails to fall below the critical threshold.Thus,our theory indicates that global warming can render summer-blocking anticyclones and mid-to-high latitude heatwaves more persistent,intense,and widespread.
文摘Mental health problems and potential psychological crises affect the healthy growth and learning performance of college students.Effective and suitable prevention of psychological crises among college students is a continuous challenge university managers face.To explore a method of preventing psychological crises among college students,we measured 38661 students by using SCL-90(symptom check list-90)and screened out 5790 students with positive results.Then,we measured 33188 students by using PHQ-9(patient health questionnaire-9)and screened out 603 students with suicidal ideation or behavior;we interviewed 392 students by using GAQ(growth adversity questionnaire).The number of students who had positive results at both phases is 155.As a result,we obtained a data set(N=76)by integrating the students who tested positive on the PHQ-9(i.e.total score≥20)with those who completed the PHQ-9 and GAQ.In addition,we obtained a data set(N=50)by excluding the cases in which the GAQ score is 0.With regard to QCA(qualitative comparative analysis)results,the data set(N=76)exhibits 5 constellations of solutions with a coverage rate greater than 0.7,and the first eight indicators of the PHQ-9 constitute the explanatory variables in the combined solutions.About the data set(N=50),the combined solutions are extremely complicated and the explanatory variables encompass indicators from both the PHQ-9 and GAQ.All these mean that the multi-scale could more comprehensively reflect mental health states of college students,thus enhance the accuracy and effectiveness of the corresponding hierarchical intervention,and finally provide support for preventing psychological crises in universities.
基金supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region,China(Project No.15308024)a grant from Research Centre for Carbon-Strategic Catalysis,The Hong Kong Polytechnic University(CE2X).
文摘Water electrolyzers play a crucial role in green hydrogen production.However,their efficiency and scalability are often compromised by bubble dynamics across various scales,from nanoscale to macroscale components.This review explores multi-scale modeling as a tool to visualize multi-phase flow and improve mass transport in water electrolyzers.At the nanoscale,molecular dynamics(MD)simulations reveal how electrode surface features and wettability influence nanobubble nucleation and stability.Moving to the mesoscale,models such as volume of fluid(VOF)and lattice Boltzmann method(LBM)shed light on bubble transport in porous transport layers(PTLs).These insights inform innovative designs,including gradient porosity and hydrophilic-hydrophobic patterning,aimed at minimizing gas saturation.At the macroscale,VOF simulations elucidate two-phase flow regimes within channels,showing how flow field geometry and wettability affect bubble discharging.Moreover,artificial intelligence(AI)-driven surrogate models expedite the optimization process,allowing for rapid exploration of structural parameters in channel-rib flow fields and porous flow field designs.By integrating these approaches,we can bridge theoretical insights with experimental validation,ultimately enhancing water electrolyzer performance,reducing costs,and advancing affordable,high-efficiency hydrogen production.
基金funded by the China Chongqing Municipal Science and Technology Bureau,grant numbers 2024TIAD-CYKJCXX0121,2024NSCQ-LZX0135Chongqing Municipal Commission of Housing and Urban-Rural Development,grant number CKZ2024-87+3 种基金the Chongqing University of Technology graduate education high-quality development project,grant number gzlsz202401the Chongqing University of Technology-Chongqing LINGLUE Technology Co.,Ltd.,Electronic Information(Artificial Intelligence)graduate joint training basethe Postgraduate Education and Teaching Reform Research Project in Chongqing,grant number yjg213116the Chongqing University of Technology-CISDI Chongqing Information Technology Co.,Ltd.,Computer Technology graduate joint training base.
文摘Detecting abnormal cervical cells is crucial for early identification and timely treatment of cervical cancer.However,this task is challenging due to the morphological similarities between abnormal and normal cells and the significant variations in cell size.Pathologists often refer to surrounding cells to identify abnormalities.To emulate this slide examination behavior,this study proposes a Multi-Scale Feature Fusion Network(MSFF-Net)for detecting cervical abnormal cells.MSFF-Net employs a Cross-Scale Pooling Model(CSPM)to effectively capture diverse features and contextual information,ranging from local details to the overall structure.Additionally,a Multi-Scale Fusion Attention(MSFA)module is introduced to mitigate the impact of cell size variations by adaptively fusing local and global information at different scales.To handle the complex environment of cervical cell images,such as cell adhesion and overlapping,the Inner-CIoU loss function is utilized to more precisely measure the overlap between bounding boxes,thereby improving detection accuracy in such scenarios.Experimental results on the Comparison detector dataset demonstrate that MSFF-Net achieves a mean average precision(mAP)of 63.2%,outperforming state-of-the-art methods while maintaining a relatively small number of parameters(26.8 M).This study highlights the effectiveness of multi-scale feature fusion in enhancing the detection of cervical abnormal cells,contributing to more accurate and efficient cervical cancer screening.
基金supported by the National Natural Science Foundation of China(52172091,52172295)Defense Industrial Technology Development Program(JCKY2023605C002)+4 种基金Frontier Leading Technology Basic Research Major Project of Jiangsu Province(SBK2023050110)the National Key Laboratory on Electromagnetic Environmental Effects and Electro-optical Engineering(NO.61422062301)the Opening Project of Science and Technology on Reliability Physics and Application Technology of Electronic Component Laboratory(ZHD202305)the Opening Project of Jiangsu Key Laboratory of Advanced Structural Materials and Application Technology(ASMA202303)the Postgraduate Research&Practice Innovation Program of Jiangsu Province(KYCX23_0371).
文摘Advanced chemical engineering for simultaneous modulation of nanomaterial morphology, defects, interfaces, and structure to enhance electromagnetic and microwave absorption (MA) performance. However, accurately distinguishing the MA contributions of different scale factors and tuning the optimal combined effects remains a formidable challenge. This study employs a synergistic approach combining template protection etching and vacuum annealing to construct a controlled system of micrometer-sized cavities and amorphous carbon matrices in metal-organic framework (MOF) derivatives. The results demonstrate that the spatial effects introduced by the hollow structure enhance dielectric loss but significantly weaken impedance matching. By increasing the proportion of amorphous carbon, the balance between electromagnetic loss and impedance matching can be effectively maintained. Importantly, in a suitable graphitization environment, the presence of oxygen vacancies in amorphous carbon can induce significant polarization to compensate for the reduced conductivity loss due to the absence of sp2 carbon. Through the synergistic effects of morphology and composition, the samples exhibit a broader absorption bandwidth (6.28 GHz) and stronger reflection loss (−61.64 dB) compared to the original MOF. In conclusion, this study aims to elucidate the multiscale impacts of macroscopic micro-nano structure and microscopic defect engineering, providing valuable insights for future research in this field.
基金supported by Communication University of China(HG23035)partly supported by the Fundamental Research Funds for the Central Universities(CUC230A013).
文摘With the rapid growth of socialmedia,the spread of fake news has become a growing problem,misleading the public and causing significant harm.As social media content is often composed of both images and text,the use of multimodal approaches for fake news detection has gained significant attention.To solve the problems existing in previous multi-modal fake news detection algorithms,such as insufficient feature extraction and insufficient use of semantic relations between modes,this paper proposes the MFFFND-Co(Multimodal Feature Fusion Fake News Detection with Co-Attention Block)model.First,the model deeply explores the textual content,image content,and frequency domain features.Then,it employs a Co-Attention mechanism for cross-modal fusion.Additionally,a semantic consistency detectionmodule is designed to quantify semantic deviations,thereby enhancing the performance of fake news detection.Experimentally verified on two commonly used datasets,Twitter and Weibo,the model achieved F1 scores of 90.0% and 94.0%,respectively,significantly outperforming the pre-modified MFFFND(Multimodal Feature Fusion Fake News Detection with Attention Block)model and surpassing other baseline models.This improves the accuracy of detecting fake information in artificial intelligence detection and engineering software detection.
基金supported in part by the National Natural Science Foundation of China[62301374]Hubei Provincial Natural Science Foundation of China[2022CFB804]+2 种基金Hubei Provincial Education Research Project[B2022057]the Youths Science Foundation of Wuhan Institute of Technology[K202240]the 15th Graduate Education Innovation Fund of Wuhan Institute of Technology[CX2023295].
文摘This paper aims to develop a nonrigid registration method of preoperative and intraoperative thoracoabdominal CT images in computer-assisted interventional surgeries for accurate tumor localization and tissue visualization enhancement.However,fine structure registration of complex thoracoabdominal organs and large deformation registration caused by respiratory motion is challenging.To deal with this problem,we propose a 3D multi-scale attention VoxelMorph(MAVoxelMorph)registration network.To alleviate the large deformation problem,a multi-scale axial attention mechanism is utilized by using a residual dilated pyramid pooling for multi-scale feature extraction,and position-aware axial attention for long-distance dependencies between pixels capture.To further improve the large deformation and fine structure registration results,a multi-scale context channel attention mechanism is employed utilizing content information via adjacent encoding layers.Our method was evaluated on four public lung datasets(DIR-Lab dataset,Creatis dataset,Learn2Reg dataset,OASIS dataset)and a local dataset.Results proved that the proposed method achieved better registration performance than current state-of-the-art methods,especially in handling the registration of large deformations and fine structures.It also proved to be fast in 3D image registration,using about 1.5 s,and faster than most methods.Qualitative and quantitative assessments proved that the proposed MA-VoxelMorph has the potential to realize precise and fast tumor localization in clinical interventional surgeries.
基金funded by the Joint Fund for Regional Innovation and Development of National Natural Science Foundation of China(U21A20143)the National Science Fund for Excellent Young Scholars(52322607)the Excellent Youth Foundation of Heilongjiang Scientific Committee(YQ2022E028)。
文摘Improving the volumetric energy density of supercapacitors is essential for practical applications,which highly relies on the dense storage of ions in carbon-based electrodes.The functional units of carbon-based electrode exhibit multi-scale structural characteristics including macroscopic electrode morphologies,mesoscopic microcrystals and pores,and microscopic defects and dopants in the carbon basal plane.Therefore,the ordered combination of multi-scale structures of carbon electrode is crucial for achieving dense energy storage and high volumetric performance by leveraging the functions of various scale structu re.Considering that previous reviews have focused more on the discussion of specific scale structu re of carbon electrodes,this review takes a multi-scale perspective in which recent progresses regarding the structureperformance relationship,underlying mechanism and directional design of carbon-based multi-scale structures including carbon morphology,pore structure,carbon basal plane micro-environment and electrode technology on dense energy storage and volumetric property of supercapacitors are systematically discussed.We analyzed in detail the effects of the morphology,pore,and micro-environment of carbon electrode materials on ion dense storage,summarized the specific effects of different scale structures on volumetric property and recent research progress,and proposed the mutual influence and trade-off relationship between various scale structures.In addition,the challenges and outlooks for improving the dense storage and volumetric performance of carbon-based supercapacitors are analyzed,which can provide feasible technical reference and guidance for the design and manufacture of dense carbon-based electrode materials.