In foggy traffic scenarios, existing object detection algorithms face challenges such as low detection accuracy, poor robustness, occlusion, missed detections, and false detections. To address this issue, a multi-scal...In foggy traffic scenarios, existing object detection algorithms face challenges such as low detection accuracy, poor robustness, occlusion, missed detections, and false detections. To address this issue, a multi-scale object detection algorithm based on an improved YOLOv8 has been proposed. Firstly, a lightweight attention mechanism, Triplet Attention, is introduced to enhance the algorithm’s ability to extract multi-dimensional and multi-scale features, thereby improving the receptive capability of the feature maps. Secondly, the Diverse Branch Block (DBB) is integrated into the CSP Bottleneck with two Convolutions (C2F) module to strengthen the fusion of semantic information across different layers. Thirdly, a new decoupled detection head is proposed by redesigning the original network head based on the Diverse Branch Block module to improve detection accuracy and reduce missed and false detections. Finally, the Minimum Point Distance based Intersection-over-Union (MPDIoU) is used to replace the original YOLOv8 Complete Intersection-over-Union (CIoU) to accelerate the network’s training convergence. Comparative experiments and dehazing pre-processing tests were conducted on the RTTS and VOC-Fog datasets. Compared to the baseline YOLOv8 model, the improved algorithm achieved mean Average Precision (mAP) improvements of 4.6% and 3.8%, respectively. After defogging pre-processing, the mAP increased by 5.3% and 4.4%, respectively. The experimental results demonstrate that the improved algorithm exhibits high practicality and effectiveness in foggy traffic scenarios.展开更多
While moving ahead with the object detection technology, especially deep neural networks, many related tasks, such as medical application and industrial automation, have achieved great success. However, the detection ...While moving ahead with the object detection technology, especially deep neural networks, many related tasks, such as medical application and industrial automation, have achieved great success. However, the detection of objects with multiple aspect ratios and scales is still a key problem. This paper proposes a top-down and bottom-up feature pyramid network(TDBU-FPN),which combines multi-scale feature representation and anchor generation at multiple aspect ratios. First, in order to build the multi-scale feature map, this paper puts a number of fully convolutional layers after the backbone. Second, to link neighboring feature maps, top-down and bottom-up flows are adopted to introduce context information via top-down flow and supplement suboriginal information via bottom-up flow. The top-down flow refers to the deconvolution procedure, and the bottom-up flow refers to the pooling procedure. Third, the problem of adapting different object aspect ratios is tackled via many anchor shapes with different aspect ratios on each multi-scale feature map. The proposed method is evaluated on the pattern analysis, statistical modeling and computational learning visual object classes(PASCAL VOC)dataset and reaches an accuracy of 79%, which exhibits a 1.8% improvement with a detection speed of 23 fps.展开更多
This paper mainly focuses on the issues about generic multi-scale object perception for detection or recognition. A novel computational model in visually-feature space is presented for scene & object representatio...This paper mainly focuses on the issues about generic multi-scale object perception for detection or recognition. A novel computational model in visually-feature space is presented for scene & object representation to purse the underlying textural manifold statistically in nonparametric manner. The associative method approximately makes perceptual hierarchy in human-vision biologically coherency in specific quad-tree-pyramid structure, and the appropriate scale-value of different objects can automatically be selected by evaluating from well-defined scale function without any priori knowledge. The sufficient experiments truly demonstrate the effectiveness of scale determination in textural manifold with object localization rapidly.展开更多
Multi-scale object detection is a research hotspot,and it has critical applications in many secure systems.Although the object detection algorithms have constantly been progressing recently,how to perform highly accur...Multi-scale object detection is a research hotspot,and it has critical applications in many secure systems.Although the object detection algorithms have constantly been progressing recently,how to perform highly accurate and reliable multi-class object detection is still a challenging task due to the influence of many factors,such as the deformation and occlusion of the object in the actual scene.The more interference factors,the more complicated the semantic information,so we need a deeper network to extract deep information.However,deep neural networks often suffer from network degradation.To prevent the occurrence of degradation on deep neural networks,we put forth a new model using a newly-designed Pre-ReLU,which inserts a ReLU layer before the convolution layer for the sake of preventing network degradation and ensuring the performance of deep networks.This structure can transfer the semantic information more smoothly from the shallow to the deep layer.However,the deep networks will encounter not only degradation,but also a decline in efficiency.Therefore,to speed up the two-stage detector,we divide the feature map into many groups so as to diminish the number of parameters.Correspondingly,calculation speed has been enhanced,achieving a balance between speed and accuracy.Through mathematical demonstration,a Balanced Loss(BL)is proposed by a balance factor to decrease the weight of the negative sample during the training phase to balance the positives and negatives.Finally,our detector demonstrates rosy results in a range of experiments and gains an mAP of 73.38 on PASCAL VOC2007,which approaches the requirement of many security systems.展开更多
Computer vision-based traffic object detection plays a critical role in road traffic safety.Under hazy weather conditions,images captured by road monitoring systems exhibit three main challenges:significant scale vari...Computer vision-based traffic object detection plays a critical role in road traffic safety.Under hazy weather conditions,images captured by road monitoring systems exhibit three main challenges:significant scale variations,abundant background noise,and diverse perspectives.These factors lead to insufficient detection accuracy and limited real-time performance in object detection algorithms.We propose AMC-YOLO an improved YOLOv11-based traffic detection algorithm to address these challenges.In this work,we replace the C3k block's bottleneck module with our novel attention-gate convolution(AGConv),which improves contextual information capture,enhances feature extraction,and reduces computational redundancy.Additionally,we introduce the multi-dilation sharing convolution(MDSC)module to prevent feature information loss during pooling operations,enhancing the model's sensitivity to multi-scale features.We design a lightweight and efficient cross-channel feature fusion module(CCFM)for the path aggregation neck to adaptively adjust feature weights and optimize the model's overall performance.Experimental results demonstrate that AMC-YOLO achieves a 1.1%improvement in mAP@0.5 and a 2.7%increase in mAP@0.5:0.95 compared to YOLOv11n.On graphics processing unit(GPU)hardware,it achieves real-time performance at 376(FPS)with only 2.6 million parameters,ensuring high-precision traffic detection while meeting deployment requirements on resource-constrained devices.展开更多
The ubiquity of mobile devices has driven advancements in mobile object detection.However,challenges in multi-scale object detection in open,complex environments persist due to limited computational resources.Traditio...The ubiquity of mobile devices has driven advancements in mobile object detection.However,challenges in multi-scale object detection in open,complex environments persist due to limited computational resources.Traditional approaches like network compression,quantization,and lightweight design often sacrifice accuracy or feature representation robustness.This article introduces the Fast Multi-scale Channel Shuffling Network(FMCSNet),a novel lightweight detection model optimized for mobile devices.FMCSNet integrates a fully convolutional Multilayer Perceptron(MLP)module,offering global perception without significantly increasing parameters,effectively bridging the gap between CNNs and Vision Transformers.FMCSNet achieves a delicate balance between computation and accuracy mainly by two key modules:the ShiftMLP module,including a shift operation and an MLP module,and a Partial group Convolutional(PGConv)module,reducing computation while enhancing information exchange between channels.With a computational complexity of 1.4G FLOPs and 1.3M parameters,FMCSNet outperforms CNN-based and DWConv-based ShuffleNetv2 by 1%and 4.5%mAP on the Pascal VOC 2007 dataset,respectively.Additionally,FMCSNet achieves a mAP of 30.0(0.5:0.95 IoU threshold)with only 2.5G FLOPs and 2.0M parameters.It achieves 32 FPS on low-performance i5-series CPUs,meeting real-time detection requirements.The versatility of the PGConv module’s adaptability across scenarios further highlights FMCSNet as a promising solution for real-time mobile object detection.展开更多
Because of the challenge of compounding lightweight,high-strength Ti/Al alloys due to their considerable disparity in properties,Al 6063 as intermediate layer was proposed to fabricate TC4/Al 6063/Al 7075 three-layer ...Because of the challenge of compounding lightweight,high-strength Ti/Al alloys due to their considerable disparity in properties,Al 6063 as intermediate layer was proposed to fabricate TC4/Al 6063/Al 7075 three-layer composite plate by explosive welding.The microscopic properties of each bonding interface were elucidated through field emission scanning electron microscope and electron backscattered diffraction(EBSD).A methodology combining finite element method-smoothed particle hydrodynamics(FEM-SPH)and molecular dynamics(MD)was proposed for the analysis of the forming and evolution characteristics of explosive welding interfaces at multi-scale.The results demonstrate that the bonding interface morphologies of TC4/Al 6063 and Al 6063/Al 7075 exhibit a flat and wavy configuration,without discernible defects or cracks.The phenomenon of grain refinement is observed in the vicinity of the two bonding interfaces.Furthermore,the degree of plastic deformation of TC4 and Al 7075 is more pronounced than that of Al 6063 in the intermediate layer.The interface morphology characteristics obtained by FEM-SPH simulation exhibit a high degree of similarity to the experimental results.MD simulations reveal that the diffusion of interfacial elements predominantly occurs during the unloading phase,and the simulated thickness of interfacial diffusion aligns well with experimental outcomes.The introduction of intermediate layer in the explosive welding process can effectively produce high-quality titanium/aluminum alloy composite plates.Furthermore,this approach offers a multi-scale simulation strategy for the study of explosive welding bonding interfaces.展开更多
Prediction of production decline and evaluation of the adsorbed/free gas ratio are critical for determining the lifespan and production status of shale gas wells.Traditional production prediction methods have some sho...Prediction of production decline and evaluation of the adsorbed/free gas ratio are critical for determining the lifespan and production status of shale gas wells.Traditional production prediction methods have some shortcomings because of the low permeability and tightness of shale,complex gas flow behavior of multi-scale gas transport regions and multiple gas transport mechanism superpositions,and complex and variable production regimes of shale gas wells.Recent research has demonstrated the existence of a multi-stage isotope fractionation phenomenon during shale gas production,with the fractionation characteristics of each stage associated with the pore structure,gas in place(GIP),adsorption/desorption,and gas production process.This study presents a new approach for estimating shale gas well production and evaluating the adsorbed/free gas ratio throughout production using isotope fractionation techniques.A reservoir-scale carbon isotope fractionation(CIF)model applicable to the production process of shale gas wells was developed for the first time in this research.In contrast to the traditional model,this model improves production prediction accuracy by simultaneously fitting the gas production rate and δ^(13)C_(1) data and provides a new evaluation method of the adsorbed/free gas ratio during shale gas production.The results indicate that the diffusion and adsorption/desorption properties of rock,bottom-hole flowing pressure(BHP)of gas well,and multi-scale gas transport regions of the reservoir all affect isotope fractionation,with the diffusion and adsorption/desorption parameters of rock having the greatest effect on isotope fractionation being D∗/D,PL,VL,α,and others in that order.We effectively tested the universality of the four-stage isotope fractionation feature and revealed a unique isotope fractionation mechanism caused by the superimposed coupling of multi-scale gas transport regions during shale gas well production.Finally,we applied the established CIF model to a shale gas well in the Sichuan Basin,China,and calculated the estimated ultimate recovery(EUR)of the well to be 3.33×10^(8) m^(3);the adsorbed gas ratio during shale gas production was 1.65%,10.03%,and 23.44%in the first,fifth,and tenth years,respectively.The findings are significant for understanding the isotope fractionation mechanism during natural gas transport in complex systems and for formulating and optimizing unconventional natural gas development strategies.展开更多
In recent years,gait-based emotion recognition has been widely applied in the field of computer vision.However,existing gait emotion recognition methods typically rely on complete human skeleton data,and their accurac...In recent years,gait-based emotion recognition has been widely applied in the field of computer vision.However,existing gait emotion recognition methods typically rely on complete human skeleton data,and their accuracy significantly declines when the data is occluded.To enhance the accuracy of gait emotion recognition under occlusion,this paper proposes a Multi-scale Suppression Graph ConvolutionalNetwork(MS-GCN).TheMS-GCN consists of three main components:Joint Interpolation Module(JI Moudle),Multi-scale Temporal Convolution Network(MS-TCN),and Suppression Graph Convolutional Network(SGCN).The JI Module completes the spatially occluded skeletal joints using the(K-Nearest Neighbors)KNN interpolation method.The MS-TCN employs convolutional kernels of various sizes to comprehensively capture the emotional information embedded in the gait,compensating for the temporal occlusion of gait information.The SGCN extracts more non-prominent human gait features by suppressing the extraction of key body part features,thereby reducing the negative impact of occlusion on emotion recognition results.The proposed method is evaluated on two comprehensive datasets:Emotion-Gait,containing 4227 real gaits from sources like BML,ICT-Pollick,and ELMD,and 1000 synthetic gaits generated using STEP-Gen technology,and ELMB,consisting of 3924 gaits,with 1835 labeled with emotions such as“Happy,”“Sad,”“Angry,”and“Neutral.”On the standard datasets Emotion-Gait and ELMB,the proposed method achieved accuracies of 0.900 and 0.896,respectively,attaining performance comparable to other state-ofthe-artmethods.Furthermore,on occlusion datasets,the proposedmethod significantly mitigates the performance degradation caused by occlusion compared to other methods,the accuracy is significantly higher than that of other methods.展开更多
The fusion of infrared and visible images should emphasize the salient targets in the infrared image while preserving the textural details of the visible images.To meet these requirements,an autoencoder-based method f...The fusion of infrared and visible images should emphasize the salient targets in the infrared image while preserving the textural details of the visible images.To meet these requirements,an autoencoder-based method for infrared and visible image fusion is proposed.The encoder designed according to the optimization objective consists of a base encoder and a detail encoder,which is used to extract low-frequency and high-frequency information from the image.This extraction may lead to some information not being captured,so a compensation encoder is proposed to supplement the missing information.Multi-scale decomposition is also employed to extract image features more comprehensively.The decoder combines low-frequency,high-frequency and supplementary information to obtain multi-scale features.Subsequently,the attention strategy and fusion module are introduced to perform multi-scale fusion for image reconstruction.Experimental results on three datasets show that the fused images generated by this network effectively retain salient targets while being more consistent with human visual perception.展开更多
A nonlinear multi-scale interaction(NMI)model was proposed and developed by the first author for nearly 30 years to represent the evolution of atmospheric blocking.In this review paper,we first review the creation and...A nonlinear multi-scale interaction(NMI)model was proposed and developed by the first author for nearly 30 years to represent the evolution of atmospheric blocking.In this review paper,we first review the creation and development of the NMI model and then emphasize that the NMI model represents a new tool for identifying the basic physics of how climate change influences mid-to-high latitude weather extremes.The building of the NMI model took place over three main periods.In the 1990s,a nonlinear Schr?dinger(NLS)equation model was presented to describe atmospheric blocking as a wave packet;however,it could not depict the lifetime(10-20 days)of atmospheric blocking.In the 2000s,we proposed an NMI model of atmospheric blocking in a uniform basic flow by making a scale-separation assumption and deriving an eddyforced NLS equation.This model succeeded in describing the life cycle of atmospheric blocking.In the 2020s,the NMI model was extended to include the impact of a changing climate mainly by altering the basic zonal winds and the magnitude of the meridional background potential vorticity gradient(PVy).Model results show that when PVy is smaller,blocking has a weaker dispersion and a stronger nonlinearity,so blocking can be more persistent and have a larger zonal scale and weaker eastward movement,thus favoring stronger weather extremes.However,when PVy is much smaller and below a critical threshold under much stronger winter Arctic warming of global warming,atmospheric blocking becomes locally less persistent and shows a much stronger westward movement,which acts to inhibit local cold extremes.Such a case does not happen in summer under global warming because PVy fails to fall below the critical threshold.Thus,our theory indicates that global warming can render summer-blocking anticyclones and mid-to-high latitude heatwaves more persistent,intense,and widespread.展开更多
Remote sensing imagery,due to its high altitude,presents inherent challenges characterized by multiple scales,limited target areas,and intricate backgrounds.These inherent traits often lead to increased miss and false...Remote sensing imagery,due to its high altitude,presents inherent challenges characterized by multiple scales,limited target areas,and intricate backgrounds.These inherent traits often lead to increased miss and false detection rates when applying object recognition algorithms tailored for remote sensing imagery.Additionally,these complexities contribute to inaccuracies in target localization and hinder precise target categorization.This paper addresses these challenges by proposing a solution:The YOLO-MFD model(YOLO-MFD:Remote Sensing Image Object Detection withMulti-scale Fusion Dynamic Head).Before presenting our method,we delve into the prevalent issues faced in remote sensing imagery analysis.Specifically,we emphasize the struggles of existing object recognition algorithms in comprehensively capturing critical image features amidst varying scales and complex backgrounds.To resolve these issues,we introduce a novel approach.First,we propose the implementation of a lightweight multi-scale module called CEF.This module significantly improves the model’s ability to comprehensively capture important image features by merging multi-scale feature information.It effectively addresses the issues of missed detection and mistaken alarms that are common in remote sensing imagery.Second,an additional layer of small target detection heads is added,and a residual link is established with the higher-level feature extraction module in the backbone section.This allows the model to incorporate shallower information,significantly improving the accuracy of target localization in remotely sensed images.Finally,a dynamic head attentionmechanism is introduced.This allows themodel to exhibit greater flexibility and accuracy in recognizing shapes and targets of different sizes.Consequently,the precision of object detection is significantly improved.The trial results show that the YOLO-MFD model shows improvements of 6.3%,3.5%,and 2.5%over the original YOLOv8 model in Precision,map@0.5 and map@0.5:0.95,separately.These results illustrate the clear advantages of the method.展开更多
Mental health problems and potential psychological crises affect the healthy growth and learning performance of college students.Effective and suitable prevention of psychological crises among college students is a co...Mental health problems and potential psychological crises affect the healthy growth and learning performance of college students.Effective and suitable prevention of psychological crises among college students is a continuous challenge university managers face.To explore a method of preventing psychological crises among college students,we measured 38661 students by using SCL-90(symptom check list-90)and screened out 5790 students with positive results.Then,we measured 33188 students by using PHQ-9(patient health questionnaire-9)and screened out 603 students with suicidal ideation or behavior;we interviewed 392 students by using GAQ(growth adversity questionnaire).The number of students who had positive results at both phases is 155.As a result,we obtained a data set(N=76)by integrating the students who tested positive on the PHQ-9(i.e.total score≥20)with those who completed the PHQ-9 and GAQ.In addition,we obtained a data set(N=50)by excluding the cases in which the GAQ score is 0.With regard to QCA(qualitative comparative analysis)results,the data set(N=76)exhibits 5 constellations of solutions with a coverage rate greater than 0.7,and the first eight indicators of the PHQ-9 constitute the explanatory variables in the combined solutions.About the data set(N=50),the combined solutions are extremely complicated and the explanatory variables encompass indicators from both the PHQ-9 and GAQ.All these mean that the multi-scale could more comprehensively reflect mental health states of college students,thus enhance the accuracy and effectiveness of the corresponding hierarchical intervention,and finally provide support for preventing psychological crises in universities.展开更多
Accurately identifying small objects in high-resolution aerial images presents a complex and crucial task in thefield of small object detection on unmanned aerial vehicles(UAVs).This task is challenging due to variati...Accurately identifying small objects in high-resolution aerial images presents a complex and crucial task in thefield of small object detection on unmanned aerial vehicles(UAVs).This task is challenging due to variations inUAV flight altitude,differences in object scales,as well as factors like flight speed and motion blur.To enhancethe detection efficacy of small targets in drone aerial imagery,we propose an enhanced You Only Look Onceversion 7(YOLOv7)algorithm based on multi-scale spatial context.We build the MSC-YOLO model,whichincorporates an additional prediction head,denoted as P2,to improve adaptability for small objects.We replaceconventional downsampling with a Spatial-to-Depth Convolutional Combination(CSPDC)module to mitigatethe loss of intricate feature details related to small objects.Furthermore,we propose a Spatial Context Pyramidwith Multi-Scale Attention(SCPMA)module,which captures spatial and channel-dependent features of smalltargets acrossmultiple scales.This module enhances the perception of spatial contextual features and the utilizationof multiscale feature information.On the Visdrone2023 and UAVDT datasets,MSC-YOLO achieves remarkableresults,outperforming the baseline method YOLOv7 by 3.0%in terms ofmean average precision(mAP).The MSCYOLOalgorithm proposed in this paper has demonstrated satisfactory performance in detecting small targets inUAV aerial photography,providing strong support for practical applications.展开更多
Water electrolyzers play a crucial role in green hydrogen production.However,their efficiency and scalability are often compromised by bubble dynamics across various scales,from nanoscale to macroscale components.This...Water electrolyzers play a crucial role in green hydrogen production.However,their efficiency and scalability are often compromised by bubble dynamics across various scales,from nanoscale to macroscale components.This review explores multi-scale modeling as a tool to visualize multi-phase flow and improve mass transport in water electrolyzers.At the nanoscale,molecular dynamics(MD)simulations reveal how electrode surface features and wettability influence nanobubble nucleation and stability.Moving to the mesoscale,models such as volume of fluid(VOF)and lattice Boltzmann method(LBM)shed light on bubble transport in porous transport layers(PTLs).These insights inform innovative designs,including gradient porosity and hydrophilic-hydrophobic patterning,aimed at minimizing gas saturation.At the macroscale,VOF simulations elucidate two-phase flow regimes within channels,showing how flow field geometry and wettability affect bubble discharging.Moreover,artificial intelligence(AI)-driven surrogate models expedite the optimization process,allowing for rapid exploration of structural parameters in channel-rib flow fields and porous flow field designs.By integrating these approaches,we can bridge theoretical insights with experimental validation,ultimately enhancing water electrolyzer performance,reducing costs,and advancing affordable,high-efficiency hydrogen production.展开更多
Detecting abnormal cervical cells is crucial for early identification and timely treatment of cervical cancer.However,this task is challenging due to the morphological similarities between abnormal and normal cells an...Detecting abnormal cervical cells is crucial for early identification and timely treatment of cervical cancer.However,this task is challenging due to the morphological similarities between abnormal and normal cells and the significant variations in cell size.Pathologists often refer to surrounding cells to identify abnormalities.To emulate this slide examination behavior,this study proposes a Multi-Scale Feature Fusion Network(MSFF-Net)for detecting cervical abnormal cells.MSFF-Net employs a Cross-Scale Pooling Model(CSPM)to effectively capture diverse features and contextual information,ranging from local details to the overall structure.Additionally,a Multi-Scale Fusion Attention(MSFA)module is introduced to mitigate the impact of cell size variations by adaptively fusing local and global information at different scales.To handle the complex environment of cervical cell images,such as cell adhesion and overlapping,the Inner-CIoU loss function is utilized to more precisely measure the overlap between bounding boxes,thereby improving detection accuracy in such scenarios.Experimental results on the Comparison detector dataset demonstrate that MSFF-Net achieves a mean average precision(mAP)of 63.2%,outperforming state-of-the-art methods while maintaining a relatively small number of parameters(26.8 M).This study highlights the effectiveness of multi-scale feature fusion in enhancing the detection of cervical abnormal cells,contributing to more accurate and efficient cervical cancer screening.展开更多
Advanced chemical engineering for simultaneous modulation of nanomaterial morphology, defects, interfaces, and structure to enhance electromagnetic and microwave absorption (MA) performance. However, accurately distin...Advanced chemical engineering for simultaneous modulation of nanomaterial morphology, defects, interfaces, and structure to enhance electromagnetic and microwave absorption (MA) performance. However, accurately distinguishing the MA contributions of different scale factors and tuning the optimal combined effects remains a formidable challenge. This study employs a synergistic approach combining template protection etching and vacuum annealing to construct a controlled system of micrometer-sized cavities and amorphous carbon matrices in metal-organic framework (MOF) derivatives. The results demonstrate that the spatial effects introduced by the hollow structure enhance dielectric loss but significantly weaken impedance matching. By increasing the proportion of amorphous carbon, the balance between electromagnetic loss and impedance matching can be effectively maintained. Importantly, in a suitable graphitization environment, the presence of oxygen vacancies in amorphous carbon can induce significant polarization to compensate for the reduced conductivity loss due to the absence of sp2 carbon. Through the synergistic effects of morphology and composition, the samples exhibit a broader absorption bandwidth (6.28 GHz) and stronger reflection loss (−61.64 dB) compared to the original MOF. In conclusion, this study aims to elucidate the multiscale impacts of macroscopic micro-nano structure and microscopic defect engineering, providing valuable insights for future research in this field.展开更多
With the rapid growth of socialmedia,the spread of fake news has become a growing problem,misleading the public and causing significant harm.As social media content is often composed of both images and text,the use of...With the rapid growth of socialmedia,the spread of fake news has become a growing problem,misleading the public and causing significant harm.As social media content is often composed of both images and text,the use of multimodal approaches for fake news detection has gained significant attention.To solve the problems existing in previous multi-modal fake news detection algorithms,such as insufficient feature extraction and insufficient use of semantic relations between modes,this paper proposes the MFFFND-Co(Multimodal Feature Fusion Fake News Detection with Co-Attention Block)model.First,the model deeply explores the textual content,image content,and frequency domain features.Then,it employs a Co-Attention mechanism for cross-modal fusion.Additionally,a semantic consistency detectionmodule is designed to quantify semantic deviations,thereby enhancing the performance of fake news detection.Experimentally verified on two commonly used datasets,Twitter and Weibo,the model achieved F1 scores of 90.0% and 94.0%,respectively,significantly outperforming the pre-modified MFFFND(Multimodal Feature Fusion Fake News Detection with Attention Block)model and surpassing other baseline models.This improves the accuracy of detecting fake information in artificial intelligence detection and engineering software detection.展开更多
This paper aims to develop a nonrigid registration method of preoperative and intraoperative thoracoabdominal CT images in computer-assisted interventional surgeries for accurate tumor localization and tissue visualiz...This paper aims to develop a nonrigid registration method of preoperative and intraoperative thoracoabdominal CT images in computer-assisted interventional surgeries for accurate tumor localization and tissue visualization enhancement.However,fine structure registration of complex thoracoabdominal organs and large deformation registration caused by respiratory motion is challenging.To deal with this problem,we propose a 3D multi-scale attention VoxelMorph(MAVoxelMorph)registration network.To alleviate the large deformation problem,a multi-scale axial attention mechanism is utilized by using a residual dilated pyramid pooling for multi-scale feature extraction,and position-aware axial attention for long-distance dependencies between pixels capture.To further improve the large deformation and fine structure registration results,a multi-scale context channel attention mechanism is employed utilizing content information via adjacent encoding layers.Our method was evaluated on four public lung datasets(DIR-Lab dataset,Creatis dataset,Learn2Reg dataset,OASIS dataset)and a local dataset.Results proved that the proposed method achieved better registration performance than current state-of-the-art methods,especially in handling the registration of large deformations and fine structures.It also proved to be fast in 3D image registration,using about 1.5 s,and faster than most methods.Qualitative and quantitative assessments proved that the proposed MA-VoxelMorph has the potential to realize precise and fast tumor localization in clinical interventional surgeries.展开更多
Improving the volumetric energy density of supercapacitors is essential for practical applications,which highly relies on the dense storage of ions in carbon-based electrodes.The functional units of carbon-based elect...Improving the volumetric energy density of supercapacitors is essential for practical applications,which highly relies on the dense storage of ions in carbon-based electrodes.The functional units of carbon-based electrode exhibit multi-scale structural characteristics including macroscopic electrode morphologies,mesoscopic microcrystals and pores,and microscopic defects and dopants in the carbon basal plane.Therefore,the ordered combination of multi-scale structures of carbon electrode is crucial for achieving dense energy storage and high volumetric performance by leveraging the functions of various scale structu re.Considering that previous reviews have focused more on the discussion of specific scale structu re of carbon electrodes,this review takes a multi-scale perspective in which recent progresses regarding the structureperformance relationship,underlying mechanism and directional design of carbon-based multi-scale structures including carbon morphology,pore structure,carbon basal plane micro-environment and electrode technology on dense energy storage and volumetric property of supercapacitors are systematically discussed.We analyzed in detail the effects of the morphology,pore,and micro-environment of carbon electrode materials on ion dense storage,summarized the specific effects of different scale structures on volumetric property and recent research progress,and proposed the mutual influence and trade-off relationship between various scale structures.In addition,the challenges and outlooks for improving the dense storage and volumetric performance of carbon-based supercapacitors are analyzed,which can provide feasible technical reference and guidance for the design and manufacture of dense carbon-based electrode materials.展开更多
基金supported by the National Natural Science Foundation of China(Grant Nos.62101275 and 62101274).
文摘In foggy traffic scenarios, existing object detection algorithms face challenges such as low detection accuracy, poor robustness, occlusion, missed detections, and false detections. To address this issue, a multi-scale object detection algorithm based on an improved YOLOv8 has been proposed. Firstly, a lightweight attention mechanism, Triplet Attention, is introduced to enhance the algorithm’s ability to extract multi-dimensional and multi-scale features, thereby improving the receptive capability of the feature maps. Secondly, the Diverse Branch Block (DBB) is integrated into the CSP Bottleneck with two Convolutions (C2F) module to strengthen the fusion of semantic information across different layers. Thirdly, a new decoupled detection head is proposed by redesigning the original network head based on the Diverse Branch Block module to improve detection accuracy and reduce missed and false detections. Finally, the Minimum Point Distance based Intersection-over-Union (MPDIoU) is used to replace the original YOLOv8 Complete Intersection-over-Union (CIoU) to accelerate the network’s training convergence. Comparative experiments and dehazing pre-processing tests were conducted on the RTTS and VOC-Fog datasets. Compared to the baseline YOLOv8 model, the improved algorithm achieved mean Average Precision (mAP) improvements of 4.6% and 3.8%, respectively. After defogging pre-processing, the mAP increased by 5.3% and 4.4%, respectively. The experimental results demonstrate that the improved algorithm exhibits high practicality and effectiveness in foggy traffic scenarios.
基金supported by the Program of Introducing Talents of Discipline to Universities(111 Plan)of China(B14010)the National Natural Science Foundation of China(31727901)
文摘While moving ahead with the object detection technology, especially deep neural networks, many related tasks, such as medical application and industrial automation, have achieved great success. However, the detection of objects with multiple aspect ratios and scales is still a key problem. This paper proposes a top-down and bottom-up feature pyramid network(TDBU-FPN),which combines multi-scale feature representation and anchor generation at multiple aspect ratios. First, in order to build the multi-scale feature map, this paper puts a number of fully convolutional layers after the backbone. Second, to link neighboring feature maps, top-down and bottom-up flows are adopted to introduce context information via top-down flow and supplement suboriginal information via bottom-up flow. The top-down flow refers to the deconvolution procedure, and the bottom-up flow refers to the pooling procedure. Third, the problem of adapting different object aspect ratios is tackled via many anchor shapes with different aspect ratios on each multi-scale feature map. The proposed method is evaluated on the pattern analysis, statistical modeling and computational learning visual object classes(PASCAL VOC)dataset and reaches an accuracy of 79%, which exhibits a 1.8% improvement with a detection speed of 23 fps.
文摘This paper mainly focuses on the issues about generic multi-scale object perception for detection or recognition. A novel computational model in visually-feature space is presented for scene & object representation to purse the underlying textural manifold statistically in nonparametric manner. The associative method approximately makes perceptual hierarchy in human-vision biologically coherency in specific quad-tree-pyramid structure, and the appropriate scale-value of different objects can automatically be selected by evaluating from well-defined scale function without any priori knowledge. The sufficient experiments truly demonstrate the effectiveness of scale determination in textural manifold with object localization rapidly.
基金supported by the Science and Technology Project of Sichuan(Nos.2019YFG0504,2021YFG0314,2020YFG0459)the National Natural Science Foundation of China(Grant Nos.61872066 and U19A2078).
文摘Multi-scale object detection is a research hotspot,and it has critical applications in many secure systems.Although the object detection algorithms have constantly been progressing recently,how to perform highly accurate and reliable multi-class object detection is still a challenging task due to the influence of many factors,such as the deformation and occlusion of the object in the actual scene.The more interference factors,the more complicated the semantic information,so we need a deeper network to extract deep information.However,deep neural networks often suffer from network degradation.To prevent the occurrence of degradation on deep neural networks,we put forth a new model using a newly-designed Pre-ReLU,which inserts a ReLU layer before the convolution layer for the sake of preventing network degradation and ensuring the performance of deep networks.This structure can transfer the semantic information more smoothly from the shallow to the deep layer.However,the deep networks will encounter not only degradation,but also a decline in efficiency.Therefore,to speed up the two-stage detector,we divide the feature map into many groups so as to diminish the number of parameters.Correspondingly,calculation speed has been enhanced,achieving a balance between speed and accuracy.Through mathematical demonstration,a Balanced Loss(BL)is proposed by a balance factor to decrease the weight of the negative sample during the training phase to balance the positives and negatives.Finally,our detector demonstrates rosy results in a range of experiments and gains an mAP of 73.38 on PASCAL VOC2007,which approaches the requirement of many security systems.
基金supported by the Wuhan Pilot construction of a strong Transportation Country Science and Technology Joint Research Project(No.2024-1-10).
文摘Computer vision-based traffic object detection plays a critical role in road traffic safety.Under hazy weather conditions,images captured by road monitoring systems exhibit three main challenges:significant scale variations,abundant background noise,and diverse perspectives.These factors lead to insufficient detection accuracy and limited real-time performance in object detection algorithms.We propose AMC-YOLO an improved YOLOv11-based traffic detection algorithm to address these challenges.In this work,we replace the C3k block's bottleneck module with our novel attention-gate convolution(AGConv),which improves contextual information capture,enhances feature extraction,and reduces computational redundancy.Additionally,we introduce the multi-dilation sharing convolution(MDSC)module to prevent feature information loss during pooling operations,enhancing the model's sensitivity to multi-scale features.We design a lightweight and efficient cross-channel feature fusion module(CCFM)for the path aggregation neck to adaptively adjust feature weights and optimize the model's overall performance.Experimental results demonstrate that AMC-YOLO achieves a 1.1%improvement in mAP@0.5 and a 2.7%increase in mAP@0.5:0.95 compared to YOLOv11n.On graphics processing unit(GPU)hardware,it achieves real-time performance at 376(FPS)with only 2.6 million parameters,ensuring high-precision traffic detection while meeting deployment requirements on resource-constrained devices.
基金funded by the National Natural Science Foundation of China under Grant No.62371187the Open Program of Hunan Intelligent Rehabilitation Robot and Auxiliary Equipment Engineering Technology Research Center under Grant No.2024JS101.
文摘The ubiquity of mobile devices has driven advancements in mobile object detection.However,challenges in multi-scale object detection in open,complex environments persist due to limited computational resources.Traditional approaches like network compression,quantization,and lightweight design often sacrifice accuracy or feature representation robustness.This article introduces the Fast Multi-scale Channel Shuffling Network(FMCSNet),a novel lightweight detection model optimized for mobile devices.FMCSNet integrates a fully convolutional Multilayer Perceptron(MLP)module,offering global perception without significantly increasing parameters,effectively bridging the gap between CNNs and Vision Transformers.FMCSNet achieves a delicate balance between computation and accuracy mainly by two key modules:the ShiftMLP module,including a shift operation and an MLP module,and a Partial group Convolutional(PGConv)module,reducing computation while enhancing information exchange between channels.With a computational complexity of 1.4G FLOPs and 1.3M parameters,FMCSNet outperforms CNN-based and DWConv-based ShuffleNetv2 by 1%and 4.5%mAP on the Pascal VOC 2007 dataset,respectively.Additionally,FMCSNet achieves a mAP of 30.0(0.5:0.95 IoU threshold)with only 2.5G FLOPs and 2.0M parameters.It achieves 32 FPS on low-performance i5-series CPUs,meeting real-time detection requirements.The versatility of the PGConv module’s adaptability across scenarios further highlights FMCSNet as a promising solution for real-time mobile object detection.
基金Opening Foundation of Key Laboratory of Explosive Energy Utilization and Control,Anhui Province(BP20240104)Graduate Innovation Program of China University of Mining and Technology(2024WLJCRCZL049)Postgraduate Research&Practice Innovation Program of Jiangsu Province(KYCX24_2701)。
文摘Because of the challenge of compounding lightweight,high-strength Ti/Al alloys due to their considerable disparity in properties,Al 6063 as intermediate layer was proposed to fabricate TC4/Al 6063/Al 7075 three-layer composite plate by explosive welding.The microscopic properties of each bonding interface were elucidated through field emission scanning electron microscope and electron backscattered diffraction(EBSD).A methodology combining finite element method-smoothed particle hydrodynamics(FEM-SPH)and molecular dynamics(MD)was proposed for the analysis of the forming and evolution characteristics of explosive welding interfaces at multi-scale.The results demonstrate that the bonding interface morphologies of TC4/Al 6063 and Al 6063/Al 7075 exhibit a flat and wavy configuration,without discernible defects or cracks.The phenomenon of grain refinement is observed in the vicinity of the two bonding interfaces.Furthermore,the degree of plastic deformation of TC4 and Al 7075 is more pronounced than that of Al 6063 in the intermediate layer.The interface morphology characteristics obtained by FEM-SPH simulation exhibit a high degree of similarity to the experimental results.MD simulations reveal that the diffusion of interfacial elements predominantly occurs during the unloading phase,and the simulated thickness of interfacial diffusion aligns well with experimental outcomes.The introduction of intermediate layer in the explosive welding process can effectively produce high-quality titanium/aluminum alloy composite plates.Furthermore,this approach offers a multi-scale simulation strategy for the study of explosive welding bonding interfaces.
基金supported by the Natural Science Foundation of China(Grant No.42302170)National Postdoctoral Innovative Talent Support Program(Grant No.BX20220062)+3 种基金CNPC Innovation Found(Grant No.2022DQ02-0104)National Science Foundation of Heilongjiang Province of China(Grant No.YQ2023D001)Postdoctoral Science Foundation of Heilongjiang Province of China(Grant No.LBH-Z22091)the Natural Science Foundation of Shandong Province(Grant No.ZR2022YQ30).
文摘Prediction of production decline and evaluation of the adsorbed/free gas ratio are critical for determining the lifespan and production status of shale gas wells.Traditional production prediction methods have some shortcomings because of the low permeability and tightness of shale,complex gas flow behavior of multi-scale gas transport regions and multiple gas transport mechanism superpositions,and complex and variable production regimes of shale gas wells.Recent research has demonstrated the existence of a multi-stage isotope fractionation phenomenon during shale gas production,with the fractionation characteristics of each stage associated with the pore structure,gas in place(GIP),adsorption/desorption,and gas production process.This study presents a new approach for estimating shale gas well production and evaluating the adsorbed/free gas ratio throughout production using isotope fractionation techniques.A reservoir-scale carbon isotope fractionation(CIF)model applicable to the production process of shale gas wells was developed for the first time in this research.In contrast to the traditional model,this model improves production prediction accuracy by simultaneously fitting the gas production rate and δ^(13)C_(1) data and provides a new evaluation method of the adsorbed/free gas ratio during shale gas production.The results indicate that the diffusion and adsorption/desorption properties of rock,bottom-hole flowing pressure(BHP)of gas well,and multi-scale gas transport regions of the reservoir all affect isotope fractionation,with the diffusion and adsorption/desorption parameters of rock having the greatest effect on isotope fractionation being D∗/D,PL,VL,α,and others in that order.We effectively tested the universality of the four-stage isotope fractionation feature and revealed a unique isotope fractionation mechanism caused by the superimposed coupling of multi-scale gas transport regions during shale gas well production.Finally,we applied the established CIF model to a shale gas well in the Sichuan Basin,China,and calculated the estimated ultimate recovery(EUR)of the well to be 3.33×10^(8) m^(3);the adsorbed gas ratio during shale gas production was 1.65%,10.03%,and 23.44%in the first,fifth,and tenth years,respectively.The findings are significant for understanding the isotope fractionation mechanism during natural gas transport in complex systems and for formulating and optimizing unconventional natural gas development strategies.
基金supported by the National Natural Science Foundation of China(62272049,62236006,62172045)the Key Projects of Beijing Union University(ZKZD202301).
文摘In recent years,gait-based emotion recognition has been widely applied in the field of computer vision.However,existing gait emotion recognition methods typically rely on complete human skeleton data,and their accuracy significantly declines when the data is occluded.To enhance the accuracy of gait emotion recognition under occlusion,this paper proposes a Multi-scale Suppression Graph ConvolutionalNetwork(MS-GCN).TheMS-GCN consists of three main components:Joint Interpolation Module(JI Moudle),Multi-scale Temporal Convolution Network(MS-TCN),and Suppression Graph Convolutional Network(SGCN).The JI Module completes the spatially occluded skeletal joints using the(K-Nearest Neighbors)KNN interpolation method.The MS-TCN employs convolutional kernels of various sizes to comprehensively capture the emotional information embedded in the gait,compensating for the temporal occlusion of gait information.The SGCN extracts more non-prominent human gait features by suppressing the extraction of key body part features,thereby reducing the negative impact of occlusion on emotion recognition results.The proposed method is evaluated on two comprehensive datasets:Emotion-Gait,containing 4227 real gaits from sources like BML,ICT-Pollick,and ELMD,and 1000 synthetic gaits generated using STEP-Gen technology,and ELMB,consisting of 3924 gaits,with 1835 labeled with emotions such as“Happy,”“Sad,”“Angry,”and“Neutral.”On the standard datasets Emotion-Gait and ELMB,the proposed method achieved accuracies of 0.900 and 0.896,respectively,attaining performance comparable to other state-ofthe-artmethods.Furthermore,on occlusion datasets,the proposedmethod significantly mitigates the performance degradation caused by occlusion compared to other methods,the accuracy is significantly higher than that of other methods.
基金Supported by the Henan Province Key Research and Development Project(231111211300)the Central Government of Henan Province Guides Local Science and Technology Development Funds(Z20231811005)+2 种基金Henan Province Key Research and Development Project(231111110100)Henan Provincial Outstanding Foreign Scientist Studio(GZS2024006)Henan Provincial Joint Fund for Scientific and Technological Research and Development Plan(Application and Overcoming Technical Barriers)(242103810028)。
文摘The fusion of infrared and visible images should emphasize the salient targets in the infrared image while preserving the textural details of the visible images.To meet these requirements,an autoencoder-based method for infrared and visible image fusion is proposed.The encoder designed according to the optimization objective consists of a base encoder and a detail encoder,which is used to extract low-frequency and high-frequency information from the image.This extraction may lead to some information not being captured,so a compensation encoder is proposed to supplement the missing information.Multi-scale decomposition is also employed to extract image features more comprehensively.The decoder combines low-frequency,high-frequency and supplementary information to obtain multi-scale features.Subsequently,the attention strategy and fusion module are introduced to perform multi-scale fusion for image reconstruction.Experimental results on three datasets show that the fused images generated by this network effectively retain salient targets while being more consistent with human visual perception.
基金supported by the National Natural Science Foundation of China(Grant Nos.42150204 and 2288101)supported by the China National Postdoctoral Program for Innovative Talents(BX20230045)the China Postdoctoral Science Foundation(2023M730279)。
文摘A nonlinear multi-scale interaction(NMI)model was proposed and developed by the first author for nearly 30 years to represent the evolution of atmospheric blocking.In this review paper,we first review the creation and development of the NMI model and then emphasize that the NMI model represents a new tool for identifying the basic physics of how climate change influences mid-to-high latitude weather extremes.The building of the NMI model took place over three main periods.In the 1990s,a nonlinear Schr?dinger(NLS)equation model was presented to describe atmospheric blocking as a wave packet;however,it could not depict the lifetime(10-20 days)of atmospheric blocking.In the 2000s,we proposed an NMI model of atmospheric blocking in a uniform basic flow by making a scale-separation assumption and deriving an eddyforced NLS equation.This model succeeded in describing the life cycle of atmospheric blocking.In the 2020s,the NMI model was extended to include the impact of a changing climate mainly by altering the basic zonal winds and the magnitude of the meridional background potential vorticity gradient(PVy).Model results show that when PVy is smaller,blocking has a weaker dispersion and a stronger nonlinearity,so blocking can be more persistent and have a larger zonal scale and weaker eastward movement,thus favoring stronger weather extremes.However,when PVy is much smaller and below a critical threshold under much stronger winter Arctic warming of global warming,atmospheric blocking becomes locally less persistent and shows a much stronger westward movement,which acts to inhibit local cold extremes.Such a case does not happen in summer under global warming because PVy fails to fall below the critical threshold.Thus,our theory indicates that global warming can render summer-blocking anticyclones and mid-to-high latitude heatwaves more persistent,intense,and widespread.
基金the Scientific Research Fund of Hunan Provincial Education Department(23A0423).
文摘Remote sensing imagery,due to its high altitude,presents inherent challenges characterized by multiple scales,limited target areas,and intricate backgrounds.These inherent traits often lead to increased miss and false detection rates when applying object recognition algorithms tailored for remote sensing imagery.Additionally,these complexities contribute to inaccuracies in target localization and hinder precise target categorization.This paper addresses these challenges by proposing a solution:The YOLO-MFD model(YOLO-MFD:Remote Sensing Image Object Detection withMulti-scale Fusion Dynamic Head).Before presenting our method,we delve into the prevalent issues faced in remote sensing imagery analysis.Specifically,we emphasize the struggles of existing object recognition algorithms in comprehensively capturing critical image features amidst varying scales and complex backgrounds.To resolve these issues,we introduce a novel approach.First,we propose the implementation of a lightweight multi-scale module called CEF.This module significantly improves the model’s ability to comprehensively capture important image features by merging multi-scale feature information.It effectively addresses the issues of missed detection and mistaken alarms that are common in remote sensing imagery.Second,an additional layer of small target detection heads is added,and a residual link is established with the higher-level feature extraction module in the backbone section.This allows the model to incorporate shallower information,significantly improving the accuracy of target localization in remotely sensed images.Finally,a dynamic head attentionmechanism is introduced.This allows themodel to exhibit greater flexibility and accuracy in recognizing shapes and targets of different sizes.Consequently,the precision of object detection is significantly improved.The trial results show that the YOLO-MFD model shows improvements of 6.3%,3.5%,and 2.5%over the original YOLOv8 model in Precision,map@0.5 and map@0.5:0.95,separately.These results illustrate the clear advantages of the method.
文摘Mental health problems and potential psychological crises affect the healthy growth and learning performance of college students.Effective and suitable prevention of psychological crises among college students is a continuous challenge university managers face.To explore a method of preventing psychological crises among college students,we measured 38661 students by using SCL-90(symptom check list-90)and screened out 5790 students with positive results.Then,we measured 33188 students by using PHQ-9(patient health questionnaire-9)and screened out 603 students with suicidal ideation or behavior;we interviewed 392 students by using GAQ(growth adversity questionnaire).The number of students who had positive results at both phases is 155.As a result,we obtained a data set(N=76)by integrating the students who tested positive on the PHQ-9(i.e.total score≥20)with those who completed the PHQ-9 and GAQ.In addition,we obtained a data set(N=50)by excluding the cases in which the GAQ score is 0.With regard to QCA(qualitative comparative analysis)results,the data set(N=76)exhibits 5 constellations of solutions with a coverage rate greater than 0.7,and the first eight indicators of the PHQ-9 constitute the explanatory variables in the combined solutions.About the data set(N=50),the combined solutions are extremely complicated and the explanatory variables encompass indicators from both the PHQ-9 and GAQ.All these mean that the multi-scale could more comprehensively reflect mental health states of college students,thus enhance the accuracy and effectiveness of the corresponding hierarchical intervention,and finally provide support for preventing psychological crises in universities.
基金the Key Research and Development Program of Hainan Province(Grant Nos.ZDYF2023GXJS163,ZDYF2024GXJS014)National Natural Science Foundation of China(NSFC)(Grant Nos.62162022,62162024)+2 种基金the Major Science and Technology Project of Hainan Province(Grant No.ZDKJ2020012)Hainan Provincial Natural Science Foundation of China(Grant No.620MS021)Youth Foundation Project of Hainan Natural Science Foundation(621QN211).
文摘Accurately identifying small objects in high-resolution aerial images presents a complex and crucial task in thefield of small object detection on unmanned aerial vehicles(UAVs).This task is challenging due to variations inUAV flight altitude,differences in object scales,as well as factors like flight speed and motion blur.To enhancethe detection efficacy of small targets in drone aerial imagery,we propose an enhanced You Only Look Onceversion 7(YOLOv7)algorithm based on multi-scale spatial context.We build the MSC-YOLO model,whichincorporates an additional prediction head,denoted as P2,to improve adaptability for small objects.We replaceconventional downsampling with a Spatial-to-Depth Convolutional Combination(CSPDC)module to mitigatethe loss of intricate feature details related to small objects.Furthermore,we propose a Spatial Context Pyramidwith Multi-Scale Attention(SCPMA)module,which captures spatial and channel-dependent features of smalltargets acrossmultiple scales.This module enhances the perception of spatial contextual features and the utilizationof multiscale feature information.On the Visdrone2023 and UAVDT datasets,MSC-YOLO achieves remarkableresults,outperforming the baseline method YOLOv7 by 3.0%in terms ofmean average precision(mAP).The MSCYOLOalgorithm proposed in this paper has demonstrated satisfactory performance in detecting small targets inUAV aerial photography,providing strong support for practical applications.
基金supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region,China(Project No.15308024)a grant from Research Centre for Carbon-Strategic Catalysis,The Hong Kong Polytechnic University(CE2X).
文摘Water electrolyzers play a crucial role in green hydrogen production.However,their efficiency and scalability are often compromised by bubble dynamics across various scales,from nanoscale to macroscale components.This review explores multi-scale modeling as a tool to visualize multi-phase flow and improve mass transport in water electrolyzers.At the nanoscale,molecular dynamics(MD)simulations reveal how electrode surface features and wettability influence nanobubble nucleation and stability.Moving to the mesoscale,models such as volume of fluid(VOF)and lattice Boltzmann method(LBM)shed light on bubble transport in porous transport layers(PTLs).These insights inform innovative designs,including gradient porosity and hydrophilic-hydrophobic patterning,aimed at minimizing gas saturation.At the macroscale,VOF simulations elucidate two-phase flow regimes within channels,showing how flow field geometry and wettability affect bubble discharging.Moreover,artificial intelligence(AI)-driven surrogate models expedite the optimization process,allowing for rapid exploration of structural parameters in channel-rib flow fields and porous flow field designs.By integrating these approaches,we can bridge theoretical insights with experimental validation,ultimately enhancing water electrolyzer performance,reducing costs,and advancing affordable,high-efficiency hydrogen production.
基金funded by the China Chongqing Municipal Science and Technology Bureau,grant numbers 2024TIAD-CYKJCXX0121,2024NSCQ-LZX0135Chongqing Municipal Commission of Housing and Urban-Rural Development,grant number CKZ2024-87+3 种基金the Chongqing University of Technology graduate education high-quality development project,grant number gzlsz202401the Chongqing University of Technology-Chongqing LINGLUE Technology Co.,Ltd.,Electronic Information(Artificial Intelligence)graduate joint training basethe Postgraduate Education and Teaching Reform Research Project in Chongqing,grant number yjg213116the Chongqing University of Technology-CISDI Chongqing Information Technology Co.,Ltd.,Computer Technology graduate joint training base.
文摘Detecting abnormal cervical cells is crucial for early identification and timely treatment of cervical cancer.However,this task is challenging due to the morphological similarities between abnormal and normal cells and the significant variations in cell size.Pathologists often refer to surrounding cells to identify abnormalities.To emulate this slide examination behavior,this study proposes a Multi-Scale Feature Fusion Network(MSFF-Net)for detecting cervical abnormal cells.MSFF-Net employs a Cross-Scale Pooling Model(CSPM)to effectively capture diverse features and contextual information,ranging from local details to the overall structure.Additionally,a Multi-Scale Fusion Attention(MSFA)module is introduced to mitigate the impact of cell size variations by adaptively fusing local and global information at different scales.To handle the complex environment of cervical cell images,such as cell adhesion and overlapping,the Inner-CIoU loss function is utilized to more precisely measure the overlap between bounding boxes,thereby improving detection accuracy in such scenarios.Experimental results on the Comparison detector dataset demonstrate that MSFF-Net achieves a mean average precision(mAP)of 63.2%,outperforming state-of-the-art methods while maintaining a relatively small number of parameters(26.8 M).This study highlights the effectiveness of multi-scale feature fusion in enhancing the detection of cervical abnormal cells,contributing to more accurate and efficient cervical cancer screening.
基金supported by the National Natural Science Foundation of China(52172091,52172295)Defense Industrial Technology Development Program(JCKY2023605C002)+4 种基金Frontier Leading Technology Basic Research Major Project of Jiangsu Province(SBK2023050110)the National Key Laboratory on Electromagnetic Environmental Effects and Electro-optical Engineering(NO.61422062301)the Opening Project of Science and Technology on Reliability Physics and Application Technology of Electronic Component Laboratory(ZHD202305)the Opening Project of Jiangsu Key Laboratory of Advanced Structural Materials and Application Technology(ASMA202303)the Postgraduate Research&Practice Innovation Program of Jiangsu Province(KYCX23_0371).
文摘Advanced chemical engineering for simultaneous modulation of nanomaterial morphology, defects, interfaces, and structure to enhance electromagnetic and microwave absorption (MA) performance. However, accurately distinguishing the MA contributions of different scale factors and tuning the optimal combined effects remains a formidable challenge. This study employs a synergistic approach combining template protection etching and vacuum annealing to construct a controlled system of micrometer-sized cavities and amorphous carbon matrices in metal-organic framework (MOF) derivatives. The results demonstrate that the spatial effects introduced by the hollow structure enhance dielectric loss but significantly weaken impedance matching. By increasing the proportion of amorphous carbon, the balance between electromagnetic loss and impedance matching can be effectively maintained. Importantly, in a suitable graphitization environment, the presence of oxygen vacancies in amorphous carbon can induce significant polarization to compensate for the reduced conductivity loss due to the absence of sp2 carbon. Through the synergistic effects of morphology and composition, the samples exhibit a broader absorption bandwidth (6.28 GHz) and stronger reflection loss (−61.64 dB) compared to the original MOF. In conclusion, this study aims to elucidate the multiscale impacts of macroscopic micro-nano structure and microscopic defect engineering, providing valuable insights for future research in this field.
基金supported by Communication University of China(HG23035)partly supported by the Fundamental Research Funds for the Central Universities(CUC230A013).
文摘With the rapid growth of socialmedia,the spread of fake news has become a growing problem,misleading the public and causing significant harm.As social media content is often composed of both images and text,the use of multimodal approaches for fake news detection has gained significant attention.To solve the problems existing in previous multi-modal fake news detection algorithms,such as insufficient feature extraction and insufficient use of semantic relations between modes,this paper proposes the MFFFND-Co(Multimodal Feature Fusion Fake News Detection with Co-Attention Block)model.First,the model deeply explores the textual content,image content,and frequency domain features.Then,it employs a Co-Attention mechanism for cross-modal fusion.Additionally,a semantic consistency detectionmodule is designed to quantify semantic deviations,thereby enhancing the performance of fake news detection.Experimentally verified on two commonly used datasets,Twitter and Weibo,the model achieved F1 scores of 90.0% and 94.0%,respectively,significantly outperforming the pre-modified MFFFND(Multimodal Feature Fusion Fake News Detection with Attention Block)model and surpassing other baseline models.This improves the accuracy of detecting fake information in artificial intelligence detection and engineering software detection.
基金supported in part by the National Natural Science Foundation of China[62301374]Hubei Provincial Natural Science Foundation of China[2022CFB804]+2 种基金Hubei Provincial Education Research Project[B2022057]the Youths Science Foundation of Wuhan Institute of Technology[K202240]the 15th Graduate Education Innovation Fund of Wuhan Institute of Technology[CX2023295].
文摘This paper aims to develop a nonrigid registration method of preoperative and intraoperative thoracoabdominal CT images in computer-assisted interventional surgeries for accurate tumor localization and tissue visualization enhancement.However,fine structure registration of complex thoracoabdominal organs and large deformation registration caused by respiratory motion is challenging.To deal with this problem,we propose a 3D multi-scale attention VoxelMorph(MAVoxelMorph)registration network.To alleviate the large deformation problem,a multi-scale axial attention mechanism is utilized by using a residual dilated pyramid pooling for multi-scale feature extraction,and position-aware axial attention for long-distance dependencies between pixels capture.To further improve the large deformation and fine structure registration results,a multi-scale context channel attention mechanism is employed utilizing content information via adjacent encoding layers.Our method was evaluated on four public lung datasets(DIR-Lab dataset,Creatis dataset,Learn2Reg dataset,OASIS dataset)and a local dataset.Results proved that the proposed method achieved better registration performance than current state-of-the-art methods,especially in handling the registration of large deformations and fine structures.It also proved to be fast in 3D image registration,using about 1.5 s,and faster than most methods.Qualitative and quantitative assessments proved that the proposed MA-VoxelMorph has the potential to realize precise and fast tumor localization in clinical interventional surgeries.
基金funded by the Joint Fund for Regional Innovation and Development of National Natural Science Foundation of China(U21A20143)the National Science Fund for Excellent Young Scholars(52322607)the Excellent Youth Foundation of Heilongjiang Scientific Committee(YQ2022E028)。
文摘Improving the volumetric energy density of supercapacitors is essential for practical applications,which highly relies on the dense storage of ions in carbon-based electrodes.The functional units of carbon-based electrode exhibit multi-scale structural characteristics including macroscopic electrode morphologies,mesoscopic microcrystals and pores,and microscopic defects and dopants in the carbon basal plane.Therefore,the ordered combination of multi-scale structures of carbon electrode is crucial for achieving dense energy storage and high volumetric performance by leveraging the functions of various scale structu re.Considering that previous reviews have focused more on the discussion of specific scale structu re of carbon electrodes,this review takes a multi-scale perspective in which recent progresses regarding the structureperformance relationship,underlying mechanism and directional design of carbon-based multi-scale structures including carbon morphology,pore structure,carbon basal plane micro-environment and electrode technology on dense energy storage and volumetric property of supercapacitors are systematically discussed.We analyzed in detail the effects of the morphology,pore,and micro-environment of carbon electrode materials on ion dense storage,summarized the specific effects of different scale structures on volumetric property and recent research progress,and proposed the mutual influence and trade-off relationship between various scale structures.In addition,the challenges and outlooks for improving the dense storage and volumetric performance of carbon-based supercapacitors are analyzed,which can provide feasible technical reference and guidance for the design and manufacture of dense carbon-based electrode materials.