Traditional data-driven fault diagnosis methods depend on expert experience to manually extract effective fault features of signals,which has certain limitations.Conversely,deep learning techniques have gained promine...Traditional data-driven fault diagnosis methods depend on expert experience to manually extract effective fault features of signals,which has certain limitations.Conversely,deep learning techniques have gained prominence as a central focus of research in the field of fault diagnosis by strong fault feature extraction ability and end-to-end fault diagnosis efficiency.Recently,utilizing the respective advantages of convolution neural network(CNN)and Transformer in local and global feature extraction,research on cooperating the two have demonstrated promise in the field of fault diagnosis.However,the cross-channel convolution mechanism in CNN and the self-attention calculations in Transformer contribute to excessive complexity in the cooperative model.This complexity results in high computational costs and limited industrial applicability.To tackle the above challenges,this paper proposes a lightweight CNN-Transformer named as SEFormer for rotating machinery fault diagnosis.First,a separable multiscale depthwise convolution block is designed to extract and integrate multiscale feature information from different channel dimensions of vibration signals.Then,an efficient self-attention block is developed to capture critical fine-grained features of the signal from a global perspective.Finally,experimental results on the planetary gearbox dataset and themotor roller bearing dataset prove that the proposed framework can balance the advantages of robustness,generalization and lightweight compared to recent state-of-the-art fault diagnosis models based on CNN and Transformer.This study presents a feasible strategy for developing a lightweight rotating machinery fault diagnosis framework aimed at economical deployment.展开更多
Located in northern China,the Hetao Plain is an important agro-economic zone and population centre.The deterioration of local groundwater quality has had a serious impact on human health and economic development.Nowad...Located in northern China,the Hetao Plain is an important agro-economic zone and population centre.The deterioration of local groundwater quality has had a serious impact on human health and economic development.Nowadays,the groundwater vulnerability assessment(GVA)has become an essential task to identify the current status and development trend of groundwater quality.In this study,the Convolutional Neural Network(CNN)and Long Short-Term Memory(LSTM)models are integrated to realize the spatio-temporal prediction of regional groundwater vulnerability by introducing the Self-attention mechanism.The study firstly builds the CNN-LSTM modelwith self-attention(SA)mechanism and evaluates the prediction accuracy of the model for groundwater vulnerability compared to other common machine learning models such as Support Vector Machine(SVM),Random Forest(RF),and Extreme Gradient Boosting(XGBoost).The results indicate that the CNNLSTM model outperforms thesemodels,demonstrating its significance in groundwater vulnerability assessment.It can be posited that the predictions indicate an increased risk of groundwater vulnerability in the study area over the coming years.This increase can be attributed to the synergistic impact of global climate anomalies and intensified local human activities.Moreover,the overall groundwater vulnerability risk in the entire region has increased,evident fromboth the notably high value and standard deviation.This suggests that the spatial variability of groundwater vulnerability in the area is expected to expand in the future due to the sustained progression of climate change and human activities.The model can be optimized for diverse applications across regional environmental assessment,pollution prediction,and risk statistics.This study holds particular significance for ecological protection and groundwater resource management.展开更多
Visual object tracking plays a crucial role in computer vision.In recent years,researchers have proposed various methods to achieve high-performance object tracking.Among these,methods based on Transformers have becom...Visual object tracking plays a crucial role in computer vision.In recent years,researchers have proposed various methods to achieve high-performance object tracking.Among these,methods based on Transformers have become a research hotspot due to their ability to globally model and contextualize information.However,current Transformer-based object tracking methods still face challenges such as low tracking accuracy and the presence of redundant feature information.In this paper,we introduce self-calibration multi-head self-attention Transformer(SMSTracker)as a solution to these challenges.It employs a hybrid tensor decomposition self-organizing multihead self-attention transformermechanism,which not only compresses and accelerates Transformer operations but also significantly reduces redundant data,thereby enhancing the accuracy and efficiency of tracking.Additionally,we introduce a self-calibration attention fusion block to resolve common issues of attention ambiguities and inconsistencies found in traditional trackingmethods,ensuring the stability and reliability of tracking performance across various scenarios.By integrating a hybrid tensor decomposition approach with a self-organizingmulti-head self-attentive transformer mechanism,SMSTracker enhances the efficiency and accuracy of the tracking process.Experimental results show that SMSTracker achieves competitive performance in visual object tracking,promising more robust and efficient tracking systems,demonstrating its potential to providemore robust and efficient tracking solutions in real-world applications.展开更多
Lightweight convolutional neural networks(CNNs)have simple structures but struggle to comprehensively and accurately extract important semantic information from images.While attention mechanisms can enhance CNNs by le...Lightweight convolutional neural networks(CNNs)have simple structures but struggle to comprehensively and accurately extract important semantic information from images.While attention mechanisms can enhance CNNs by learning distinctive representations,most existing spatial and hybrid attention methods focus on local regions with extensive parameters,making them unsuitable for lightweight CNNs.In this paper,we propose a self-attention mechanism tailored for lightweight networks,namely the brief self-attention module(BSAM).BSAM consists of the brief spatial attention(BSA)and advanced channel attention blocks.Unlike conventional self-attention methods with many parameters,our BSA block improves the performance of lightweight networks by effectively learning global semantic representations.Moreover,BSAM can be seamlessly integrated into lightweight CNNs for end-to-end training,maintaining the network’s lightweight and mobile characteristics.We validate the effectiveness of the proposed method on image classification tasks using the Food-101,Caltech-256,and Mini-ImageNet datasets.展开更多
A healthy brain is vital to every person since the brain controls every movement and emotion.Sometimes,some brain cells grow unexpectedly to be uncontrollable and cancerous.These cancerous cells are called brain tumor...A healthy brain is vital to every person since the brain controls every movement and emotion.Sometimes,some brain cells grow unexpectedly to be uncontrollable and cancerous.These cancerous cells are called brain tumors.For diagnosed patients,their lives depend mainly on the early diagnosis of these tumors to provide suitable treatment plans.Nowadays,Physicians and radiologists rely on Magnetic Resonance Imaging(MRI)pictures for their clinical evaluations of brain tumors.These evaluations are time-consuming,expensive,and require expertise with high skills to provide an accurate diagnosis.Scholars and industrials have recently partnered to implement automatic solutions to diagnose the disease with high accuracy.Due to their accuracy,some of these solutions depend on deep-learning(DL)methodologies.These techniques have become important due to their roles in the diagnosis process,which includes identification and classification.Therefore,there is a need for a solid and robust approach based on a deep-learning method to diagnose brain tumors.The purpose of this study is to develop an intelligent automatic framework for brain tumor diagnosis.The proposed solution is based on a novel dense dynamic residual self-attention transfer adaptive learning fusion approach(NDDRSATALFA),carried over two implemented deep-learning networks:VGG19 and UNET to identify and classify brain tumors.In addition,this solution applies a transfer learning approach to exchange extracted features and data within the two neural networks.The presented framework is trained,validated,and tested on six public datasets of MRIs to detect brain tumors and categorize these tumors into three suitable classes,which are glioma,meningioma,and pituitary.The proposed framework yielded remarkable findings on variously evaluated performance indicators:99.32%accuracy,98.74%sensitivity,98.89%specificity,99.01%Dice,98.93%Area Under the Curve(AUC),and 99.81%F1-score.In addition,a comparative analysis with recent state-of-the-art methods was performed and according to the comparative analysis,NDDRSATALFA shows an admirable level of reliability in simplifying the timely identification of diverse brain tumors.Moreover,this framework can be applied by healthcare providers to assist radiologists,pathologists,and physicians in their evaluations.The attained outcomes open doors for advanced automatic solutions that improve clinical evaluations and provide reasonable treatment plans.展开更多
Medical image analysis based on deep learning has become an important technical requirement in the field of smart healthcare.In view of the difficulties in collaborative modeling of local details and global features i...Medical image analysis based on deep learning has become an important technical requirement in the field of smart healthcare.In view of the difficulties in collaborative modeling of local details and global features in multimodal image analysis of ophthalmology,as well as the existence of information redundancy in cross-modal data fusion,this paper proposes amultimodal fusion framework based on cross-modal collaboration and weighted attention mechanism.In terms of feature extraction,the framework collaboratively extracts local fine-grained features and global structural dependencies through a parallel dual-branch architecture,overcoming the limitations of traditional single-modality models in capturing either local or global information;in terms of fusion strategy,the framework innovatively designs a cross-modal dynamic fusion strategy,combining overlappingmulti-head self-attention modules with a bidirectional feature alignment mechanism,addressing the bottlenecks of low feature interaction efficiency and excessive attention fusion computations in traditional parallel fusion,and further introduces cross-domain local integration technology,which enhances the representation ability of the lesion area through pixel-level feature recalibration and optimizes the diagnostic robustness of complex cases.Experiments show that the framework exhibits excellent feature expression and generalization performance in cross-domain scenarios of ophthalmic medical images and natural images,providing a high-precision,low-redundancy fusion paradigm for multimodal medical image analysis,and promoting the upgrade of intelligent diagnosis and treatment fromsingle-modal static analysis to dynamic decision-making.展开更多
Deep Learning-based systems for Finger vein recognition have gained rising attention in recent years due to improved efficiency and enhanced security.The performance of existing CNN-based methods is limited by the pun...Deep Learning-based systems for Finger vein recognition have gained rising attention in recent years due to improved efficiency and enhanced security.The performance of existing CNN-based methods is limited by the puny generalization of learned features and deficiency of the finger vein image training data.Considering the concerns of existing methods,in this work,a simplified deep transfer learning-based framework for finger-vein recognition is developed using an EfficientNet model of deep learning with a self-attention mechanism.Data augmentation using various geometrical methods is employed to address the problem of training data shortage required for a deep learning model.The proposed model is tested using K-fold cross-validation on three publicly available datasets:HKPU,FVUSM,and SDUMLA.Also,the developed network is compared with other modern deep nets to check its effectiveness.In addition,a comparison of the proposed method with other existing Finger vein recognition(FVR)methods is also done.The experimental results exhibited superior recognition accuracy of the proposed method compared to other existing methods.In addition,the developed method proves to be more effective and less sophisticated at extracting robust features.The proposed EffAttenNet achieves an accuracy of 98.14%on HKPU,99.03%on FVUSM,and 99.50%on SDUMLA databases.展开更多
For image compression sensing reconstruction,most algorithms use the method of reconstructing image blocks one by one and stacking many convolutional layers,which usually have defects of obvious block effects,high com...For image compression sensing reconstruction,most algorithms use the method of reconstructing image blocks one by one and stacking many convolutional layers,which usually have defects of obvious block effects,high computational complexity,and long reconstruction time.An image compressed sensing reconstruction network based on self-attention mechanism(SAMNet)was proposed.For the compressed sampling,self-attention convolution was designed,which was conducive to capturing richer features,so that the compressed sensing measurement value retained more image structure information.For the reconstruction,a self-attention mechanism was introduced in the convolutional neural network.A reconstruction network including residual blocks,bottleneck transformer(BoTNet),and dense blocks was proposed,which strengthened the transfer of image features and reduced the amount of parameters dramatically.Under the Set5 dataset,when the measurement rates are 0.01,0.04,0.10,and 0.25,the average peak signal-to-noise ratio(PSNR)of SAMNet is improved by 1.27,1.23,0.50,and 0.15 dB,respectively,compared to the CSNet+.The running time of reconstructing a 256×256 image is reduced by 0.1473,0.1789,0.2310,and 0.2524 s compared to ReconNet.Experimental results showed that SAMNet improved the quality of reconstructed images and reduced the reconstruction time.展开更多
As the complexity of scientific satellite missions increases,the requirements for their magnetic fields,magnetic field fluctuations,and even magnetic field gradients and variations become increasingly stringent.Additi...As the complexity of scientific satellite missions increases,the requirements for their magnetic fields,magnetic field fluctuations,and even magnetic field gradients and variations become increasingly stringent.Additionally,there is a growing need to address the alternating magnetic fields produced by the spacecraft itself.This paper introduces a novel modeling method for spacecraft magnetic dipoles using an integrated self-attention mechanism and a transformer combined with Kolmogorov-Arnold Networks.The self-attention mechanism captures correlations among globally sparse data,establishing dependencies b.etween sparse magnetometer readings.Concurrently,the Kolmogorov-Arnold Network,proficient in modeling implicit numerical relationships between data features,enhances the ability to learn subtle patterns.Comparative experiments validate the capability of the proposed method to precisely model magnetic dipoles,achieving maximum Root Mean Square Errors of 24.06 mA·m^(2)and 0.32 cm for size and location modeling,respectively.The spacecraft magnetic model established using this method accurately computes magnetic fields and alternating magnetic fields at designated surfaces or points.This approach facilitates the rapid and precise construction of individual and complete spacecraft magnetic models,enabling the verification of magnetic specifications from the spacecraft design phase.展开更多
The development of deep learning has made non-biochemical methods for molecular property prediction screening a reality,which can increase the experimental speed and reduce the experimental cost of relevant experiment...The development of deep learning has made non-biochemical methods for molecular property prediction screening a reality,which can increase the experimental speed and reduce the experimental cost of relevant experiments.There are currently two main approaches to representing molecules:(a)representing molecules by fixing molecular descriptors,and(b)representing molecules by graph convolutional neural networks.Currently,both of these Representative methods have achieved some results in their respective experiments.Based on past efforts,we propose a Dual Self-attention Fusion Message Neural Network(DSFMNN).DSFMNN uses a combination of dual self-attention mechanism and graph convolutional neural network.Advantages of DSFMNN:(1)The dual self-attention mechanism focuses not only on the relationship between individual subunits in a molecule but also on the relationship between the atoms and chemical bonds contained in each subunit.(2)On the directed molecular graph,a message delivery approach centered on directed molecular bonds is used.We test the performance of the model on eight publicly available datasets and compare the performance with several models.Based on the current experimental results,DSFMNN has superior performance compared to previous models on the datasets applied in this paper.展开更多
Currently,most trains are equipped with dedicated cameras for capturing pantograph videos.Pantographs are core to the high-speed-railway pantograph-catenary system,and their failure directly affects the normal operati...Currently,most trains are equipped with dedicated cameras for capturing pantograph videos.Pantographs are core to the high-speed-railway pantograph-catenary system,and their failure directly affects the normal operation of high-speed trains.However,given the complex and variable real-world operational conditions of high-speed railways,there is no real-time and robust pantograph fault-detection method capable of handling large volumes of surveillance video.Hence,it is of paramount importance to maintain real-time monitoring and analysis of pantographs.Our study presents a real-time intelligent detection technology for identifying faults in high-speed railway pantographs,utilizing a fusion of self-attention and convolution features.We delved into lightweight multi-scale feature-extraction and fault-detection models based on deep learning to detect pantograph anomalies.Compared with traditional methods,this approach achieves high recall and accuracy in pantograph recognition,accurately pinpointing issues like discharge sparks,pantograph horns,and carbon pantograph-slide malfunctions.After experimentation and validation with actual surveillance videos of electric multiple-unit train,our algorithmic model demonstrates real-time,high-accuracy performance even under complex operational conditions.展开更多
Synaptic plasticity is essential for maintaining neuronal function in the central nervous system and serves as a critical indicator of the effects of neurodegenerative disease.Glaucoma directly impairs retinal ganglio...Synaptic plasticity is essential for maintaining neuronal function in the central nervous system and serves as a critical indicator of the effects of neurodegenerative disease.Glaucoma directly impairs retinal ganglion cells and their axons,leading to axonal transport dysfuntion,subsequently causing secondary damage to anterior or posterior ends of the visual system.Accordingly,recent evidence indicates that glaucoma is a degenerative disease of the central nervous system that causes damage throughout the visual pathway.However,the effects of glaucoma on synaptic plasticity in the primary visual cortex remain unclear.In this study,we established a mouse model of unilateral chronic ocular hypertension by injecting magnetic microbeads into the anterior chamber of one eye.We found that,after 4 weeks of chronic ocular hypertension,the neuronal somas were smaller in the superior colliculus and lateral geniculate body regions of the brain contralateral to the affected eye.This was accompanied by glial cell activation and increased expression of inflammatory factors.After 8 weeks of ocular hypertension,we observed a reduction in the number of excitatory and inhibitory synapses,dendritic spines,and activation of glial cells in the primary visual cortex contralateral to the affected eye.These findings suggest that glaucoma not only directly damages the retina but also induces alterations in synapses and dendritic spines in the primary visual cortex,providing new insights into the pathogenesis of glaucoma.展开更多
AIM:To compare the visual outcomes between bilateral implantation of Tecnis ZXR00 extended depth-of-focus(EDOF)intraocular lenses(IOLs)and mixed implantation of Tecnis ZXR00(EDOF)with Tecnis ZMB00(bifocal)IOLs.METHODS...AIM:To compare the visual outcomes between bilateral implantation of Tecnis ZXR00 extended depth-of-focus(EDOF)intraocular lenses(IOLs)and mixed implantation of Tecnis ZXR00(EDOF)with Tecnis ZMB00(bifocal)IOLs.METHODS:This postoperative cross-sectional study enrolled patients who underwent phacoemulsification combined with IOL implantation.Patients were divided into two groups:the bilateral ZXR00 group(ZXR00-only group)and the mixed IOL group(ZXR00+ZMB00 group).Primary outcome measures included uncorrected and corrected distance visual acuity(UDVA,CDVA),uncorrected and distance-corrected near visual acuity(UNVA,DCNVA),uncorrected and distance-corrected intermediate visual acuity(UIVA,DCIVA),and defocus curves.Secondary outcome measures were visual quality,spectacle independence,patient satisfaction,photic phenomena,and stereopsis.RESULTS:A total of 47 patients(94 eyes)were included,with 26 patients(11 males,15 females)in the ZXR00-only group(mean age:62.73±7.24y)and 21 patients(7 males,14 females)in the mixed group(mean age:65.71±9.16y).There was no statistically significant difference in age between the two groups(P=0.218).The mixed group showed significantly better binocular DCNVA compared to the ZXR00-only group(P=0.002).Defocus curve analysis revealed that the mixed group exhibited superior performance at−2.5 to−4.0 D but inferior performance at−0.5 and−1.5 D.Near stereoacuity was significantly poorer in the mixed group(Randot:5.589±0.744 vs 6.240±0.394 ln arcsec;Contour:4.966±0.973 vs 5.740±0.833 ln arcsec;both P<0.01).Both groups achieved high levels of spectacle independence and patient satisfaction,with no significant differences in photic phenomena or questionnaire scores.CONCLUSION:Mixed implantation of EDOF and bifocal IOLs improve near visual acuity but may compromise near stereopsis.This approach provides a viable option for patients prioritizing near vision;however,caution is recommended for individuals requiring fine stereoscopic vision for daily or professional tasks.展开更多
AIM:To evaluate the clinical characteristics and risk factors associated with visual prognosis in patients with open globe injuries(OGIs)treated at Vietnam National Eye Hospital.METHODS:A prospective observational stu...AIM:To evaluate the clinical characteristics and risk factors associated with visual prognosis in patients with open globe injuries(OGIs)treated at Vietnam National Eye Hospital.METHODS:A prospective observational study included patients with OGIs treated between June 2023 and June 2024.Data on demographics,injury features,and clinical findings were extracted from medical records.Poor visual outcome was defined as final best-corrected visual acuity(BCVA)worse than 20/400 or no light perception.Multivariable logistic regression was performed to identify independent risk factors.RESULTS:Among 509 patients(636 eyes),the mean age was 35.13y(range 20–51y),and 67.6%were male.After treatment,the proportion of eyes achieving≥20/40 increased from 12.6%to 42.1%,while no light perception decreased from 29.1%to 9.4%.Independent predictors of poor visual outcomes included delayed admission[>4h,odds ratio(OR)=3.33,95%confidence intervals(CI):1.76–6.33,P<0.001],Zone III injury(OR=5.90,95%CI:2.85–12.24,P<0.001),wound length>10 mm(OR=2.59,95%CI:1.60–4.18,P<0.001),relative afferent pupillary defect(RAPD,OR=1.65,95%CI:1.03–2.64,P=0.039),endophthalmitis(OR=1.75,95%CI:1.01–3.03,P=0.047),retinal detachment(OR=3.32,95%CI:2.02–5.45,P<0.001),and eyelid lacerations(OR=1.94,95%CI:1.13–3.33,P=0.016)associated with OGIs.Vitreous hemorrhage(OR=0.44,95%CI:0.22–0.89,P=0.023)was associated with better outcomes,and female gender appeared protective.CONCLUSION:Poor visual outcomes remain common after OGIs,despite improve visual acuity in many cases.Several clinical and injury-related factors are strongly associated with prognosis.Early recognition of these predictors can support risk stratification and improve trauma care in similar settings.展开更多
The intersection of visual impairment and mental health has profound effects on quality of life and warrants attention from healthcare providers,educators,and policymakers.With 20 million children under the age of 14 ...The intersection of visual impairment and mental health has profound effects on quality of life and warrants attention from healthcare providers,educators,and policymakers.With 20 million children under the age of 14 affected globally,older adults also experience significant psychological impact including depression,anxiety,and cognitive impairment.The implications of vision-related challenges extend far beyond mere sight.Depression and anxiety,exacerbated by social isolation and reduced physical activity,underscore the need for comprehensive interventions that address both medical and psychosocial dimensions.By recognizing the profound impact of ocular morbidities like strabismus,myopia,glaucoma,and age-related macular degeneration on mental health and investing in effective treatments and inclusive practices,society can pave the way for a healthier,more equitable future for affected individuals.There is evidence that myopic children experience a higher prevalence of depressive symptoms compared to their normal peers,and interventions like the correction of strabismus can enhance psychological outcome-demonstrating the value of an integrated management approach.展开更多
AIM:To investigate the clinical characteristics and treatment outcomes,including visual function and overall survival(OS)of patients with ocular adnexal diffuse large B-cell lymphoma(OA-DLBCL).METHODS:This retrospecti...AIM:To investigate the clinical characteristics and treatment outcomes,including visual function and overall survival(OS)of patients with ocular adnexal diffuse large B-cell lymphoma(OA-DLBCL).METHODS:This retrospective cohort study enrolled 29 patients diagnosed with OA-DLBCL based on histopathological biopsy between 2006 and 2023.Patients were stratified into two subgroups:primary OA-DLBCL(no prior history of lymphoma)and secondary OA-DLBCL(history of DLBCL at non-ocular adnexal sites).OS was defined as the time interval from OA-DLBCL diagnosis to death from any cause.Survival analysis was performed using the Kaplan–Meier method,and prognostic factors affecting OS were identified using multivariate Cox proportional hazards regression with a stepwise selection approach.RESULTS:The cohort included 24 patients with primary OA-DLBCL(13 males,11 females;mean age:61.36±18.29y)and 5 patients with secondary OA-DLBCL(2 males,3 females;mean age:50.94±18.17y).Among the primary OA-DLBCL subgroup,12 patients(50%)presented with advanced disease(Ann Arbor stage IIIE–IV),and 16 patients(66%)were classified as T4 disease according to the tumor-node-metastasis(TNM)staging system.The mean final visual acuity was 1.72±1.10 in the primary group and 0.90±1.18 in the secondary group.The 5-year OS rate for the entire cohort was 27.7%.Multivariate analysis identified five factors significantly associated with poor survival outcomes:epiphora[adjusted hazard ratio(aHR),36.95],atherosclerotic cardiovascular disease(aHR,10.08),human immunodeficiency virus(HIV)infection(aHR,12.47),M1 stage(aHR,6.99),and secondary OA-DLBCL(aHR,6.03;all P<0.05).The median OS was 1.68y for primary OA-DLBCL and 1.12y for secondary OA-DLBCL.CONCLUSION:A substantial proportion of patients with primary OA-DLBCL present with advanced-stage disease at diagnosis.Epiphora,atherosclerotic cardiovascular disease,HIV infection,M1 stage,and secondary OA-DLBCL are independent prognostic factors for poor survival outcomes.These findings emphasize the urgent need for optimized therapeutic strategies and early screening protocols to improve the management of OA-DLBCL,particularly in developing countries.展开更多
基金supported by the National Natural Science Foundation of China(No.52277055).
文摘Traditional data-driven fault diagnosis methods depend on expert experience to manually extract effective fault features of signals,which has certain limitations.Conversely,deep learning techniques have gained prominence as a central focus of research in the field of fault diagnosis by strong fault feature extraction ability and end-to-end fault diagnosis efficiency.Recently,utilizing the respective advantages of convolution neural network(CNN)and Transformer in local and global feature extraction,research on cooperating the two have demonstrated promise in the field of fault diagnosis.However,the cross-channel convolution mechanism in CNN and the self-attention calculations in Transformer contribute to excessive complexity in the cooperative model.This complexity results in high computational costs and limited industrial applicability.To tackle the above challenges,this paper proposes a lightweight CNN-Transformer named as SEFormer for rotating machinery fault diagnosis.First,a separable multiscale depthwise convolution block is designed to extract and integrate multiscale feature information from different channel dimensions of vibration signals.Then,an efficient self-attention block is developed to capture critical fine-grained features of the signal from a global perspective.Finally,experimental results on the planetary gearbox dataset and themotor roller bearing dataset prove that the proposed framework can balance the advantages of robustness,generalization and lightweight compared to recent state-of-the-art fault diagnosis models based on CNN and Transformer.This study presents a feasible strategy for developing a lightweight rotating machinery fault diagnosis framework aimed at economical deployment.
基金supported by the National Key Research and Development Program of China(No.2021YFA0715900).
文摘Located in northern China,the Hetao Plain is an important agro-economic zone and population centre.The deterioration of local groundwater quality has had a serious impact on human health and economic development.Nowadays,the groundwater vulnerability assessment(GVA)has become an essential task to identify the current status and development trend of groundwater quality.In this study,the Convolutional Neural Network(CNN)and Long Short-Term Memory(LSTM)models are integrated to realize the spatio-temporal prediction of regional groundwater vulnerability by introducing the Self-attention mechanism.The study firstly builds the CNN-LSTM modelwith self-attention(SA)mechanism and evaluates the prediction accuracy of the model for groundwater vulnerability compared to other common machine learning models such as Support Vector Machine(SVM),Random Forest(RF),and Extreme Gradient Boosting(XGBoost).The results indicate that the CNNLSTM model outperforms thesemodels,demonstrating its significance in groundwater vulnerability assessment.It can be posited that the predictions indicate an increased risk of groundwater vulnerability in the study area over the coming years.This increase can be attributed to the synergistic impact of global climate anomalies and intensified local human activities.Moreover,the overall groundwater vulnerability risk in the entire region has increased,evident fromboth the notably high value and standard deviation.This suggests that the spatial variability of groundwater vulnerability in the area is expected to expand in the future due to the sustained progression of climate change and human activities.The model can be optimized for diverse applications across regional environmental assessment,pollution prediction,and risk statistics.This study holds particular significance for ecological protection and groundwater resource management.
基金supported by the National Natural Science Foundation of China under Grant 62177029the Postgraduate Research&Practice Innovation Program of Jiangsu Province(KYCX21_0740),China.
文摘Visual object tracking plays a crucial role in computer vision.In recent years,researchers have proposed various methods to achieve high-performance object tracking.Among these,methods based on Transformers have become a research hotspot due to their ability to globally model and contextualize information.However,current Transformer-based object tracking methods still face challenges such as low tracking accuracy and the presence of redundant feature information.In this paper,we introduce self-calibration multi-head self-attention Transformer(SMSTracker)as a solution to these challenges.It employs a hybrid tensor decomposition self-organizing multihead self-attention transformermechanism,which not only compresses and accelerates Transformer operations but also significantly reduces redundant data,thereby enhancing the accuracy and efficiency of tracking.Additionally,we introduce a self-calibration attention fusion block to resolve common issues of attention ambiguities and inconsistencies found in traditional trackingmethods,ensuring the stability and reliability of tracking performance across various scenarios.By integrating a hybrid tensor decomposition approach with a self-organizingmulti-head self-attentive transformer mechanism,SMSTracker enhances the efficiency and accuracy of the tracking process.Experimental results show that SMSTracker achieves competitive performance in visual object tracking,promising more robust and efficient tracking systems,demonstrating its potential to providemore robust and efficient tracking solutions in real-world applications.
文摘Lightweight convolutional neural networks(CNNs)have simple structures but struggle to comprehensively and accurately extract important semantic information from images.While attention mechanisms can enhance CNNs by learning distinctive representations,most existing spatial and hybrid attention methods focus on local regions with extensive parameters,making them unsuitable for lightweight CNNs.In this paper,we propose a self-attention mechanism tailored for lightweight networks,namely the brief self-attention module(BSAM).BSAM consists of the brief spatial attention(BSA)and advanced channel attention blocks.Unlike conventional self-attention methods with many parameters,our BSA block improves the performance of lightweight networks by effectively learning global semantic representations.Moreover,BSAM can be seamlessly integrated into lightweight CNNs for end-to-end training,maintaining the network’s lightweight and mobile characteristics.We validate the effectiveness of the proposed method on image classification tasks using the Food-101,Caltech-256,and Mini-ImageNet datasets.
基金funded by the Deanship of Scientific Research(DSR)at King Abdulaziz University,Jeddah,Saudi Arabia under Grant No.(GPIP:1055-829-2024).
文摘A healthy brain is vital to every person since the brain controls every movement and emotion.Sometimes,some brain cells grow unexpectedly to be uncontrollable and cancerous.These cancerous cells are called brain tumors.For diagnosed patients,their lives depend mainly on the early diagnosis of these tumors to provide suitable treatment plans.Nowadays,Physicians and radiologists rely on Magnetic Resonance Imaging(MRI)pictures for their clinical evaluations of brain tumors.These evaluations are time-consuming,expensive,and require expertise with high skills to provide an accurate diagnosis.Scholars and industrials have recently partnered to implement automatic solutions to diagnose the disease with high accuracy.Due to their accuracy,some of these solutions depend on deep-learning(DL)methodologies.These techniques have become important due to their roles in the diagnosis process,which includes identification and classification.Therefore,there is a need for a solid and robust approach based on a deep-learning method to diagnose brain tumors.The purpose of this study is to develop an intelligent automatic framework for brain tumor diagnosis.The proposed solution is based on a novel dense dynamic residual self-attention transfer adaptive learning fusion approach(NDDRSATALFA),carried over two implemented deep-learning networks:VGG19 and UNET to identify and classify brain tumors.In addition,this solution applies a transfer learning approach to exchange extracted features and data within the two neural networks.The presented framework is trained,validated,and tested on six public datasets of MRIs to detect brain tumors and categorize these tumors into three suitable classes,which are glioma,meningioma,and pituitary.The proposed framework yielded remarkable findings on variously evaluated performance indicators:99.32%accuracy,98.74%sensitivity,98.89%specificity,99.01%Dice,98.93%Area Under the Curve(AUC),and 99.81%F1-score.In addition,a comparative analysis with recent state-of-the-art methods was performed and according to the comparative analysis,NDDRSATALFA shows an admirable level of reliability in simplifying the timely identification of diverse brain tumors.Moreover,this framework can be applied by healthcare providers to assist radiologists,pathologists,and physicians in their evaluations.The attained outcomes open doors for advanced automatic solutions that improve clinical evaluations and provide reasonable treatment plans.
基金funded by the Ongoing Research Funding Program(ORF-2025-102),King Saud University,Riyadh,Saudi Arabiaby the Science and Technology Research Programof Chongqing Municipal Education Commission(Grant No.KJQN202400813)by the Graduate Research Innovation Project(Grant Nos.yjscxx2025-269-193 and CYS25618).
文摘Medical image analysis based on deep learning has become an important technical requirement in the field of smart healthcare.In view of the difficulties in collaborative modeling of local details and global features in multimodal image analysis of ophthalmology,as well as the existence of information redundancy in cross-modal data fusion,this paper proposes amultimodal fusion framework based on cross-modal collaboration and weighted attention mechanism.In terms of feature extraction,the framework collaboratively extracts local fine-grained features and global structural dependencies through a parallel dual-branch architecture,overcoming the limitations of traditional single-modality models in capturing either local or global information;in terms of fusion strategy,the framework innovatively designs a cross-modal dynamic fusion strategy,combining overlappingmulti-head self-attention modules with a bidirectional feature alignment mechanism,addressing the bottlenecks of low feature interaction efficiency and excessive attention fusion computations in traditional parallel fusion,and further introduces cross-domain local integration technology,which enhances the representation ability of the lesion area through pixel-level feature recalibration and optimizes the diagnostic robustness of complex cases.Experiments show that the framework exhibits excellent feature expression and generalization performance in cross-domain scenarios of ophthalmic medical images and natural images,providing a high-precision,low-redundancy fusion paradigm for multimodal medical image analysis,and promoting the upgrade of intelligent diagnosis and treatment fromsingle-modal static analysis to dynamic decision-making.
文摘Deep Learning-based systems for Finger vein recognition have gained rising attention in recent years due to improved efficiency and enhanced security.The performance of existing CNN-based methods is limited by the puny generalization of learned features and deficiency of the finger vein image training data.Considering the concerns of existing methods,in this work,a simplified deep transfer learning-based framework for finger-vein recognition is developed using an EfficientNet model of deep learning with a self-attention mechanism.Data augmentation using various geometrical methods is employed to address the problem of training data shortage required for a deep learning model.The proposed model is tested using K-fold cross-validation on three publicly available datasets:HKPU,FVUSM,and SDUMLA.Also,the developed network is compared with other modern deep nets to check its effectiveness.In addition,a comparison of the proposed method with other existing Finger vein recognition(FVR)methods is also done.The experimental results exhibited superior recognition accuracy of the proposed method compared to other existing methods.In addition,the developed method proves to be more effective and less sophisticated at extracting robust features.The proposed EffAttenNet achieves an accuracy of 98.14%on HKPU,99.03%on FVUSM,and 99.50%on SDUMLA databases.
基金supported by National Natural Science Foundation of China(Nos.61261016,61661025)Science and Technology Plan of Gansu Province(No.20JR10RA273).
文摘For image compression sensing reconstruction,most algorithms use the method of reconstructing image blocks one by one and stacking many convolutional layers,which usually have defects of obvious block effects,high computational complexity,and long reconstruction time.An image compressed sensing reconstruction network based on self-attention mechanism(SAMNet)was proposed.For the compressed sampling,self-attention convolution was designed,which was conducive to capturing richer features,so that the compressed sensing measurement value retained more image structure information.For the reconstruction,a self-attention mechanism was introduced in the convolutional neural network.A reconstruction network including residual blocks,bottleneck transformer(BoTNet),and dense blocks was proposed,which strengthened the transfer of image features and reduced the amount of parameters dramatically.Under the Set5 dataset,when the measurement rates are 0.01,0.04,0.10,and 0.25,the average peak signal-to-noise ratio(PSNR)of SAMNet is improved by 1.27,1.23,0.50,and 0.15 dB,respectively,compared to the CSNet+.The running time of reconstructing a 256×256 image is reduced by 0.1473,0.1789,0.2310,and 0.2524 s compared to ReconNet.Experimental results showed that SAMNet improved the quality of reconstructed images and reduced the reconstruction time.
基金supported by the National Key Research and Development Program of China(2020YFC2200901)。
文摘As the complexity of scientific satellite missions increases,the requirements for their magnetic fields,magnetic field fluctuations,and even magnetic field gradients and variations become increasingly stringent.Additionally,there is a growing need to address the alternating magnetic fields produced by the spacecraft itself.This paper introduces a novel modeling method for spacecraft magnetic dipoles using an integrated self-attention mechanism and a transformer combined with Kolmogorov-Arnold Networks.The self-attention mechanism captures correlations among globally sparse data,establishing dependencies b.etween sparse magnetometer readings.Concurrently,the Kolmogorov-Arnold Network,proficient in modeling implicit numerical relationships between data features,enhances the ability to learn subtle patterns.Comparative experiments validate the capability of the proposed method to precisely model magnetic dipoles,achieving maximum Root Mean Square Errors of 24.06 mA·m^(2)and 0.32 cm for size and location modeling,respectively.The spacecraft magnetic model established using this method accurately computes magnetic fields and alternating magnetic fields at designated surfaces or points.This approach facilitates the rapid and precise construction of individual and complete spacecraft magnetic models,enabling the verification of magnetic specifications from the spacecraft design phase.
文摘The development of deep learning has made non-biochemical methods for molecular property prediction screening a reality,which can increase the experimental speed and reduce the experimental cost of relevant experiments.There are currently two main approaches to representing molecules:(a)representing molecules by fixing molecular descriptors,and(b)representing molecules by graph convolutional neural networks.Currently,both of these Representative methods have achieved some results in their respective experiments.Based on past efforts,we propose a Dual Self-attention Fusion Message Neural Network(DSFMNN).DSFMNN uses a combination of dual self-attention mechanism and graph convolutional neural network.Advantages of DSFMNN:(1)The dual self-attention mechanism focuses not only on the relationship between individual subunits in a molecule but also on the relationship between the atoms and chemical bonds contained in each subunit.(2)On the directed molecular graph,a message delivery approach centered on directed molecular bonds is used.We test the performance of the model on eight publicly available datasets and compare the performance with several models.Based on the current experimental results,DSFMNN has superior performance compared to previous models on the datasets applied in this paper.
基金supported by the National Key R&D Program of China(No.2022YFB4301102).
文摘Currently,most trains are equipped with dedicated cameras for capturing pantograph videos.Pantographs are core to the high-speed-railway pantograph-catenary system,and their failure directly affects the normal operation of high-speed trains.However,given the complex and variable real-world operational conditions of high-speed railways,there is no real-time and robust pantograph fault-detection method capable of handling large volumes of surveillance video.Hence,it is of paramount importance to maintain real-time monitoring and analysis of pantographs.Our study presents a real-time intelligent detection technology for identifying faults in high-speed railway pantographs,utilizing a fusion of self-attention and convolution features.We delved into lightweight multi-scale feature-extraction and fault-detection models based on deep learning to detect pantograph anomalies.Compared with traditional methods,this approach achieves high recall and accuracy in pantograph recognition,accurately pinpointing issues like discharge sparks,pantograph horns,and carbon pantograph-slide malfunctions.After experimentation and validation with actual surveillance videos of electric multiple-unit train,our algorithmic model demonstrates real-time,high-accuracy performance even under complex operational conditions.
基金supported by the National Natural Science Foundation of China,No.82271115(to MY).
文摘Synaptic plasticity is essential for maintaining neuronal function in the central nervous system and serves as a critical indicator of the effects of neurodegenerative disease.Glaucoma directly impairs retinal ganglion cells and their axons,leading to axonal transport dysfuntion,subsequently causing secondary damage to anterior or posterior ends of the visual system.Accordingly,recent evidence indicates that glaucoma is a degenerative disease of the central nervous system that causes damage throughout the visual pathway.However,the effects of glaucoma on synaptic plasticity in the primary visual cortex remain unclear.In this study,we established a mouse model of unilateral chronic ocular hypertension by injecting magnetic microbeads into the anterior chamber of one eye.We found that,after 4 weeks of chronic ocular hypertension,the neuronal somas were smaller in the superior colliculus and lateral geniculate body regions of the brain contralateral to the affected eye.This was accompanied by glial cell activation and increased expression of inflammatory factors.After 8 weeks of ocular hypertension,we observed a reduction in the number of excitatory and inhibitory synapses,dendritic spines,and activation of glial cells in the primary visual cortex contralateral to the affected eye.These findings suggest that glaucoma not only directly damages the retina but also induces alterations in synapses and dendritic spines in the primary visual cortex,providing new insights into the pathogenesis of glaucoma.
文摘AIM:To compare the visual outcomes between bilateral implantation of Tecnis ZXR00 extended depth-of-focus(EDOF)intraocular lenses(IOLs)and mixed implantation of Tecnis ZXR00(EDOF)with Tecnis ZMB00(bifocal)IOLs.METHODS:This postoperative cross-sectional study enrolled patients who underwent phacoemulsification combined with IOL implantation.Patients were divided into two groups:the bilateral ZXR00 group(ZXR00-only group)and the mixed IOL group(ZXR00+ZMB00 group).Primary outcome measures included uncorrected and corrected distance visual acuity(UDVA,CDVA),uncorrected and distance-corrected near visual acuity(UNVA,DCNVA),uncorrected and distance-corrected intermediate visual acuity(UIVA,DCIVA),and defocus curves.Secondary outcome measures were visual quality,spectacle independence,patient satisfaction,photic phenomena,and stereopsis.RESULTS:A total of 47 patients(94 eyes)were included,with 26 patients(11 males,15 females)in the ZXR00-only group(mean age:62.73±7.24y)and 21 patients(7 males,14 females)in the mixed group(mean age:65.71±9.16y).There was no statistically significant difference in age between the two groups(P=0.218).The mixed group showed significantly better binocular DCNVA compared to the ZXR00-only group(P=0.002).Defocus curve analysis revealed that the mixed group exhibited superior performance at−2.5 to−4.0 D but inferior performance at−0.5 and−1.5 D.Near stereoacuity was significantly poorer in the mixed group(Randot:5.589±0.744 vs 6.240±0.394 ln arcsec;Contour:4.966±0.973 vs 5.740±0.833 ln arcsec;both P<0.01).Both groups achieved high levels of spectacle independence and patient satisfaction,with no significant differences in photic phenomena or questionnaire scores.CONCLUSION:Mixed implantation of EDOF and bifocal IOLs improve near visual acuity but may compromise near stereopsis.This approach provides a viable option for patients prioritizing near vision;however,caution is recommended for individuals requiring fine stereoscopic vision for daily or professional tasks.
文摘AIM:To evaluate the clinical characteristics and risk factors associated with visual prognosis in patients with open globe injuries(OGIs)treated at Vietnam National Eye Hospital.METHODS:A prospective observational study included patients with OGIs treated between June 2023 and June 2024.Data on demographics,injury features,and clinical findings were extracted from medical records.Poor visual outcome was defined as final best-corrected visual acuity(BCVA)worse than 20/400 or no light perception.Multivariable logistic regression was performed to identify independent risk factors.RESULTS:Among 509 patients(636 eyes),the mean age was 35.13y(range 20–51y),and 67.6%were male.After treatment,the proportion of eyes achieving≥20/40 increased from 12.6%to 42.1%,while no light perception decreased from 29.1%to 9.4%.Independent predictors of poor visual outcomes included delayed admission[>4h,odds ratio(OR)=3.33,95%confidence intervals(CI):1.76–6.33,P<0.001],Zone III injury(OR=5.90,95%CI:2.85–12.24,P<0.001),wound length>10 mm(OR=2.59,95%CI:1.60–4.18,P<0.001),relative afferent pupillary defect(RAPD,OR=1.65,95%CI:1.03–2.64,P=0.039),endophthalmitis(OR=1.75,95%CI:1.01–3.03,P=0.047),retinal detachment(OR=3.32,95%CI:2.02–5.45,P<0.001),and eyelid lacerations(OR=1.94,95%CI:1.13–3.33,P=0.016)associated with OGIs.Vitreous hemorrhage(OR=0.44,95%CI:0.22–0.89,P=0.023)was associated with better outcomes,and female gender appeared protective.CONCLUSION:Poor visual outcomes remain common after OGIs,despite improve visual acuity in many cases.Several clinical and injury-related factors are strongly associated with prognosis.Early recognition of these predictors can support risk stratification and improve trauma care in similar settings.
文摘The intersection of visual impairment and mental health has profound effects on quality of life and warrants attention from healthcare providers,educators,and policymakers.With 20 million children under the age of 14 affected globally,older adults also experience significant psychological impact including depression,anxiety,and cognitive impairment.The implications of vision-related challenges extend far beyond mere sight.Depression and anxiety,exacerbated by social isolation and reduced physical activity,underscore the need for comprehensive interventions that address both medical and psychosocial dimensions.By recognizing the profound impact of ocular morbidities like strabismus,myopia,glaucoma,and age-related macular degeneration on mental health and investing in effective treatments and inclusive practices,society can pave the way for a healthier,more equitable future for affected individuals.There is evidence that myopic children experience a higher prevalence of depressive symptoms compared to their normal peers,and interventions like the correction of strabismus can enhance psychological outcome-demonstrating the value of an integrated management approach.
基金Supported by the Faculty of Medicine,Prince of Songkla University.Wainipitapong S has received grants from the Faculty of Medicine,Prince of Songkla University。
文摘AIM:To investigate the clinical characteristics and treatment outcomes,including visual function and overall survival(OS)of patients with ocular adnexal diffuse large B-cell lymphoma(OA-DLBCL).METHODS:This retrospective cohort study enrolled 29 patients diagnosed with OA-DLBCL based on histopathological biopsy between 2006 and 2023.Patients were stratified into two subgroups:primary OA-DLBCL(no prior history of lymphoma)and secondary OA-DLBCL(history of DLBCL at non-ocular adnexal sites).OS was defined as the time interval from OA-DLBCL diagnosis to death from any cause.Survival analysis was performed using the Kaplan–Meier method,and prognostic factors affecting OS were identified using multivariate Cox proportional hazards regression with a stepwise selection approach.RESULTS:The cohort included 24 patients with primary OA-DLBCL(13 males,11 females;mean age:61.36±18.29y)and 5 patients with secondary OA-DLBCL(2 males,3 females;mean age:50.94±18.17y).Among the primary OA-DLBCL subgroup,12 patients(50%)presented with advanced disease(Ann Arbor stage IIIE–IV),and 16 patients(66%)were classified as T4 disease according to the tumor-node-metastasis(TNM)staging system.The mean final visual acuity was 1.72±1.10 in the primary group and 0.90±1.18 in the secondary group.The 5-year OS rate for the entire cohort was 27.7%.Multivariate analysis identified five factors significantly associated with poor survival outcomes:epiphora[adjusted hazard ratio(aHR),36.95],atherosclerotic cardiovascular disease(aHR,10.08),human immunodeficiency virus(HIV)infection(aHR,12.47),M1 stage(aHR,6.99),and secondary OA-DLBCL(aHR,6.03;all P<0.05).The median OS was 1.68y for primary OA-DLBCL and 1.12y for secondary OA-DLBCL.CONCLUSION:A substantial proportion of patients with primary OA-DLBCL present with advanced-stage disease at diagnosis.Epiphora,atherosclerotic cardiovascular disease,HIV infection,M1 stage,and secondary OA-DLBCL are independent prognostic factors for poor survival outcomes.These findings emphasize the urgent need for optimized therapeutic strategies and early screening protocols to improve the management of OA-DLBCL,particularly in developing countries.
文摘随着手机短信成为人们日常生活交往的重要手段,垃圾短信的识别具有重要的现实意义.针对此提出一种结合TFIDF的self-attention-based Bi-LSTM的神经网络模型.该模型首先将短信文本以词向量的方式输入到Bi-LSTM层,经过特征提取并结合TFIDF和self-attention层的信息聚焦获得最后的特征向量,最后将特征向量通过Softmax分类器进行分类得到短信文本分类结果.实验结果表明,结合TFIDF的self-attention-based Bi-LSTM模型相比于传统分类模型的短信文本识别准确率提高了2.1%–4.6%,运行时间减少了0.6 s–10.2 s.