Accurate detection of driver fatigue is essential for improving road safety.This study investigates the effectiveness of using multimodal physiological signals for fatigue detection while incorporating uncertainty qua...Accurate detection of driver fatigue is essential for improving road safety.This study investigates the effectiveness of using multimodal physiological signals for fatigue detection while incorporating uncertainty quantification to enhance the reliability of predictions.Physiological signals,including Electrocardiogram(ECG),Galvanic Skin Response(GSR),and Electroencephalogram(EEG),were transformed into image representations and analyzed using pretrained deep neu-ral networks.The extracted features were classified through a feedforward neural network,and prediction reliability was assessed using uncertainty quantification techniques such as Monte Carlo Dropout(MCD),model ensembles,and combined approaches.Evaluation metrics included standard measures(sensitivity,specificity,precision,and accuracy)along with uncertainty-aware metrics such as uncertainty sensitivity and uncertainty precision.Across all evaluations,ECG-based models consistently demonstrated strong performance.The findings indicate that combining multimodal physi-ological signals,Transfer Learning(TL),and uncertainty quantification can significantly improve both the accuracy and trustworthiness of fatigue detection systems.This approach supports the development of more reliable driver assistance technologies aimed at preventing fatigue-related accidents.展开更多
In this paper,we propose a systematic approach for accelerating finite element-type methods by machine learning for the numerical solution of partial differential equations(PDEs).The main idea is to use a neural netwo...In this paper,we propose a systematic approach for accelerating finite element-type methods by machine learning for the numerical solution of partial differential equations(PDEs).The main idea is to use a neural network to learn the solution map of the PDEs and to do so in an element-wise fashion.This map takes input of the element geometry and the PDE’s parameters on that element,and gives output of two operators:(1)the in2out operator for inter-element communication,and(2)the in2sol operator(Green’s function)for element-wise solution recovery.A significant advantage of this approach is that,once trained,this network can be used for the numerical solution of the PDE for any domain geometry and any parameter distribution without retraining.Also,the training is significantly simpler since it is done on the element level instead on the entire domain.We call this approach element learning.This method is closely related to hybridizable discontinuous Galerkin(HDG)methods in the sense that the local solvers of HDG are replaced by machine learning approaches.Numerical tests are presented for an example PDE,the radiative transfer or radiation transport equation,in a variety of scenarios with idealized or realistic cloud fields,with smooth or sharp gradient in the cloud boundary transition.Under a fixed accuracy level of 10^(−3) in the relative L^(2) error,and polynomial degree p=6 in each element,we observe an approximately 5 to 10 times speed-up by element learning compared to a classical finite element-type method.展开更多
The spatial offset of bridge has a significant impact on the safety,comfort,and durability of high-speed railway(HSR)operations,so it is crucial to rapidly and effectively detect the spatial offset of operational HSR ...The spatial offset of bridge has a significant impact on the safety,comfort,and durability of high-speed railway(HSR)operations,so it is crucial to rapidly and effectively detect the spatial offset of operational HSR bridges.Drive-by monitoring of bridge uneven settlement demonstrates significant potential due to its practicality,cost-effectiveness,and efficiency.However,existing drive-by methods for detecting bridge offset have limitations such as reliance on a single data source,low detection accuracy,and the inability to identify lateral deformations of bridges.This paper proposes a novel drive-by inspection method for spatial offset of HSR bridge based on multi-source data fusion of comprehensive inspection train.Firstly,dung beetle optimizer-variational mode decomposition was employed to achieve adaptive decomposition of non-stationary dynamic signals,and explore the hidden temporal relationships in the data.Subsequently,a long short-term memory neural network was developed to achieve feature fusion of multi-source signal and accurate prediction of spatial settlement of HSR bridge.A dataset of track irregularities and CRH380A high-speed train responses was generated using a 3D train-track-bridge interaction model,and the accuracy and effectiveness of the proposed hybrid deep learning model were numerically validated.Finally,the reliability of the proposed drive-by inspection method was further validated by analyzing the actual measurement data obtained from comprehensive inspection train.The research findings indicate that the proposed approach enables rapid and accurate detection of spatial offset in HSR bridge,ensuring the long-term operational safety of HSR bridges.展开更多
Domain adaptation aims to reduce the distribution gap between the training data(source domain)and the target data.This enables effective predictions even for domains not seen during training.However,most conventional ...Domain adaptation aims to reduce the distribution gap between the training data(source domain)and the target data.This enables effective predictions even for domains not seen during training.However,most conventional domain adaptation methods assume a single source domain,making them less suitable for modern deep learning settings that rely on diverse and large-scale datasets.To address this limitation,recent research has focused on Multi-Source Domain Adaptation(MSDA),which aims to learn effectively from multiple source domains.In this paper,we propose Efficient Domain Transition for Multi-source(EDTM),a novel and efficient framework designed to tackle two major challenges in existing MSDA approaches:(1)integrating knowledge across different source domains and(2)aligning label distributions between source and target domains.EDTM leverages an ensemble-based classifier expert mechanism to enhance the contribution of source domains that are more similar to the target domain.To further stabilize the learning process and improve performance,we incorporate imitation learning into the training of the target model.In addition,Maximum Classifier Discrepancy(MCD)is employed to align class-wise label distributions between the source and target domains.Experiments were conducted using Digits-Five,one of the most representative benchmark datasets for MSDA.The results show that EDTM consistently outperforms existing methods in terms of average classification accuracy.Notably,EDTM achieved significantly higher performance on target domains such as Modified National Institute of Standards and Technolog with blended background images(MNIST-M)and Street View House Numbers(SVHN)datasets,demonstrating enhanced generalization compared to baseline approaches.Furthermore,an ablation study analyzing the contribution of each loss component validated the effectiveness of the framework,highlighting the importance of each module in achieving optimal performance.展开更多
Federated Learning(FL)has become a leading decentralized solution that enables multiple clients to train a model in a collaborative environment without directly sharing raw data,making it suitable for privacy-sensitiv...Federated Learning(FL)has become a leading decentralized solution that enables multiple clients to train a model in a collaborative environment without directly sharing raw data,making it suitable for privacy-sensitive applications such as healthcare,finance,and smart systems.As the field continues to evolve,the research field has become more complex and scattered,covering different system designs,training methods,and privacy techniques.This survey is organized around the three core challenges:how the data is distributed,how models are synchronized,and how to defend against attacks.It provides a structured and up-to-date review of FL research from 2023 to 2025,offering a unified taxonomy that categorizes works by data distribution(Horizontal FL,Vertical FL,Federated Transfer Learning,and Personalized FL),training synchronization(synchronous and asynchronous FL),optimization strategies,and threat models(data leakage and poisoning attacks).In particular,we summarize the latest contributions in Vertical FL frameworks for secure multi-party learning,communication-efficient Horizontal FL,and domain-adaptive Federated Transfer Learning.Furthermore,we examine synchronization techniques addressing system heterogeneity,including straggler mitigation in synchronous FL and staleness management in asynchronous FL.The survey covers security threats in FL,such as gradient inversion,membership inference,and poisoning attacks,as well as their defense strategies that include privacy-preserving aggregation and anomaly detection.The paper concludes by outlining unresolved issues and highlighting challenges in handling personalized models,scalability,and real-world adoption.展开更多
The current deep learning models for braced excavation cannot predict deformation from the beginning of excavation due to the need for a substantial corpus of sufficient historical data for training purposes.To addres...The current deep learning models for braced excavation cannot predict deformation from the beginning of excavation due to the need for a substantial corpus of sufficient historical data for training purposes.To address this issue,this study proposes a transfer learning model based on a sequence-to-sequence twodimensional(2D)convolutional long short-term memory neural network(S2SCL2D).The model can use the existing data from other adjacent similar excavations to achieve wall deflection prediction once a limited amount of monitoring data from the target excavation has been recorded.In the absence of adjacent excavation data,numerical simulation data from the target project can be employed instead.A weight update strategy is proposed to improve the prediction accuracy by integrating the stochastic gradient masking with an early stopping mechanism.To illustrate the proposed methodology,an excavation project in Hangzhou,China is adopted.The proposed deep transfer learning model,which uses either adjacent excavation data or numerical simulation data as the source domain,shows a significant improvement in performance when compared to the non-transfer learning model.Using the simulation data from the target project even leads to better prediction performance than using the actual monitoring data from other adjacent excavations.The results demonstrate that the proposed model can reasonably predict the deformation with limited data from the target project.展开更多
Predicting the behavior of renewable energy systems requires models capable of generating accurate forecasts from limited historical data,a challenge that becomes especially pronounced when commissioning new facil-iti...Predicting the behavior of renewable energy systems requires models capable of generating accurate forecasts from limited historical data,a challenge that becomes especially pronounced when commissioning new facil-ities where operational records are scarce.This review aims to synthesize recent progress in data-efficient deep learning approaches for addressing such“cold-start”forecasting problems.It primarily covers three interrelated domains—solar photovoltaic(PV),wind power,and electrical load forecasting—where data scarcity and operational variability are most critical,while also including representative studies on hydropower and carbon emission prediction to provide a broader systems perspective.To this end,we examined trends from over 150 predominantly peer-reviewed studies published between 2019 and mid-2025,highlighting advances in zero-shot and few-shot meta-learning frameworks that enable rapid model adaptation with minimal labeled data.Moreover,transfer learning approaches combined with spatiotemporal graph neural networks have been employed to transfer knowledge from existing energy assets to new,data-sparse environments,effectively capturing hidden dependencies among geographic features,meteorological dynamics,and grid structures.Synthetic data generation has further proven valuable for expanding training samples and mitigating overfitting in cold-start scenarios.In addition,large language models and explainable artificial intelligence(XAI)—notably conversational XAI systems—have been used to interpret and communicate complex model behaviors in accessible terms,fostering operator trust from the earliest deployment stages.By consolidating methodological advances,unresolved challenges,and open-source resources,this review provides a coherent overview of deep learning strategies that can shorten the data-sparse ramp-up period of new energy infrastructures and accelerate the transition toward resilient,low-carbon electricity grids.展开更多
Aerodynamic evaluation under multi-condition is indispensable for the design of aircraft,and the requirement for mass data still means a high cost.To address this problem,we propose a novel point-cloud multi-condition...Aerodynamic evaluation under multi-condition is indispensable for the design of aircraft,and the requirement for mass data still means a high cost.To address this problem,we propose a novel point-cloud multi-condition aerodynamics transfer learning(PCMCA-TL)framework that enables aerodynamic prediction in data-scarce sce-narios by transferring knowledge from well-learned scenarios.We modified the PointNeXt segmentation archi-tecture to a PointNeXtReg+regression model,including a working condition input module.The model is first pre-trained on a public dataset with 2000 shapes but only one working condition and then fine-tuned on a multi-condition small-scale spaceplane dataset.The effectiveness of the PCMCA-TL framework is verified by comparing the pressure coefficients predicted by direct training,pre-training,and TL models.Furthermore,by comparing the aerodynamic force coefficients calculated by predicted pressure coefficients in seconds with the correspond-ing CFD results obtained in hours,the accuracy highlights the development potential of deep transfer learning in aerodynamic evaluation.展开更多
In radiology,magnetic resonance imaging(MRI)is an essential diagnostic tool that provides detailed images of a patient’s anatomical and physiological structures.MRI is particularly effective for detecting soft tissue...In radiology,magnetic resonance imaging(MRI)is an essential diagnostic tool that provides detailed images of a patient’s anatomical and physiological structures.MRI is particularly effective for detecting soft tissue anomalies.Traditionally,radiologists manually interpret these images,which can be labor-intensive and time-consuming due to the vast amount of data.To address this challenge,machine learning,and deep learning approaches can be utilized to improve the accuracy and efficiency of anomaly detection in MRI scans.This manuscript presents the use of the Deep AlexNet50 model for MRI classification with discriminative learning methods.There are three stages for learning;in the first stage,the whole dataset is used to learn the features.In the second stage,some layers of AlexNet50 are frozen with an augmented dataset,and in the third stage,AlexNet50 with an augmented dataset with the augmented dataset.This method used three publicly available MRI classification datasets:Harvard whole brain atlas(HWBA-dataset),the School of Biomedical Engineering of Southern Medical University(SMU-dataset),and The National Institute of Neuroscience and Hospitals brain MRI dataset(NINS-dataset)for analysis.Various hyperparameter optimizers like Adam,stochastic gradient descent(SGD),Root mean square propagation(RMS prop),Adamax,and AdamW have been used to compare the performance of the learning process.HWBA-dataset registers maximum classification performance.We evaluated the performance of the proposed classification model using several quantitative metrics,achieving an average accuracy of 98%.展开更多
This study utilized a sequential mediating model to examine the role of motivation to learn and transfer selfefficacy in the relationships between perceived content validity,mentoring function,continuous learning work...This study utilized a sequential mediating model to examine the role of motivation to learn and transfer selfefficacy in the relationships between perceived content validity,mentoring function,continuous learning work culture and intention to transfer learning.The sample comprized 429 final-year apprentices in Guangdong province,China(females=69.9%,Engineering&Medicine=69%,mean age=20.99,SD=1.60).The apprentices completed standardized measures of motivation to learn,transfer self-efficacy perceived content validity,mentoring function,and continuous learning work culture.Structural equation modeling was used to analyze the data.Results showed perceived content validity,mentoring function,continuous learning culture to predict intention to transfer learning.Of these factors,perceived content validity was the strongest predictor of intention to transfer learning.Of these factors,perceived content validity was the most influential predictor of intention to transfer learning.The motivation to learn and transfer self-efficacy sequentially mediated the relationship between mentoring function and intention to learning transfer to be stronger than by either alone.Although perceived content validity and continuous learning culture exhibited no significant direct effects on intention to transfer learning,they demonstrated positive indirect associations with intention to transfer via motivation to learn and transfer self-efficacy.These study findings extend the applications of the learning transfer framework to individuals undergoing apprenticeship training which also would apply to other a long-term work-based learning programs.展开更多
The COVID-19 pandemic,which was declared by the WHO,had created a global health crisis and disrupted people’s daily lives.A large number of people were affected by the COVID-19 pandemic.Therefore,a diagnostic model n...The COVID-19 pandemic,which was declared by the WHO,had created a global health crisis and disrupted people’s daily lives.A large number of people were affected by the COVID-19 pandemic.Therefore,a diagnostic model needs to be generated which can effectively classify the COVID and non-COVID cases.In this work,our aim is to develop a diagnostic model based on deep features using effectiveness of Chest X-ray(CXR)in distinguishing COVID from non-COVID cases.The proposed diagnostic framework utilizes CXR to diagnose COVID-19 and includes Grad-CAM visualizations for a visual interpretation of predicted images.The model’s performance was evaluated using various metrics,including accuracy,precision,recall,F1-score,and Gmean.Several machine learning models,such as random forest,dense neural network,SVM,twin SVM,extreme learning machine,random vector functional link,and kernel ridge regression,were selected to diagnose COVID-19 cases.Transfer learning was used to extract deep features.For feature extraction many CNN-based models such as Inception V3,MobileNet,ResNet50,VGG16 and Xception models are used.It was evident from the experiments that ResNet50 architecture outperformed all other CNN architectures based on AUC.The TWSVM classifier achieved the highest AUC score of 0.98 based on the ResNet50 feature vector.展开更多
Background:Pneumoconioses,a group of occupational lung diseases caused by inhalation of mineral dust,pose significant health risks to affected individuals.Accurate assessment of profusion(extent of lung involvement)in...Background:Pneumoconioses,a group of occupational lung diseases caused by inhalation of mineral dust,pose significant health risks to affected individuals.Accurate assessment of profusion(extent of lung involvement)in chest radiographs is essential for screening,diagnosis and monitoring of the diseases along with epidemiological classification.This study explores an automated classification system combining U-Net-based segmentation for lung field delineation and DenseNet121 with ImageNet-based transfer learning for profusion classification.Methods:Lung field segmentation using U-Net achieved precise delineation,ensuring accurate region-of-interest definition.Transfer learning with DenseNet121 leveraged pre-trained knowledge from ImageNet,minimizing the need for extensive training.The model was fine-tuned with International Labour Organization(ILO)-2022 version standard chest radiographs and evaluated on a diverse dataset of ILO-2000 version standardized radiographs.Results:The U-Net-based segmentation demonstrated robust performance(Accuracy 94%and Dice Coefficient 90%),facilitating subsequent profusion classification.The DenseNet121-based transfer learning model exhibited high accuracy(95%),precision(92%),and recall(94%)for classifying four profusion levels on test ILO 2000/2011D dataset.The final Evaluation on ILO-2000 radiographs highlighted its generalization capability.Conclusion:The proposed system offers clinical promise,aiding radiologists,pulmonologists,general physicians,and occupational health specialists in pneumoconioses screening,diagnosis,monitoring and epidemiological classification.Best of our knowledge,this is the first work in the field of automated Classification of Profusion in Chest Radiographs of Pneumoconioses based on recently published latest ILO-2022 standard.Future research should focus on further refinement and real-world validation.This approach exemplifies the potential of deep learning for enhancing the accuracy and efficiency of pneumoconioses assessment,benefiting industrial workers,patients,and healthcare providers.展开更多
A healthy brain is vital to every person since the brain controls every movement and emotion.Sometimes,some brain cells grow unexpectedly to be uncontrollable and cancerous.These cancerous cells are called brain tumor...A healthy brain is vital to every person since the brain controls every movement and emotion.Sometimes,some brain cells grow unexpectedly to be uncontrollable and cancerous.These cancerous cells are called brain tumors.For diagnosed patients,their lives depend mainly on the early diagnosis of these tumors to provide suitable treatment plans.Nowadays,Physicians and radiologists rely on Magnetic Resonance Imaging(MRI)pictures for their clinical evaluations of brain tumors.These evaluations are time-consuming,expensive,and require expertise with high skills to provide an accurate diagnosis.Scholars and industrials have recently partnered to implement automatic solutions to diagnose the disease with high accuracy.Due to their accuracy,some of these solutions depend on deep-learning(DL)methodologies.These techniques have become important due to their roles in the diagnosis process,which includes identification and classification.Therefore,there is a need for a solid and robust approach based on a deep-learning method to diagnose brain tumors.The purpose of this study is to develop an intelligent automatic framework for brain tumor diagnosis.The proposed solution is based on a novel dense dynamic residual self-attention transfer adaptive learning fusion approach(NDDRSATALFA),carried over two implemented deep-learning networks:VGG19 and UNET to identify and classify brain tumors.In addition,this solution applies a transfer learning approach to exchange extracted features and data within the two neural networks.The presented framework is trained,validated,and tested on six public datasets of MRIs to detect brain tumors and categorize these tumors into three suitable classes,which are glioma,meningioma,and pituitary.The proposed framework yielded remarkable findings on variously evaluated performance indicators:99.32%accuracy,98.74%sensitivity,98.89%specificity,99.01%Dice,98.93%Area Under the Curve(AUC),and 99.81%F1-score.In addition,a comparative analysis with recent state-of-the-art methods was performed and according to the comparative analysis,NDDRSATALFA shows an admirable level of reliability in simplifying the timely identification of diverse brain tumors.Moreover,this framework can be applied by healthcare providers to assist radiologists,pathologists,and physicians in their evaluations.The attained outcomes open doors for advanced automatic solutions that improve clinical evaluations and provide reasonable treatment plans.展开更多
Many applications,including security systems,medical diagnostics,and human-computer interfaces,depend on eye gaze recognition.However,due to factors including individual variations,occlusions,and shifting illumination...Many applications,including security systems,medical diagnostics,and human-computer interfaces,depend on eye gaze recognition.However,due to factors including individual variations,occlusions,and shifting illumination conditions,real-world scenarios continue to provide difficulties for accurate and consistent eye gaze recognition.This work is aimed at investigating the potential benefits of employing transfer learning to improve eye gaze detection ability and efficiency.Transfer learning is the process of fine-tuning pre-trained models on smaller,domain-specific datasets after they have been trained on larger datasets.We study several transfer learning algorithms and evaluate their effectiveness on eye gaze identification,including both Regression and Classification tasks,using a range of deep learning architectures,namely AlexNet,Visual Geometry Group(VGG),InceptionV3,and ResNet.In this study,we evaluate the effectiveness of transfer learning-basedmodels against models that were trained fromscratch using eye-gazing datasets on grounds of various performance and loss metrics such as Precision,Accuracy,and Mean Absolute Error.We investigate the effects of different pre-trainedmodels,dataset sizes,and domain gaps on the transfer learning process,and the findings of our study clarify the efficacy of transfer learning for eye gaze detection and offer suggestions for the most successful transfer learning strategies to apply in real-world situations.展开更多
This study presents an emergency control method for sub-synchronous oscillations in wind power gridconnected systems based on transfer learning,addressing the issue of insufficient generalization ability of traditiona...This study presents an emergency control method for sub-synchronous oscillations in wind power gridconnected systems based on transfer learning,addressing the issue of insufficient generalization ability of traditional methods in complex real-world scenarios.By combining deep reinforcement learning with a transfer learning framework,cross-scenario knowledge transfer is achieved,significantly enhancing the adaptability of the control strategy.First,a sub-synchronous oscillation emergency control model for the wind power grid integration system is constructed under fixed scenarios based on deep reinforcement learning.A reward evaluation system based on the active power oscillation pattern of the system is proposed,introducing penalty functions for the number of machine-shedding rounds and the number of machines shed.This avoids the economic losses and grid security risks caused by the excessive one-time shedding of wind turbines.Furthermore,transfer learning is introduced into model training to enhance the model’s generalization capability in dealing with complex scenarios of actual wind power grid integration systems.By introducing the Maximum Mean Discrepancy(MMD)algorithm to calculate the distribution differences between source data and target data,the online decision-making reliability of the emergency control model is improved.Finally,the effectiveness of the proposed emergency control method for multi-scenario sub-synchronous oscillation in wind power grid integration systems based on transfer learning is analyzed using the New England 39-bus system.展开更多
The Internet of Things(IoT)is an innovation that combines imagined space with the actual world on a single platform.Because of the recent rapid rise of IoT devices,there has been a lack of standards,leading to a massi...The Internet of Things(IoT)is an innovation that combines imagined space with the actual world on a single platform.Because of the recent rapid rise of IoT devices,there has been a lack of standards,leading to a massive increase in unprotected devices connecting to networks.Consequently,cyberattacks on IoT are becoming more common,particularly keylogging attacks,which are often caused by security vulnerabilities on IoT networks.This research focuses on the role of transfer learning and ensemble classifiers in enhancing the detection of keylogging attacks within small,imbalanced IoT datasets.The authors propose a model that combines transfer learning with ensemble classification methods,leading to improved detection accuracy.By leveraging the BoT-IoT and keylogger_detection datasets,they facilitate the transfer of knowledge across various domains.The results reveal that the integration of transfer learning and ensemble classifiers significantly improves detection capabilities,even in scenarios with limited data availability.The proposed TRANS-ENS model showcases exceptional accuracy and a minimal false positive rate,outperforming current deep learning approaches.The primary objectives include:(i)introducing an ensemble feature selection technique to identify common features across models,(ii)creating a pre-trained deep learning model through transfer learning for the detection of keylogging attacks,and(iii)developing a transfer learning-ensemble model dedicated to keylogging detection.Experimental findings indicate that the TRANS-ENS model achieves a detection accuracy of 96.06%and a false alarm rate of 0.12%,surpassing existing models such as CNN,RNN,and LSTM.展开更多
Calculating the inter-layer ion diffusion barrier, a crucial metric for evaluating the rate performance of 2D electrode materials, is time-consuming using the transition state search approach. A novel electrostatic po...Calculating the inter-layer ion diffusion barrier, a crucial metric for evaluating the rate performance of 2D electrode materials, is time-consuming using the transition state search approach. A novel electrostatic potential distribution image (EPDI) transfer learning method has been proposed to efficiently and accurately predict the lithium diffusion barriers on metal element-doped transition metal dichalcogenide (TMD) surfaces. Through the analysis of the mean electrostatic potential (MEP) around binding sites, a positive correlation between binding energy and MEP in VIB-TMDs was identified. Subsequently, transfer learning techniques were used to develop a DenseNet121-TL model for establishing a more accurate mapping between the binding energy and electrostatic potential distribution. Trained on training sets containing 33% and 50% transition state search calculation results, which save 66% and 50% of the calculation time, respectively, the model achieves accurate predictions of the saddle point binding energy with mean absolute errors (MAEs) of 0.0444 and 0.0287 eV on the testing set. Based on the prediction of saddle point binding energies, we obtained a diffusion minimum energy profile with an MAE of 0.0235 eV. Furthermore, by analyzing the diffusion data, we observed that the diffusion barrier was lowered by 10% on V-doped TiS2 compared to the stoichiometric surface. Our findings are expected to provide new insights for the high-throughput calculation of ion diffusion on 2D materials.展开更多
Developing machine learning frameworks with predictive power,interpretability,and transferability is crucial,yet it faces challenges in the field of electrocatalysis.To achieve this,we employed rigorous feature engine...Developing machine learning frameworks with predictive power,interpretability,and transferability is crucial,yet it faces challenges in the field of electrocatalysis.To achieve this,we employed rigorous feature engineering to establish a finely tuned gradient boosting regressor(GBR)model,which adeptly captures the physical complexity from feature space to target variables.We demonstrated that environmental electron effects and atomic number significantly govern the success of the mapping process via global and local explanations.The finely tuned GBR model exhibits exceptional robustness in predicting CO adsorption energies(R_(ave)^(2)=0.937,RMSE=0.153 eV).Moreover,the model demonstrated remarkable transfer learning ability,showing excellent predictive power for OH,NO,and N_(2) adsorption.Importantly,the GBR model exhibits exceptional predictive capability across an extensive search space,thereby demonstrating profound adaptability and versatility.Our research framework significantly enhances the interpretability and transferability of machine learning in electrocatalysis,offering vital insights for further advancements.展开更多
Transfer learning is the predominant method for adapting pre-trained models on another task to new domains while preserving their internal architectures and augmenting them with requisite layers in Deep Neural Network...Transfer learning is the predominant method for adapting pre-trained models on another task to new domains while preserving their internal architectures and augmenting them with requisite layers in Deep Neural Network models.Training intricate pre-trained models on a sizable dataset requires significant resources to fine-tune hyperparameters carefully.Most existing initialization methods mainly focus on gradient flow-related problems,such as gradient vanishing or exploding,or other existing approaches that require extra models that do not consider our setting,which is more practical.To address these problems,we suggest employing gradient-free heuristic methods to initialize the weights of the final new-added fully connected layer in neural networks froma small set of training data with fewer classes.The approach relies on partitioning the output values from pre-trained models for a small set into two separate intervals determined by the targets.This process is framed as an optimization problem for each output neuron and class.The optimization selects the highest values as weights,considering their direction towards the respective classes.Furthermore,empirical 145 experiments involve a variety of neural networkmodels tested acrossmultiple benchmarks and domains,occasionally yielding accuracies comparable to those achieved with gradient descent methods by using only small subsets.展开更多
Microseismic monitoring technology is widely used in tunnel and coal mine safety production.For signals generated by ultra-weak microseismic events,traditional sensors encounter limitations in terms of detection sensi...Microseismic monitoring technology is widely used in tunnel and coal mine safety production.For signals generated by ultra-weak microseismic events,traditional sensors encounter limitations in terms of detection sensitivity.Given the complex engineering environment,automatic multi-classification of microseismic data is highly required.In this study,we use acceleration sensors to collect signals and combine the improved Visual Geometry Group with a convolutional block attention module to obtain a new network structure,termed CNN_BAM,for automatic classification and identification of microseismic events.We use the dataset collected from the Hanjiang-to-Weihe River Diversion Project to train and validate the network model.Results show that the CNN_BAM model exhibits good feature extraction ability,achieving a recognition accuracy of 99.29%,surpassing all its counterparts.The stability and accuracy of the classification algorithm improve remarkably.In addition,through fine-tuning and migration to the Pan Ⅱ Mine Project,the network demonstrates reliable generalization performance.This outcome reflects its adaptability across different projects and promising application prospects.展开更多
基金the Australian Research Council Discovery Projects funding scheme(DP190102181,DP210101465).
文摘Accurate detection of driver fatigue is essential for improving road safety.This study investigates the effectiveness of using multimodal physiological signals for fatigue detection while incorporating uncertainty quantification to enhance the reliability of predictions.Physiological signals,including Electrocardiogram(ECG),Galvanic Skin Response(GSR),and Electroencephalogram(EEG),were transformed into image representations and analyzed using pretrained deep neu-ral networks.The extracted features were classified through a feedforward neural network,and prediction reliability was assessed using uncertainty quantification techniques such as Monte Carlo Dropout(MCD),model ensembles,and combined approaches.Evaluation metrics included standard measures(sensitivity,specificity,precision,and accuracy)along with uncertainty-aware metrics such as uncertainty sensitivity and uncertainty precision.Across all evaluations,ECG-based models consistently demonstrated strong performance.The findings indicate that combining multimodal physi-ological signals,Transfer Learning(TL),and uncertainty quantification can significantly improve both the accuracy and trustworthiness of fatigue detection systems.This approach supports the development of more reliable driver assistance technologies aimed at preventing fatigue-related accidents.
基金partially supported by the NSF(Grant No.DMS 2324368)by the Office of the Vice Chancellor for Research and Graduate Education at the University of Wisconsin-Madison with funding from the Wisconsin Alumni Research Foundation.
文摘In this paper,we propose a systematic approach for accelerating finite element-type methods by machine learning for the numerical solution of partial differential equations(PDEs).The main idea is to use a neural network to learn the solution map of the PDEs and to do so in an element-wise fashion.This map takes input of the element geometry and the PDE’s parameters on that element,and gives output of two operators:(1)the in2out operator for inter-element communication,and(2)the in2sol operator(Green’s function)for element-wise solution recovery.A significant advantage of this approach is that,once trained,this network can be used for the numerical solution of the PDE for any domain geometry and any parameter distribution without retraining.Also,the training is significantly simpler since it is done on the element level instead on the entire domain.We call this approach element learning.This method is closely related to hybridizable discontinuous Galerkin(HDG)methods in the sense that the local solvers of HDG are replaced by machine learning approaches.Numerical tests are presented for an example PDE,the radiative transfer or radiation transport equation,in a variety of scenarios with idealized or realistic cloud fields,with smooth or sharp gradient in the cloud boundary transition.Under a fixed accuracy level of 10^(−3) in the relative L^(2) error,and polynomial degree p=6 in each element,we observe an approximately 5 to 10 times speed-up by element learning compared to a classical finite element-type method.
基金sponsored by the National Natural Science Foundation of China(Grant No.52178100).
文摘The spatial offset of bridge has a significant impact on the safety,comfort,and durability of high-speed railway(HSR)operations,so it is crucial to rapidly and effectively detect the spatial offset of operational HSR bridges.Drive-by monitoring of bridge uneven settlement demonstrates significant potential due to its practicality,cost-effectiveness,and efficiency.However,existing drive-by methods for detecting bridge offset have limitations such as reliance on a single data source,low detection accuracy,and the inability to identify lateral deformations of bridges.This paper proposes a novel drive-by inspection method for spatial offset of HSR bridge based on multi-source data fusion of comprehensive inspection train.Firstly,dung beetle optimizer-variational mode decomposition was employed to achieve adaptive decomposition of non-stationary dynamic signals,and explore the hidden temporal relationships in the data.Subsequently,a long short-term memory neural network was developed to achieve feature fusion of multi-source signal and accurate prediction of spatial settlement of HSR bridge.A dataset of track irregularities and CRH380A high-speed train responses was generated using a 3D train-track-bridge interaction model,and the accuracy and effectiveness of the proposed hybrid deep learning model were numerically validated.Finally,the reliability of the proposed drive-by inspection method was further validated by analyzing the actual measurement data obtained from comprehensive inspection train.The research findings indicate that the proposed approach enables rapid and accurate detection of spatial offset in HSR bridge,ensuring the long-term operational safety of HSR bridges.
基金supported by the National Research Foundation of Korea(NRF)grant funded by the Korea government(MSIT)(No.RS-2024-00406320)the Institute of Information&Communica-tions Technology Planning&Evaluation(IITP)-Innovative Human Resource Development for Local Intellectualization Program Grant funded by the Korea government(MSIT)(IITP-2026-RS-2023-00259678).
文摘Domain adaptation aims to reduce the distribution gap between the training data(source domain)and the target data.This enables effective predictions even for domains not seen during training.However,most conventional domain adaptation methods assume a single source domain,making them less suitable for modern deep learning settings that rely on diverse and large-scale datasets.To address this limitation,recent research has focused on Multi-Source Domain Adaptation(MSDA),which aims to learn effectively from multiple source domains.In this paper,we propose Efficient Domain Transition for Multi-source(EDTM),a novel and efficient framework designed to tackle two major challenges in existing MSDA approaches:(1)integrating knowledge across different source domains and(2)aligning label distributions between source and target domains.EDTM leverages an ensemble-based classifier expert mechanism to enhance the contribution of source domains that are more similar to the target domain.To further stabilize the learning process and improve performance,we incorporate imitation learning into the training of the target model.In addition,Maximum Classifier Discrepancy(MCD)is employed to align class-wise label distributions between the source and target domains.Experiments were conducted using Digits-Five,one of the most representative benchmark datasets for MSDA.The results show that EDTM consistently outperforms existing methods in terms of average classification accuracy.Notably,EDTM achieved significantly higher performance on target domains such as Modified National Institute of Standards and Technolog with blended background images(MNIST-M)and Street View House Numbers(SVHN)datasets,demonstrating enhanced generalization compared to baseline approaches.Furthermore,an ablation study analyzing the contribution of each loss component validated the effectiveness of the framework,highlighting the importance of each module in achieving optimal performance.
文摘Federated Learning(FL)has become a leading decentralized solution that enables multiple clients to train a model in a collaborative environment without directly sharing raw data,making it suitable for privacy-sensitive applications such as healthcare,finance,and smart systems.As the field continues to evolve,the research field has become more complex and scattered,covering different system designs,training methods,and privacy techniques.This survey is organized around the three core challenges:how the data is distributed,how models are synchronized,and how to defend against attacks.It provides a structured and up-to-date review of FL research from 2023 to 2025,offering a unified taxonomy that categorizes works by data distribution(Horizontal FL,Vertical FL,Federated Transfer Learning,and Personalized FL),training synchronization(synchronous and asynchronous FL),optimization strategies,and threat models(data leakage and poisoning attacks).In particular,we summarize the latest contributions in Vertical FL frameworks for secure multi-party learning,communication-efficient Horizontal FL,and domain-adaptive Federated Transfer Learning.Furthermore,we examine synchronization techniques addressing system heterogeneity,including straggler mitigation in synchronous FL and staleness management in asynchronous FL.The survey covers security threats in FL,such as gradient inversion,membership inference,and poisoning attacks,as well as their defense strategies that include privacy-preserving aggregation and anomaly detection.The paper concludes by outlining unresolved issues and highlighting challenges in handling personalized models,scalability,and real-world adoption.
基金supported by the National Key Research and Development Program of China(Grant No.2023YFC3009400)the National Natural Science Foundation of China(Grant Nos.42307218 and U2239251).
文摘The current deep learning models for braced excavation cannot predict deformation from the beginning of excavation due to the need for a substantial corpus of sufficient historical data for training purposes.To address this issue,this study proposes a transfer learning model based on a sequence-to-sequence twodimensional(2D)convolutional long short-term memory neural network(S2SCL2D).The model can use the existing data from other adjacent similar excavations to achieve wall deflection prediction once a limited amount of monitoring data from the target excavation has been recorded.In the absence of adjacent excavation data,numerical simulation data from the target project can be employed instead.A weight update strategy is proposed to improve the prediction accuracy by integrating the stochastic gradient masking with an early stopping mechanism.To illustrate the proposed methodology,an excavation project in Hangzhou,China is adopted.The proposed deep transfer learning model,which uses either adjacent excavation data or numerical simulation data as the source domain,shows a significant improvement in performance when compared to the non-transfer learning model.Using the simulation data from the target project even leads to better prediction performance than using the actual monitoring data from other adjacent excavations.The results demonstrate that the proposed model can reasonably predict the deformation with limited data from the target project.
文摘Predicting the behavior of renewable energy systems requires models capable of generating accurate forecasts from limited historical data,a challenge that becomes especially pronounced when commissioning new facil-ities where operational records are scarce.This review aims to synthesize recent progress in data-efficient deep learning approaches for addressing such“cold-start”forecasting problems.It primarily covers three interrelated domains—solar photovoltaic(PV),wind power,and electrical load forecasting—where data scarcity and operational variability are most critical,while also including representative studies on hydropower and carbon emission prediction to provide a broader systems perspective.To this end,we examined trends from over 150 predominantly peer-reviewed studies published between 2019 and mid-2025,highlighting advances in zero-shot and few-shot meta-learning frameworks that enable rapid model adaptation with minimal labeled data.Moreover,transfer learning approaches combined with spatiotemporal graph neural networks have been employed to transfer knowledge from existing energy assets to new,data-sparse environments,effectively capturing hidden dependencies among geographic features,meteorological dynamics,and grid structures.Synthetic data generation has further proven valuable for expanding training samples and mitigating overfitting in cold-start scenarios.In addition,large language models and explainable artificial intelligence(XAI)—notably conversational XAI systems—have been used to interpret and communicate complex model behaviors in accessible terms,fostering operator trust from the earliest deployment stages.By consolidating methodological advances,unresolved challenges,and open-source resources,this review provides a coherent overview of deep learning strategies that can shorten the data-sparse ramp-up period of new energy infrastructures and accelerate the transition toward resilient,low-carbon electricity grids.
基金supported by the Natural Science Foundation of Hunan Province of China(Grant No.2021JJ10045).
文摘Aerodynamic evaluation under multi-condition is indispensable for the design of aircraft,and the requirement for mass data still means a high cost.To address this problem,we propose a novel point-cloud multi-condition aerodynamics transfer learning(PCMCA-TL)framework that enables aerodynamic prediction in data-scarce sce-narios by transferring knowledge from well-learned scenarios.We modified the PointNeXt segmentation archi-tecture to a PointNeXtReg+regression model,including a working condition input module.The model is first pre-trained on a public dataset with 2000 shapes but only one working condition and then fine-tuned on a multi-condition small-scale spaceplane dataset.The effectiveness of the PCMCA-TL framework is verified by comparing the pressure coefficients predicted by direct training,pre-training,and TL models.Furthermore,by comparing the aerodynamic force coefficients calculated by predicted pressure coefficients in seconds with the correspond-ing CFD results obtained in hours,the accuracy highlights the development potential of deep transfer learning in aerodynamic evaluation.
文摘In radiology,magnetic resonance imaging(MRI)is an essential diagnostic tool that provides detailed images of a patient’s anatomical and physiological structures.MRI is particularly effective for detecting soft tissue anomalies.Traditionally,radiologists manually interpret these images,which can be labor-intensive and time-consuming due to the vast amount of data.To address this challenge,machine learning,and deep learning approaches can be utilized to improve the accuracy and efficiency of anomaly detection in MRI scans.This manuscript presents the use of the Deep AlexNet50 model for MRI classification with discriminative learning methods.There are three stages for learning;in the first stage,the whole dataset is used to learn the features.In the second stage,some layers of AlexNet50 are frozen with an augmented dataset,and in the third stage,AlexNet50 with an augmented dataset with the augmented dataset.This method used three publicly available MRI classification datasets:Harvard whole brain atlas(HWBA-dataset),the School of Biomedical Engineering of Southern Medical University(SMU-dataset),and The National Institute of Neuroscience and Hospitals brain MRI dataset(NINS-dataset)for analysis.Various hyperparameter optimizers like Adam,stochastic gradient descent(SGD),Root mean square propagation(RMS prop),Adamax,and AdamW have been used to compare the performance of the learning process.HWBA-dataset registers maximum classification performance.We evaluated the performance of the proposed classification model using several quantitative metrics,achieving an average accuracy of 98%.
基金funded by Hanshan Normal University School-Level Research Initiation Program(grant numbers QD202244QD2024207)+3 种基金the Guangdong Higher Education Society’s“Fourteenth Five-Year”Plan 2024 Higher Education Research(grant number 24GYB43)the 2024 Guangdong Provincial Undergraduate Teaching Quality and Teaching Reform Engineering Project:Excellence Program for Cultivating Publicly-Funded Pre-service Teachers for Primary Education in the Context of Rural Revitalizationthe Hanshan Normal University Guangdong East Regional Education Collaborative Innovation Research Centerfunded by these sources.
文摘This study utilized a sequential mediating model to examine the role of motivation to learn and transfer selfefficacy in the relationships between perceived content validity,mentoring function,continuous learning work culture and intention to transfer learning.The sample comprized 429 final-year apprentices in Guangdong province,China(females=69.9%,Engineering&Medicine=69%,mean age=20.99,SD=1.60).The apprentices completed standardized measures of motivation to learn,transfer self-efficacy perceived content validity,mentoring function,and continuous learning work culture.Structural equation modeling was used to analyze the data.Results showed perceived content validity,mentoring function,continuous learning culture to predict intention to transfer learning.Of these factors,perceived content validity was the strongest predictor of intention to transfer learning.Of these factors,perceived content validity was the most influential predictor of intention to transfer learning.The motivation to learn and transfer self-efficacy sequentially mediated the relationship between mentoring function and intention to learning transfer to be stronger than by either alone.Although perceived content validity and continuous learning culture exhibited no significant direct effects on intention to transfer learning,they demonstrated positive indirect associations with intention to transfer via motivation to learn and transfer self-efficacy.These study findings extend the applications of the learning transfer framework to individuals undergoing apprenticeship training which also would apply to other a long-term work-based learning programs.
文摘The COVID-19 pandemic,which was declared by the WHO,had created a global health crisis and disrupted people’s daily lives.A large number of people were affected by the COVID-19 pandemic.Therefore,a diagnostic model needs to be generated which can effectively classify the COVID and non-COVID cases.In this work,our aim is to develop a diagnostic model based on deep features using effectiveness of Chest X-ray(CXR)in distinguishing COVID from non-COVID cases.The proposed diagnostic framework utilizes CXR to diagnose COVID-19 and includes Grad-CAM visualizations for a visual interpretation of predicted images.The model’s performance was evaluated using various metrics,including accuracy,precision,recall,F1-score,and Gmean.Several machine learning models,such as random forest,dense neural network,SVM,twin SVM,extreme learning machine,random vector functional link,and kernel ridge regression,were selected to diagnose COVID-19 cases.Transfer learning was used to extract deep features.For feature extraction many CNN-based models such as Inception V3,MobileNet,ResNet50,VGG16 and Xception models are used.It was evident from the experiments that ResNet50 architecture outperformed all other CNN architectures based on AUC.The TWSVM classifier achieved the highest AUC score of 0.98 based on the ResNet50 feature vector.
文摘Background:Pneumoconioses,a group of occupational lung diseases caused by inhalation of mineral dust,pose significant health risks to affected individuals.Accurate assessment of profusion(extent of lung involvement)in chest radiographs is essential for screening,diagnosis and monitoring of the diseases along with epidemiological classification.This study explores an automated classification system combining U-Net-based segmentation for lung field delineation and DenseNet121 with ImageNet-based transfer learning for profusion classification.Methods:Lung field segmentation using U-Net achieved precise delineation,ensuring accurate region-of-interest definition.Transfer learning with DenseNet121 leveraged pre-trained knowledge from ImageNet,minimizing the need for extensive training.The model was fine-tuned with International Labour Organization(ILO)-2022 version standard chest radiographs and evaluated on a diverse dataset of ILO-2000 version standardized radiographs.Results:The U-Net-based segmentation demonstrated robust performance(Accuracy 94%and Dice Coefficient 90%),facilitating subsequent profusion classification.The DenseNet121-based transfer learning model exhibited high accuracy(95%),precision(92%),and recall(94%)for classifying four profusion levels on test ILO 2000/2011D dataset.The final Evaluation on ILO-2000 radiographs highlighted its generalization capability.Conclusion:The proposed system offers clinical promise,aiding radiologists,pulmonologists,general physicians,and occupational health specialists in pneumoconioses screening,diagnosis,monitoring and epidemiological classification.Best of our knowledge,this is the first work in the field of automated Classification of Profusion in Chest Radiographs of Pneumoconioses based on recently published latest ILO-2022 standard.Future research should focus on further refinement and real-world validation.This approach exemplifies the potential of deep learning for enhancing the accuracy and efficiency of pneumoconioses assessment,benefiting industrial workers,patients,and healthcare providers.
基金funded by the Deanship of Scientific Research(DSR)at King Abdulaziz University,Jeddah,Saudi Arabia under Grant No.(GPIP:1055-829-2024).
文摘A healthy brain is vital to every person since the brain controls every movement and emotion.Sometimes,some brain cells grow unexpectedly to be uncontrollable and cancerous.These cancerous cells are called brain tumors.For diagnosed patients,their lives depend mainly on the early diagnosis of these tumors to provide suitable treatment plans.Nowadays,Physicians and radiologists rely on Magnetic Resonance Imaging(MRI)pictures for their clinical evaluations of brain tumors.These evaluations are time-consuming,expensive,and require expertise with high skills to provide an accurate diagnosis.Scholars and industrials have recently partnered to implement automatic solutions to diagnose the disease with high accuracy.Due to their accuracy,some of these solutions depend on deep-learning(DL)methodologies.These techniques have become important due to their roles in the diagnosis process,which includes identification and classification.Therefore,there is a need for a solid and robust approach based on a deep-learning method to diagnose brain tumors.The purpose of this study is to develop an intelligent automatic framework for brain tumor diagnosis.The proposed solution is based on a novel dense dynamic residual self-attention transfer adaptive learning fusion approach(NDDRSATALFA),carried over two implemented deep-learning networks:VGG19 and UNET to identify and classify brain tumors.In addition,this solution applies a transfer learning approach to exchange extracted features and data within the two neural networks.The presented framework is trained,validated,and tested on six public datasets of MRIs to detect brain tumors and categorize these tumors into three suitable classes,which are glioma,meningioma,and pituitary.The proposed framework yielded remarkable findings on variously evaluated performance indicators:99.32%accuracy,98.74%sensitivity,98.89%specificity,99.01%Dice,98.93%Area Under the Curve(AUC),and 99.81%F1-score.In addition,a comparative analysis with recent state-of-the-art methods was performed and according to the comparative analysis,NDDRSATALFA shows an admirable level of reliability in simplifying the timely identification of diverse brain tumors.Moreover,this framework can be applied by healthcare providers to assist radiologists,pathologists,and physicians in their evaluations.The attained outcomes open doors for advanced automatic solutions that improve clinical evaluations and provide reasonable treatment plans.
文摘Many applications,including security systems,medical diagnostics,and human-computer interfaces,depend on eye gaze recognition.However,due to factors including individual variations,occlusions,and shifting illumination conditions,real-world scenarios continue to provide difficulties for accurate and consistent eye gaze recognition.This work is aimed at investigating the potential benefits of employing transfer learning to improve eye gaze detection ability and efficiency.Transfer learning is the process of fine-tuning pre-trained models on smaller,domain-specific datasets after they have been trained on larger datasets.We study several transfer learning algorithms and evaluate their effectiveness on eye gaze identification,including both Regression and Classification tasks,using a range of deep learning architectures,namely AlexNet,Visual Geometry Group(VGG),InceptionV3,and ResNet.In this study,we evaluate the effectiveness of transfer learning-basedmodels against models that were trained fromscratch using eye-gazing datasets on grounds of various performance and loss metrics such as Precision,Accuracy,and Mean Absolute Error.We investigate the effects of different pre-trainedmodels,dataset sizes,and domain gaps on the transfer learning process,and the findings of our study clarify the efficacy of transfer learning for eye gaze detection and offer suggestions for the most successful transfer learning strategies to apply in real-world situations.
基金funded by Sponsorship of Science and Technology Project of State Grid Xinjiang Electric Power Co.,Ltd.,grant number SGXJ0000TKJS2400168.
文摘This study presents an emergency control method for sub-synchronous oscillations in wind power gridconnected systems based on transfer learning,addressing the issue of insufficient generalization ability of traditional methods in complex real-world scenarios.By combining deep reinforcement learning with a transfer learning framework,cross-scenario knowledge transfer is achieved,significantly enhancing the adaptability of the control strategy.First,a sub-synchronous oscillation emergency control model for the wind power grid integration system is constructed under fixed scenarios based on deep reinforcement learning.A reward evaluation system based on the active power oscillation pattern of the system is proposed,introducing penalty functions for the number of machine-shedding rounds and the number of machines shed.This avoids the economic losses and grid security risks caused by the excessive one-time shedding of wind turbines.Furthermore,transfer learning is introduced into model training to enhance the model’s generalization capability in dealing with complex scenarios of actual wind power grid integration systems.By introducing the Maximum Mean Discrepancy(MMD)algorithm to calculate the distribution differences between source data and target data,the online decision-making reliability of the emergency control model is improved.Finally,the effectiveness of the proposed emergency control method for multi-scenario sub-synchronous oscillation in wind power grid integration systems based on transfer learning is analyzed using the New England 39-bus system.
基金the Deanship of Graduate Studies and Scientific Research at Najran University for supporting the research project through the Group Research,with the project code NU/GP/SERC/13/712。
文摘The Internet of Things(IoT)is an innovation that combines imagined space with the actual world on a single platform.Because of the recent rapid rise of IoT devices,there has been a lack of standards,leading to a massive increase in unprotected devices connecting to networks.Consequently,cyberattacks on IoT are becoming more common,particularly keylogging attacks,which are often caused by security vulnerabilities on IoT networks.This research focuses on the role of transfer learning and ensemble classifiers in enhancing the detection of keylogging attacks within small,imbalanced IoT datasets.The authors propose a model that combines transfer learning with ensemble classification methods,leading to improved detection accuracy.By leveraging the BoT-IoT and keylogger_detection datasets,they facilitate the transfer of knowledge across various domains.The results reveal that the integration of transfer learning and ensemble classifiers significantly improves detection capabilities,even in scenarios with limited data availability.The proposed TRANS-ENS model showcases exceptional accuracy and a minimal false positive rate,outperforming current deep learning approaches.The primary objectives include:(i)introducing an ensemble feature selection technique to identify common features across models,(ii)creating a pre-trained deep learning model through transfer learning for the detection of keylogging attacks,and(iii)developing a transfer learning-ensemble model dedicated to keylogging detection.Experimental findings indicate that the TRANS-ENS model achieves a detection accuracy of 96.06%and a false alarm rate of 0.12%,surpassing existing models such as CNN,RNN,and LSTM.
基金supported by the National Natural Science Foundation of China(Nos.51974056 and 51474047)the Foundation of the Supercomputing Center of Dalian University of Technology,and the Foundation of the Key Laboratory of Solidification Control and Digital Preparation Technology(Liaoning Province),China.
文摘Calculating the inter-layer ion diffusion barrier, a crucial metric for evaluating the rate performance of 2D electrode materials, is time-consuming using the transition state search approach. A novel electrostatic potential distribution image (EPDI) transfer learning method has been proposed to efficiently and accurately predict the lithium diffusion barriers on metal element-doped transition metal dichalcogenide (TMD) surfaces. Through the analysis of the mean electrostatic potential (MEP) around binding sites, a positive correlation between binding energy and MEP in VIB-TMDs was identified. Subsequently, transfer learning techniques were used to develop a DenseNet121-TL model for establishing a more accurate mapping between the binding energy and electrostatic potential distribution. Trained on training sets containing 33% and 50% transition state search calculation results, which save 66% and 50% of the calculation time, respectively, the model achieves accurate predictions of the saddle point binding energy with mean absolute errors (MAEs) of 0.0444 and 0.0287 eV on the testing set. Based on the prediction of saddle point binding energies, we obtained a diffusion minimum energy profile with an MAE of 0.0235 eV. Furthermore, by analyzing the diffusion data, we observed that the diffusion barrier was lowered by 10% on V-doped TiS2 compared to the stoichiometric surface. Our findings are expected to provide new insights for the high-throughput calculation of ion diffusion on 2D materials.
基金supported by the Research Grants Council of Hong Kong(CityU 11305919 and 11308620)and NSFC/RGC Joint Research Scheme N_CityU104/19Hong Kong Research Grant Council Collaborative Research Fund:C1002-21G and C1017-22Gsupported by the Hong Kong Research Grant Council Collaborative Research Fund:C6021-19E.
文摘Developing machine learning frameworks with predictive power,interpretability,and transferability is crucial,yet it faces challenges in the field of electrocatalysis.To achieve this,we employed rigorous feature engineering to establish a finely tuned gradient boosting regressor(GBR)model,which adeptly captures the physical complexity from feature space to target variables.We demonstrated that environmental electron effects and atomic number significantly govern the success of the mapping process via global and local explanations.The finely tuned GBR model exhibits exceptional robustness in predicting CO adsorption energies(R_(ave)^(2)=0.937,RMSE=0.153 eV).Moreover,the model demonstrated remarkable transfer learning ability,showing excellent predictive power for OH,NO,and N_(2) adsorption.Importantly,the GBR model exhibits exceptional predictive capability across an extensive search space,thereby demonstrating profound adaptability and versatility.Our research framework significantly enhances the interpretability and transferability of machine learning in electrocatalysis,offering vital insights for further advancements.
基金supported by the BK21 FOUR project(AI-driven Convergence Software Education Research Program)funded by the Ministry of Education,School of Computer Science and Engineering,Kyungpook National University,Republic of Korea(4120240214871)supported by the New Faculty Start Up Fund from LSU Health Sciences New Orleans,LA,USA.
文摘Transfer learning is the predominant method for adapting pre-trained models on another task to new domains while preserving their internal architectures and augmenting them with requisite layers in Deep Neural Network models.Training intricate pre-trained models on a sizable dataset requires significant resources to fine-tune hyperparameters carefully.Most existing initialization methods mainly focus on gradient flow-related problems,such as gradient vanishing or exploding,or other existing approaches that require extra models that do not consider our setting,which is more practical.To address these problems,we suggest employing gradient-free heuristic methods to initialize the weights of the final new-added fully connected layer in neural networks froma small set of training data with fewer classes.The approach relies on partitioning the output values from pre-trained models for a small set into two separate intervals determined by the targets.This process is framed as an optimization problem for each output neuron and class.The optimization selects the highest values as weights,considering their direction towards the respective classes.Furthermore,empirical 145 experiments involve a variety of neural networkmodels tested acrossmultiple benchmarks and domains,occasionally yielding accuracies comparable to those achieved with gradient descent methods by using only small subsets.
基金supported by the Key Research and Development Plan of Anhui Province(202104a05020059)the Excellent Scientific Research and Innovation Team of Anhui Province(2022AH010003)support from Hefei Comprehensive National Science Center is highly appreciated.
文摘Microseismic monitoring technology is widely used in tunnel and coal mine safety production.For signals generated by ultra-weak microseismic events,traditional sensors encounter limitations in terms of detection sensitivity.Given the complex engineering environment,automatic multi-classification of microseismic data is highly required.In this study,we use acceleration sensors to collect signals and combine the improved Visual Geometry Group with a convolutional block attention module to obtain a new network structure,termed CNN_BAM,for automatic classification and identification of microseismic events.We use the dataset collected from the Hanjiang-to-Weihe River Diversion Project to train and validate the network model.Results show that the CNN_BAM model exhibits good feature extraction ability,achieving a recognition accuracy of 99.29%,surpassing all its counterparts.The stability and accuracy of the classification algorithm improve remarkably.In addition,through fine-tuning and migration to the Pan Ⅱ Mine Project,the network demonstrates reliable generalization performance.This outcome reflects its adaptability across different projects and promising application prospects.