This study utilized a sequential mediating model to examine the role of motivation to learn and transfer selfefficacy in the relationships between perceived content validity,mentoring function,continuous learning work...This study utilized a sequential mediating model to examine the role of motivation to learn and transfer selfefficacy in the relationships between perceived content validity,mentoring function,continuous learning work culture and intention to transfer learning.The sample comprized 429 final-year apprentices in Guangdong province,China(females=69.9%,Engineering&Medicine=69%,mean age=20.99,SD=1.60).The apprentices completed standardized measures of motivation to learn,transfer self-efficacy perceived content validity,mentoring function,and continuous learning work culture.Structural equation modeling was used to analyze the data.Results showed perceived content validity,mentoring function,continuous learning culture to predict intention to transfer learning.Of these factors,perceived content validity was the strongest predictor of intention to transfer learning.Of these factors,perceived content validity was the most influential predictor of intention to transfer learning.The motivation to learn and transfer self-efficacy sequentially mediated the relationship between mentoring function and intention to learning transfer to be stronger than by either alone.Although perceived content validity and continuous learning culture exhibited no significant direct effects on intention to transfer learning,they demonstrated positive indirect associations with intention to transfer via motivation to learn and transfer self-efficacy.These study findings extend the applications of the learning transfer framework to individuals undergoing apprenticeship training which also would apply to other a long-term work-based learning programs.展开更多
With the rapid development of virtual reality technology,it has been widely used in the field of education.It can promote the development of learning transfer,which is an effective method for learners to learn effecti...With the rapid development of virtual reality technology,it has been widely used in the field of education.It can promote the development of learning transfer,which is an effective method for learners to learn effectively.Therefore,this paper describes how to use virtual reality technology to achieve learning transfer in order to achieve teaching goals and improve learning efficiency.展开更多
With the development of cognitive psychology, the cognitive mechanism of learning transfer has recently become the focus in the general field of cognitive psychology. Based on the same theory, tasked based language te...With the development of cognitive psychology, the cognitive mechanism of learning transfer has recently become the focus in the general field of cognitive psychology. Based on the same theory, tasked based language teaching has become more and more popular. This thesis first analyzes the various factors which affect foreign language learning, then introduces some useful teaching strategies with an aim to help students get an efficient transfer of language knowledge and communicative competence in the context of tasked-based language teaching.展开更多
The current deep learning models for braced excavation cannot predict deformation from the beginning of excavation due to the need for a substantial corpus of sufficient historical data for training purposes.To addres...The current deep learning models for braced excavation cannot predict deformation from the beginning of excavation due to the need for a substantial corpus of sufficient historical data for training purposes.To address this issue,this study proposes a transfer learning model based on a sequence-to-sequence twodimensional(2D)convolutional long short-term memory neural network(S2SCL2D).The model can use the existing data from other adjacent similar excavations to achieve wall deflection prediction once a limited amount of monitoring data from the target excavation has been recorded.In the absence of adjacent excavation data,numerical simulation data from the target project can be employed instead.A weight update strategy is proposed to improve the prediction accuracy by integrating the stochastic gradient masking with an early stopping mechanism.To illustrate the proposed methodology,an excavation project in Hangzhou,China is adopted.The proposed deep transfer learning model,which uses either adjacent excavation data or numerical simulation data as the source domain,shows a significant improvement in performance when compared to the non-transfer learning model.Using the simulation data from the target project even leads to better prediction performance than using the actual monitoring data from other adjacent excavations.The results demonstrate that the proposed model can reasonably predict the deformation with limited data from the target project.展开更多
In radiology,magnetic resonance imaging(MRI)is an essential diagnostic tool that provides detailed images of a patient’s anatomical and physiological structures.MRI is particularly effective for detecting soft tissue...In radiology,magnetic resonance imaging(MRI)is an essential diagnostic tool that provides detailed images of a patient’s anatomical and physiological structures.MRI is particularly effective for detecting soft tissue anomalies.Traditionally,radiologists manually interpret these images,which can be labor-intensive and time-consuming due to the vast amount of data.To address this challenge,machine learning,and deep learning approaches can be utilized to improve the accuracy and efficiency of anomaly detection in MRI scans.This manuscript presents the use of the Deep AlexNet50 model for MRI classification with discriminative learning methods.There are three stages for learning;in the first stage,the whole dataset is used to learn the features.In the second stage,some layers of AlexNet50 are frozen with an augmented dataset,and in the third stage,AlexNet50 with an augmented dataset with the augmented dataset.This method used three publicly available MRI classification datasets:Harvard whole brain atlas(HWBA-dataset),the School of Biomedical Engineering of Southern Medical University(SMU-dataset),and The National Institute of Neuroscience and Hospitals brain MRI dataset(NINS-dataset)for analysis.Various hyperparameter optimizers like Adam,stochastic gradient descent(SGD),Root mean square propagation(RMS prop),Adamax,and AdamW have been used to compare the performance of the learning process.HWBA-dataset registers maximum classification performance.We evaluated the performance of the proposed classification model using several quantitative metrics,achieving an average accuracy of 98%.展开更多
Background:Pneumoconioses,a group of occupational lung diseases caused by inhalation of mineral dust,pose significant health risks to affected individuals.Accurate assessment of profusion(extent of lung involvement)in...Background:Pneumoconioses,a group of occupational lung diseases caused by inhalation of mineral dust,pose significant health risks to affected individuals.Accurate assessment of profusion(extent of lung involvement)in chest radiographs is essential for screening,diagnosis and monitoring of the diseases along with epidemiological classification.This study explores an automated classification system combining U-Net-based segmentation for lung field delineation and DenseNet121 with ImageNet-based transfer learning for profusion classification.Methods:Lung field segmentation using U-Net achieved precise delineation,ensuring accurate region-of-interest definition.Transfer learning with DenseNet121 leveraged pre-trained knowledge from ImageNet,minimizing the need for extensive training.The model was fine-tuned with International Labour Organization(ILO)-2022 version standard chest radiographs and evaluated on a diverse dataset of ILO-2000 version standardized radiographs.Results:The U-Net-based segmentation demonstrated robust performance(Accuracy 94%and Dice Coefficient 90%),facilitating subsequent profusion classification.The DenseNet121-based transfer learning model exhibited high accuracy(95%),precision(92%),and recall(94%)for classifying four profusion levels on test ILO 2000/2011D dataset.The final Evaluation on ILO-2000 radiographs highlighted its generalization capability.Conclusion:The proposed system offers clinical promise,aiding radiologists,pulmonologists,general physicians,and occupational health specialists in pneumoconioses screening,diagnosis,monitoring and epidemiological classification.Best of our knowledge,this is the first work in the field of automated Classification of Profusion in Chest Radiographs of Pneumoconioses based on recently published latest ILO-2022 standard.Future research should focus on further refinement and real-world validation.This approach exemplifies the potential of deep learning for enhancing the accuracy and efficiency of pneumoconioses assessment,benefiting industrial workers,patients,and healthcare providers.展开更多
A healthy brain is vital to every person since the brain controls every movement and emotion.Sometimes,some brain cells grow unexpectedly to be uncontrollable and cancerous.These cancerous cells are called brain tumor...A healthy brain is vital to every person since the brain controls every movement and emotion.Sometimes,some brain cells grow unexpectedly to be uncontrollable and cancerous.These cancerous cells are called brain tumors.For diagnosed patients,their lives depend mainly on the early diagnosis of these tumors to provide suitable treatment plans.Nowadays,Physicians and radiologists rely on Magnetic Resonance Imaging(MRI)pictures for their clinical evaluations of brain tumors.These evaluations are time-consuming,expensive,and require expertise with high skills to provide an accurate diagnosis.Scholars and industrials have recently partnered to implement automatic solutions to diagnose the disease with high accuracy.Due to their accuracy,some of these solutions depend on deep-learning(DL)methodologies.These techniques have become important due to their roles in the diagnosis process,which includes identification and classification.Therefore,there is a need for a solid and robust approach based on a deep-learning method to diagnose brain tumors.The purpose of this study is to develop an intelligent automatic framework for brain tumor diagnosis.The proposed solution is based on a novel dense dynamic residual self-attention transfer adaptive learning fusion approach(NDDRSATALFA),carried over two implemented deep-learning networks:VGG19 and UNET to identify and classify brain tumors.In addition,this solution applies a transfer learning approach to exchange extracted features and data within the two neural networks.The presented framework is trained,validated,and tested on six public datasets of MRIs to detect brain tumors and categorize these tumors into three suitable classes,which are glioma,meningioma,and pituitary.The proposed framework yielded remarkable findings on variously evaluated performance indicators:99.32%accuracy,98.74%sensitivity,98.89%specificity,99.01%Dice,98.93%Area Under the Curve(AUC),and 99.81%F1-score.In addition,a comparative analysis with recent state-of-the-art methods was performed and according to the comparative analysis,NDDRSATALFA shows an admirable level of reliability in simplifying the timely identification of diverse brain tumors.Moreover,this framework can be applied by healthcare providers to assist radiologists,pathologists,and physicians in their evaluations.The attained outcomes open doors for advanced automatic solutions that improve clinical evaluations and provide reasonable treatment plans.展开更多
Transfer learning is the predominant method for adapting pre-trained models on another task to new domains while preserving their internal architectures and augmenting them with requisite layers in Deep Neural Network...Transfer learning is the predominant method for adapting pre-trained models on another task to new domains while preserving their internal architectures and augmenting them with requisite layers in Deep Neural Network models.Training intricate pre-trained models on a sizable dataset requires significant resources to fine-tune hyperparameters carefully.Most existing initialization methods mainly focus on gradient flow-related problems,such as gradient vanishing or exploding,or other existing approaches that require extra models that do not consider our setting,which is more practical.To address these problems,we suggest employing gradient-free heuristic methods to initialize the weights of the final new-added fully connected layer in neural networks froma small set of training data with fewer classes.The approach relies on partitioning the output values from pre-trained models for a small set into two separate intervals determined by the targets.This process is framed as an optimization problem for each output neuron and class.The optimization selects the highest values as weights,considering their direction towards the respective classes.Furthermore,empirical 145 experiments involve a variety of neural networkmodels tested acrossmultiple benchmarks and domains,occasionally yielding accuracies comparable to those achieved with gradient descent methods by using only small subsets.展开更多
Seismic fault identification is a critical step in structural interpretation,reservoir characterization,and well-drilling planning.However,fault identification in deep fault-karst carbonate formations is particularly ...Seismic fault identification is a critical step in structural interpretation,reservoir characterization,and well-drilling planning.However,fault identification in deep fault-karst carbonate formations is particularly challenging due to their deep burial depth and the complex effects of dissolution.Traditional manual interpretation methods are often labor intensive and prone to high uncertainty due to their subjective nature.To address these limitations,this study proposes a transfer learningebased strategy for fault identification in deep fault-karst carbonate formations.The proposed methodology began with the generation of a large volume of synthetic seismic samples based on statistical fault distribution patterns observed in the study area.These synthetic samples were used to pretrain an improved U-Net network architecture,enhanced with an attention mechanism,to create a robust pretrained model.Subsequently,real-world fault labels were manually annotated based on verified fault interpretations and integrated into the training dataset.This combination of synthetic and real-world data was used to fine-tune the pretrained model,significantly improving its fault interpretation accuracy.The experimental results demonstrate that the integration of synthetic and real-world samples effectively enhances the quality of the training dataset.Furthermore,the proposed transfer learning strategy significantly im-proves fault recognition accuracy.By replacing the traditional weighted cross-entropy loss function with the Dice loss function,the model successfully addresses the issue of extreme class imbalance between positive and negative samples.Practical applications confirm that the proposed transfer learning strategy can accurately identify fault structures in deep fault-karst carbonate formations,providing a novel and effective technical approach for fault interpretation in such complex geological settings.展开更多
Deep Learning-based systems for Finger vein recognition have gained rising attention in recent years due to improved efficiency and enhanced security.The performance of existing CNN-based methods is limited by the pun...Deep Learning-based systems for Finger vein recognition have gained rising attention in recent years due to improved efficiency and enhanced security.The performance of existing CNN-based methods is limited by the puny generalization of learned features and deficiency of the finger vein image training data.Considering the concerns of existing methods,in this work,a simplified deep transfer learning-based framework for finger-vein recognition is developed using an EfficientNet model of deep learning with a self-attention mechanism.Data augmentation using various geometrical methods is employed to address the problem of training data shortage required for a deep learning model.The proposed model is tested using K-fold cross-validation on three publicly available datasets:HKPU,FVUSM,and SDUMLA.Also,the developed network is compared with other modern deep nets to check its effectiveness.In addition,a comparison of the proposed method with other existing Finger vein recognition(FVR)methods is also done.The experimental results exhibited superior recognition accuracy of the proposed method compared to other existing methods.In addition,the developed method proves to be more effective and less sophisticated at extracting robust features.The proposed EffAttenNet achieves an accuracy of 98.14%on HKPU,99.03%on FVUSM,and 99.50%on SDUMLA databases.展开更多
The COVID-19 pandemic,which was declared by the WHO,had created a global health crisis and disrupted people’s daily lives.A large number of people were affected by the COVID-19 pandemic.Therefore,a diagnostic model n...The COVID-19 pandemic,which was declared by the WHO,had created a global health crisis and disrupted people’s daily lives.A large number of people were affected by the COVID-19 pandemic.Therefore,a diagnostic model needs to be generated which can effectively classify the COVID and non-COVID cases.In this work,our aim is to develop a diagnostic model based on deep features using effectiveness of Chest X-ray(CXR)in distinguishing COVID from non-COVID cases.The proposed diagnostic framework utilizes CXR to diagnose COVID-19 and includes Grad-CAM visualizations for a visual interpretation of predicted images.The model’s performance was evaluated using various metrics,including accuracy,precision,recall,F1-score,and Gmean.Several machine learning models,such as random forest,dense neural network,SVM,twin SVM,extreme learning machine,random vector functional link,and kernel ridge regression,were selected to diagnose COVID-19 cases.Transfer learning was used to extract deep features.For feature extraction many CNN-based models such as Inception V3,MobileNet,ResNet50,VGG16 and Xception models are used.It was evident from the experiments that ResNet50 architecture outperformed all other CNN architectures based on AUC.The TWSVM classifier achieved the highest AUC score of 0.98 based on the ResNet50 feature vector.展开更多
The electromagnetic pulse valve,as a key component in baghouse dust removal systems,plays a crucial role in the performance of the system.However,despite the promising results of intelligent fault diagnosis methods ba...The electromagnetic pulse valve,as a key component in baghouse dust removal systems,plays a crucial role in the performance of the system.However,despite the promising results of intelligent fault diagnosis methods based on extensive data in diagnosing electromagnetic valves,real-world diagnostic scenarios still face numerous challenges.Collecting fault data for electromagnetic pulse valves is not only time-consuming but also costly,making it difficult to obtain sufficient fault data in advance,which poses challenges for small sample fault diagnosis.To address this issue,this paper proposes a fault diagnosis method for electromagnetic pulse valves based on deep transfer learning and simulated data.This method achieves effective transfer from simulated data to real data through four parameter transfer strategies,which combine parameter freezing and fine-tuning operations.Furthermore,this paper identifies a parameter transfer strategy that simultaneously fine-tunes the feature extractor and classifier,and introduces an attention mechanism to integrate fault features,thereby enhancing the correlation and information complementarity among multi-sensor data.The effectiveness of the proposed method is evaluated through two fault diagnosis cases under different operating conditions.In this study,small sample data accounted for 7.9%and 8.2%of the total dataset,and the experimental results showed transfer accuracies of 93.5%and 94.2%,respectively,validating the reliability and effectiveness of the method under small sample conditions.展开更多
Aerodynamic evaluation under multi-condition is indispensable for the design of aircraft,and the requirement for mass data still means a high cost.To address this problem,we propose a novel point-cloud multi-condition...Aerodynamic evaluation under multi-condition is indispensable for the design of aircraft,and the requirement for mass data still means a high cost.To address this problem,we propose a novel point-cloud multi-condition aerodynamics transfer learning(PCMCA-TL)framework that enables aerodynamic prediction in data-scarce sce-narios by transferring knowledge from well-learned scenarios.We modified the PointNeXt segmentation archi-tecture to a PointNeXtReg+regression model,including a working condition input module.The model is first pre-trained on a public dataset with 2000 shapes but only one working condition and then fine-tuned on a multi-condition small-scale spaceplane dataset.The effectiveness of the PCMCA-TL framework is verified by comparing the pressure coefficients predicted by direct training,pre-training,and TL models.Furthermore,by comparing the aerodynamic force coefficients calculated by predicted pressure coefficients in seconds with the correspond-ing CFD results obtained in hours,the accuracy highlights the development potential of deep transfer learning in aerodynamic evaluation.展开更多
Many applications,including security systems,medical diagnostics,and human-computer interfaces,depend on eye gaze recognition.However,due to factors including individual variations,occlusions,and shifting illumination...Many applications,including security systems,medical diagnostics,and human-computer interfaces,depend on eye gaze recognition.However,due to factors including individual variations,occlusions,and shifting illumination conditions,real-world scenarios continue to provide difficulties for accurate and consistent eye gaze recognition.This work is aimed at investigating the potential benefits of employing transfer learning to improve eye gaze detection ability and efficiency.Transfer learning is the process of fine-tuning pre-trained models on smaller,domain-specific datasets after they have been trained on larger datasets.We study several transfer learning algorithms and evaluate their effectiveness on eye gaze identification,including both Regression and Classification tasks,using a range of deep learning architectures,namely AlexNet,Visual Geometry Group(VGG),InceptionV3,and ResNet.In this study,we evaluate the effectiveness of transfer learning-basedmodels against models that were trained fromscratch using eye-gazing datasets on grounds of various performance and loss metrics such as Precision,Accuracy,and Mean Absolute Error.We investigate the effects of different pre-trainedmodels,dataset sizes,and domain gaps on the transfer learning process,and the findings of our study clarify the efficacy of transfer learning for eye gaze detection and offer suggestions for the most successful transfer learning strategies to apply in real-world situations.展开更多
Named Entity Recognition(NER)is vital in natural language processing for the analysis of news texts,as it accurately identifies entities such as locations,persons,and organizations,which is crucial for applications li...Named Entity Recognition(NER)is vital in natural language processing for the analysis of news texts,as it accurately identifies entities such as locations,persons,and organizations,which is crucial for applications like news summarization and event tracking.However,NER in the news domain faces challenges due to insufficient annotated data,complex entity structures,and strong context dependencies.To address these issues,we propose a new Chinesenamed entity recognition method that integrates transfer learning with word embeddings.Our approach leverages the ERNIE pre-trained model for transfer learning and obtaining general language representations and incorporates the Soft-lexicon word embedding technique to handle varied entity structures.This dual-strategy enhances the model’s understanding of context and boosts its ability to process complex texts.Experimental results show that our method achieves an F1 score of 94.72% on a news dataset,surpassing baseline methods by 3%–4%,thereby confirming its effectiveness for Chinese-named entity recognition in the news domain.展开更多
Traffic flow prediction is a key component of intelligent transportation systems,particularly in datascarce regions where traditional models relying on complete datasets often fail to provide accurate forecasts.These ...Traffic flow prediction is a key component of intelligent transportation systems,particularly in datascarce regions where traditional models relying on complete datasets often fail to provide accurate forecasts.These regions are characterized by limited sensor coverage and sparse data collection,pose significant challenges for existing prediction methods.To address this,we propose a novel transfer learning framework called transfer learning with deep knowledge distillation(TL-DKD),which combines graph neural network(GNN)with deep knowledge distillation to enable effective knowledge transfer from data-rich to data-scarce domains.Our contributions are three-fold:(1)We introduce,for the first time,a unique integration of deep knowledge distillation and transfer learning,enhancing feature adaptability across diverse traffic datasets while addressing data scarcity.(2)We design an encoder-decoder architecture where the encoder retains generalized spatiotemporal patterns fromsource domains,and the decoder finetunes predictions for target domains,ensuring minimal information loss during transfer.(3)Extensive experiments on five real-world datasets(METR-LA,PeMS-Bay,PeMS03/04/08)demonstrate the framework’s robustness.The TL-DKD model achieves significant improvements in prediction accuracy,especially in data-scarce scenarios.For example,the PEMSD4 dataset in multi-region experiments,it achieves a mean absolute error(MAE)of 20.08,a mean absolute percentage error(MAPE)of 13.59%,and a root mean squared error(RMSE)of 31.75 for 30-min forecasts.Additionally,noise-augmented experiments show improved adaptability under perturbed data conditions.These results highlight the framework’s practical impact,offering a scalable solution for accurate traffic predictions in resource-constrained environments.展开更多
This study presents an emergency control method for sub-synchronous oscillations in wind power gridconnected systems based on transfer learning,addressing the issue of insufficient generalization ability of traditiona...This study presents an emergency control method for sub-synchronous oscillations in wind power gridconnected systems based on transfer learning,addressing the issue of insufficient generalization ability of traditional methods in complex real-world scenarios.By combining deep reinforcement learning with a transfer learning framework,cross-scenario knowledge transfer is achieved,significantly enhancing the adaptability of the control strategy.First,a sub-synchronous oscillation emergency control model for the wind power grid integration system is constructed under fixed scenarios based on deep reinforcement learning.A reward evaluation system based on the active power oscillation pattern of the system is proposed,introducing penalty functions for the number of machine-shedding rounds and the number of machines shed.This avoids the economic losses and grid security risks caused by the excessive one-time shedding of wind turbines.Furthermore,transfer learning is introduced into model training to enhance the model’s generalization capability in dealing with complex scenarios of actual wind power grid integration systems.By introducing the Maximum Mean Discrepancy(MMD)algorithm to calculate the distribution differences between source data and target data,the online decision-making reliability of the emergency control model is improved.Finally,the effectiveness of the proposed emergency control method for multi-scenario sub-synchronous oscillation in wind power grid integration systems based on transfer learning is analyzed using the New England 39-bus system.展开更多
With the advancements in parameter-efficient transfer learning techniques,it has become feasible to leverage large pre-trained language models for downstream tasks under low-cost and low-resource conditions.However,ap...With the advancements in parameter-efficient transfer learning techniques,it has become feasible to leverage large pre-trained language models for downstream tasks under low-cost and low-resource conditions.However,applying this technique to multimodal knowledge transfer introduces a significant challenge:ensuring alignment across modalities while minimizing the number of additional parameters required for downstream task adaptation.This paper introduces UniTrans,a framework aimed at facilitating efficient knowledge transfer across multiple modalities.UniTrans leverages Vector-based Cross-modal Random Matrix Adaptation to enable fine-tuning with minimal parameter overhead.To further enhance modality alignment,we introduce two key components:the Multimodal Consistency Alignment Module and the Query-Augmentation Side Network,specifically optimized for scenarios with extremely limited trainable parameters.Extensive evaluations on various cross-modal downstream tasks demonstrate that our approach surpasses state-of-the-art methods while using just 5%of their trainable parameters.Additionally,it achieves superior performance compared to fully fine-tuned models on certain benchmarks.展开更多
The Internet of Things(IoT)is an innovation that combines imagined space with the actual world on a single platform.Because of the recent rapid rise of IoT devices,there has been a lack of standards,leading to a massi...The Internet of Things(IoT)is an innovation that combines imagined space with the actual world on a single platform.Because of the recent rapid rise of IoT devices,there has been a lack of standards,leading to a massive increase in unprotected devices connecting to networks.Consequently,cyberattacks on IoT are becoming more common,particularly keylogging attacks,which are often caused by security vulnerabilities on IoT networks.This research focuses on the role of transfer learning and ensemble classifiers in enhancing the detection of keylogging attacks within small,imbalanced IoT datasets.The authors propose a model that combines transfer learning with ensemble classification methods,leading to improved detection accuracy.By leveraging the BoT-IoT and keylogger_detection datasets,they facilitate the transfer of knowledge across various domains.The results reveal that the integration of transfer learning and ensemble classifiers significantly improves detection capabilities,even in scenarios with limited data availability.The proposed TRANS-ENS model showcases exceptional accuracy and a minimal false positive rate,outperforming current deep learning approaches.The primary objectives include:(i)introducing an ensemble feature selection technique to identify common features across models,(ii)creating a pre-trained deep learning model through transfer learning for the detection of keylogging attacks,and(iii)developing a transfer learning-ensemble model dedicated to keylogging detection.Experimental findings indicate that the TRANS-ENS model achieves a detection accuracy of 96.06%and a false alarm rate of 0.12%,surpassing existing models such as CNN,RNN,and LSTM.展开更多
Effective fault identification is crucial for bearings, which are critical components of mechanical systems and play a pivotal role in ensuring overall safety and operational efficiency. Bearings operate under variabl...Effective fault identification is crucial for bearings, which are critical components of mechanical systems and play a pivotal role in ensuring overall safety and operational efficiency. Bearings operate under variable service conditions, and their diagnostic environments are complex and dynamic. In the process of bearing diagnosis, fault datasets are relatively scarce compared with datasets representing normal operating conditions. These challenges frequently cause the practicality of fault detection to decline, the extraction of fault features to be incomplete, and the diagnostic accuracy of many existing models to decrease. In this work, a transfer-learning framework, designated DSCNN-HA-TL, is introduced to address the enduring challenge of cross-condition diagnosis in rolling-bearing fault detection. The framework integrates a window global mixed attention mechanism with a deep separable convolutional network, thereby enabling adaptation to fault detection tasks under diverse operating conditions. First, a Convolutional Neural Network (CNN) is employed as the foundational architecture, where the original convolutional layers are enhanced through the incorporation of depthwise separable convolutions, resulting in a Depthwise Separable Convolutional Neural Network (DSCNN) architecture. Subsequently, the extraction of fault characteristics is further refined through a dual-branch network that integrates hybrid attention mechanisms, specifically windowed and global attention mechanisms. This approach enables the acquisition of multi-level feature fusion information, thereby enhancing the accuracy of fault classification. The integration of these features not only optimizes the characteristic extraction process but also yields improvements in accuracy, representational capacity, and robustness in fault feature recognition. In conclusion, the proposed method achieved average precisions of 99.93% and 99.55% in transfer learning tasks, as demonstrated by the experimental results obtained from the CWRU public dataset and the bearing fault detection platform dataset. The experimental findings further provided a detailed comparison between the diagnostic models before and after the enhancement, thereby substantiating the pronounced advantages of the DSCNN-HA-TL approach in accurately identifying faults in critical mechanical components under diverse operating conditions.展开更多
基金funded by Hanshan Normal University School-Level Research Initiation Program(grant numbers QD202244QD2024207)+3 种基金the Guangdong Higher Education Society’s“Fourteenth Five-Year”Plan 2024 Higher Education Research(grant number 24GYB43)the 2024 Guangdong Provincial Undergraduate Teaching Quality and Teaching Reform Engineering Project:Excellence Program for Cultivating Publicly-Funded Pre-service Teachers for Primary Education in the Context of Rural Revitalizationthe Hanshan Normal University Guangdong East Regional Education Collaborative Innovation Research Centerfunded by these sources.
文摘This study utilized a sequential mediating model to examine the role of motivation to learn and transfer selfefficacy in the relationships between perceived content validity,mentoring function,continuous learning work culture and intention to transfer learning.The sample comprized 429 final-year apprentices in Guangdong province,China(females=69.9%,Engineering&Medicine=69%,mean age=20.99,SD=1.60).The apprentices completed standardized measures of motivation to learn,transfer self-efficacy perceived content validity,mentoring function,and continuous learning work culture.Structural equation modeling was used to analyze the data.Results showed perceived content validity,mentoring function,continuous learning culture to predict intention to transfer learning.Of these factors,perceived content validity was the strongest predictor of intention to transfer learning.Of these factors,perceived content validity was the most influential predictor of intention to transfer learning.The motivation to learn and transfer self-efficacy sequentially mediated the relationship between mentoring function and intention to learning transfer to be stronger than by either alone.Although perceived content validity and continuous learning culture exhibited no significant direct effects on intention to transfer learning,they demonstrated positive indirect associations with intention to transfer via motivation to learn and transfer self-efficacy.These study findings extend the applications of the learning transfer framework to individuals undergoing apprenticeship training which also would apply to other a long-term work-based learning programs.
文摘With the rapid development of virtual reality technology,it has been widely used in the field of education.It can promote the development of learning transfer,which is an effective method for learners to learn effectively.Therefore,this paper describes how to use virtual reality technology to achieve learning transfer in order to achieve teaching goals and improve learning efficiency.
文摘With the development of cognitive psychology, the cognitive mechanism of learning transfer has recently become the focus in the general field of cognitive psychology. Based on the same theory, tasked based language teaching has become more and more popular. This thesis first analyzes the various factors which affect foreign language learning, then introduces some useful teaching strategies with an aim to help students get an efficient transfer of language knowledge and communicative competence in the context of tasked-based language teaching.
基金supported by the National Key Research and Development Program of China(Grant No.2023YFC3009400)the National Natural Science Foundation of China(Grant Nos.42307218 and U2239251).
文摘The current deep learning models for braced excavation cannot predict deformation from the beginning of excavation due to the need for a substantial corpus of sufficient historical data for training purposes.To address this issue,this study proposes a transfer learning model based on a sequence-to-sequence twodimensional(2D)convolutional long short-term memory neural network(S2SCL2D).The model can use the existing data from other adjacent similar excavations to achieve wall deflection prediction once a limited amount of monitoring data from the target excavation has been recorded.In the absence of adjacent excavation data,numerical simulation data from the target project can be employed instead.A weight update strategy is proposed to improve the prediction accuracy by integrating the stochastic gradient masking with an early stopping mechanism.To illustrate the proposed methodology,an excavation project in Hangzhou,China is adopted.The proposed deep transfer learning model,which uses either adjacent excavation data or numerical simulation data as the source domain,shows a significant improvement in performance when compared to the non-transfer learning model.Using the simulation data from the target project even leads to better prediction performance than using the actual monitoring data from other adjacent excavations.The results demonstrate that the proposed model can reasonably predict the deformation with limited data from the target project.
文摘In radiology,magnetic resonance imaging(MRI)is an essential diagnostic tool that provides detailed images of a patient’s anatomical and physiological structures.MRI is particularly effective for detecting soft tissue anomalies.Traditionally,radiologists manually interpret these images,which can be labor-intensive and time-consuming due to the vast amount of data.To address this challenge,machine learning,and deep learning approaches can be utilized to improve the accuracy and efficiency of anomaly detection in MRI scans.This manuscript presents the use of the Deep AlexNet50 model for MRI classification with discriminative learning methods.There are three stages for learning;in the first stage,the whole dataset is used to learn the features.In the second stage,some layers of AlexNet50 are frozen with an augmented dataset,and in the third stage,AlexNet50 with an augmented dataset with the augmented dataset.This method used three publicly available MRI classification datasets:Harvard whole brain atlas(HWBA-dataset),the School of Biomedical Engineering of Southern Medical University(SMU-dataset),and The National Institute of Neuroscience and Hospitals brain MRI dataset(NINS-dataset)for analysis.Various hyperparameter optimizers like Adam,stochastic gradient descent(SGD),Root mean square propagation(RMS prop),Adamax,and AdamW have been used to compare the performance of the learning process.HWBA-dataset registers maximum classification performance.We evaluated the performance of the proposed classification model using several quantitative metrics,achieving an average accuracy of 98%.
文摘Background:Pneumoconioses,a group of occupational lung diseases caused by inhalation of mineral dust,pose significant health risks to affected individuals.Accurate assessment of profusion(extent of lung involvement)in chest radiographs is essential for screening,diagnosis and monitoring of the diseases along with epidemiological classification.This study explores an automated classification system combining U-Net-based segmentation for lung field delineation and DenseNet121 with ImageNet-based transfer learning for profusion classification.Methods:Lung field segmentation using U-Net achieved precise delineation,ensuring accurate region-of-interest definition.Transfer learning with DenseNet121 leveraged pre-trained knowledge from ImageNet,minimizing the need for extensive training.The model was fine-tuned with International Labour Organization(ILO)-2022 version standard chest radiographs and evaluated on a diverse dataset of ILO-2000 version standardized radiographs.Results:The U-Net-based segmentation demonstrated robust performance(Accuracy 94%and Dice Coefficient 90%),facilitating subsequent profusion classification.The DenseNet121-based transfer learning model exhibited high accuracy(95%),precision(92%),and recall(94%)for classifying four profusion levels on test ILO 2000/2011D dataset.The final Evaluation on ILO-2000 radiographs highlighted its generalization capability.Conclusion:The proposed system offers clinical promise,aiding radiologists,pulmonologists,general physicians,and occupational health specialists in pneumoconioses screening,diagnosis,monitoring and epidemiological classification.Best of our knowledge,this is the first work in the field of automated Classification of Profusion in Chest Radiographs of Pneumoconioses based on recently published latest ILO-2022 standard.Future research should focus on further refinement and real-world validation.This approach exemplifies the potential of deep learning for enhancing the accuracy and efficiency of pneumoconioses assessment,benefiting industrial workers,patients,and healthcare providers.
基金funded by the Deanship of Scientific Research(DSR)at King Abdulaziz University,Jeddah,Saudi Arabia under Grant No.(GPIP:1055-829-2024).
文摘A healthy brain is vital to every person since the brain controls every movement and emotion.Sometimes,some brain cells grow unexpectedly to be uncontrollable and cancerous.These cancerous cells are called brain tumors.For diagnosed patients,their lives depend mainly on the early diagnosis of these tumors to provide suitable treatment plans.Nowadays,Physicians and radiologists rely on Magnetic Resonance Imaging(MRI)pictures for their clinical evaluations of brain tumors.These evaluations are time-consuming,expensive,and require expertise with high skills to provide an accurate diagnosis.Scholars and industrials have recently partnered to implement automatic solutions to diagnose the disease with high accuracy.Due to their accuracy,some of these solutions depend on deep-learning(DL)methodologies.These techniques have become important due to their roles in the diagnosis process,which includes identification and classification.Therefore,there is a need for a solid and robust approach based on a deep-learning method to diagnose brain tumors.The purpose of this study is to develop an intelligent automatic framework for brain tumor diagnosis.The proposed solution is based on a novel dense dynamic residual self-attention transfer adaptive learning fusion approach(NDDRSATALFA),carried over two implemented deep-learning networks:VGG19 and UNET to identify and classify brain tumors.In addition,this solution applies a transfer learning approach to exchange extracted features and data within the two neural networks.The presented framework is trained,validated,and tested on six public datasets of MRIs to detect brain tumors and categorize these tumors into three suitable classes,which are glioma,meningioma,and pituitary.The proposed framework yielded remarkable findings on variously evaluated performance indicators:99.32%accuracy,98.74%sensitivity,98.89%specificity,99.01%Dice,98.93%Area Under the Curve(AUC),and 99.81%F1-score.In addition,a comparative analysis with recent state-of-the-art methods was performed and according to the comparative analysis,NDDRSATALFA shows an admirable level of reliability in simplifying the timely identification of diverse brain tumors.Moreover,this framework can be applied by healthcare providers to assist radiologists,pathologists,and physicians in their evaluations.The attained outcomes open doors for advanced automatic solutions that improve clinical evaluations and provide reasonable treatment plans.
基金supported by the BK21 FOUR project(AI-driven Convergence Software Education Research Program)funded by the Ministry of Education,School of Computer Science and Engineering,Kyungpook National University,Republic of Korea(4120240214871)supported by the New Faculty Start Up Fund from LSU Health Sciences New Orleans,LA,USA.
文摘Transfer learning is the predominant method for adapting pre-trained models on another task to new domains while preserving their internal architectures and augmenting them with requisite layers in Deep Neural Network models.Training intricate pre-trained models on a sizable dataset requires significant resources to fine-tune hyperparameters carefully.Most existing initialization methods mainly focus on gradient flow-related problems,such as gradient vanishing or exploding,or other existing approaches that require extra models that do not consider our setting,which is more practical.To address these problems,we suggest employing gradient-free heuristic methods to initialize the weights of the final new-added fully connected layer in neural networks froma small set of training data with fewer classes.The approach relies on partitioning the output values from pre-trained models for a small set into two separate intervals determined by the targets.This process is framed as an optimization problem for each output neuron and class.The optimization selects the highest values as weights,considering their direction towards the respective classes.Furthermore,empirical 145 experiments involve a variety of neural networkmodels tested acrossmultiple benchmarks and domains,occasionally yielding accuracies comparable to those achieved with gradient descent methods by using only small subsets.
基金support provided by the China Postdoctoral Science Foundation(Grant No.2024M763650)the Excellent Young Scientists Fund Program of SINOPEC Petroleum Exploration and Production Research Institute(Grant No.yk2024010).
文摘Seismic fault identification is a critical step in structural interpretation,reservoir characterization,and well-drilling planning.However,fault identification in deep fault-karst carbonate formations is particularly challenging due to their deep burial depth and the complex effects of dissolution.Traditional manual interpretation methods are often labor intensive and prone to high uncertainty due to their subjective nature.To address these limitations,this study proposes a transfer learningebased strategy for fault identification in deep fault-karst carbonate formations.The proposed methodology began with the generation of a large volume of synthetic seismic samples based on statistical fault distribution patterns observed in the study area.These synthetic samples were used to pretrain an improved U-Net network architecture,enhanced with an attention mechanism,to create a robust pretrained model.Subsequently,real-world fault labels were manually annotated based on verified fault interpretations and integrated into the training dataset.This combination of synthetic and real-world data was used to fine-tune the pretrained model,significantly improving its fault interpretation accuracy.The experimental results demonstrate that the integration of synthetic and real-world samples effectively enhances the quality of the training dataset.Furthermore,the proposed transfer learning strategy significantly im-proves fault recognition accuracy.By replacing the traditional weighted cross-entropy loss function with the Dice loss function,the model successfully addresses the issue of extreme class imbalance between positive and negative samples.Practical applications confirm that the proposed transfer learning strategy can accurately identify fault structures in deep fault-karst carbonate formations,providing a novel and effective technical approach for fault interpretation in such complex geological settings.
文摘Deep Learning-based systems for Finger vein recognition have gained rising attention in recent years due to improved efficiency and enhanced security.The performance of existing CNN-based methods is limited by the puny generalization of learned features and deficiency of the finger vein image training data.Considering the concerns of existing methods,in this work,a simplified deep transfer learning-based framework for finger-vein recognition is developed using an EfficientNet model of deep learning with a self-attention mechanism.Data augmentation using various geometrical methods is employed to address the problem of training data shortage required for a deep learning model.The proposed model is tested using K-fold cross-validation on three publicly available datasets:HKPU,FVUSM,and SDUMLA.Also,the developed network is compared with other modern deep nets to check its effectiveness.In addition,a comparison of the proposed method with other existing Finger vein recognition(FVR)methods is also done.The experimental results exhibited superior recognition accuracy of the proposed method compared to other existing methods.In addition,the developed method proves to be more effective and less sophisticated at extracting robust features.The proposed EffAttenNet achieves an accuracy of 98.14%on HKPU,99.03%on FVUSM,and 99.50%on SDUMLA databases.
文摘The COVID-19 pandemic,which was declared by the WHO,had created a global health crisis and disrupted people’s daily lives.A large number of people were affected by the COVID-19 pandemic.Therefore,a diagnostic model needs to be generated which can effectively classify the COVID and non-COVID cases.In this work,our aim is to develop a diagnostic model based on deep features using effectiveness of Chest X-ray(CXR)in distinguishing COVID from non-COVID cases.The proposed diagnostic framework utilizes CXR to diagnose COVID-19 and includes Grad-CAM visualizations for a visual interpretation of predicted images.The model’s performance was evaluated using various metrics,including accuracy,precision,recall,F1-score,and Gmean.Several machine learning models,such as random forest,dense neural network,SVM,twin SVM,extreme learning machine,random vector functional link,and kernel ridge regression,were selected to diagnose COVID-19 cases.Transfer learning was used to extract deep features.For feature extraction many CNN-based models such as Inception V3,MobileNet,ResNet50,VGG16 and Xception models are used.It was evident from the experiments that ResNet50 architecture outperformed all other CNN architectures based on AUC.The TWSVM classifier achieved the highest AUC score of 0.98 based on the ResNet50 feature vector.
基金Supported by National Natural Science Foundation of China(Grant No.51675040)。
文摘The electromagnetic pulse valve,as a key component in baghouse dust removal systems,plays a crucial role in the performance of the system.However,despite the promising results of intelligent fault diagnosis methods based on extensive data in diagnosing electromagnetic valves,real-world diagnostic scenarios still face numerous challenges.Collecting fault data for electromagnetic pulse valves is not only time-consuming but also costly,making it difficult to obtain sufficient fault data in advance,which poses challenges for small sample fault diagnosis.To address this issue,this paper proposes a fault diagnosis method for electromagnetic pulse valves based on deep transfer learning and simulated data.This method achieves effective transfer from simulated data to real data through four parameter transfer strategies,which combine parameter freezing and fine-tuning operations.Furthermore,this paper identifies a parameter transfer strategy that simultaneously fine-tunes the feature extractor and classifier,and introduces an attention mechanism to integrate fault features,thereby enhancing the correlation and information complementarity among multi-sensor data.The effectiveness of the proposed method is evaluated through two fault diagnosis cases under different operating conditions.In this study,small sample data accounted for 7.9%and 8.2%of the total dataset,and the experimental results showed transfer accuracies of 93.5%and 94.2%,respectively,validating the reliability and effectiveness of the method under small sample conditions.
基金supported by the Natural Science Foundation of Hunan Province of China(Grant No.2021JJ10045).
文摘Aerodynamic evaluation under multi-condition is indispensable for the design of aircraft,and the requirement for mass data still means a high cost.To address this problem,we propose a novel point-cloud multi-condition aerodynamics transfer learning(PCMCA-TL)framework that enables aerodynamic prediction in data-scarce sce-narios by transferring knowledge from well-learned scenarios.We modified the PointNeXt segmentation archi-tecture to a PointNeXtReg+regression model,including a working condition input module.The model is first pre-trained on a public dataset with 2000 shapes but only one working condition and then fine-tuned on a multi-condition small-scale spaceplane dataset.The effectiveness of the PCMCA-TL framework is verified by comparing the pressure coefficients predicted by direct training,pre-training,and TL models.Furthermore,by comparing the aerodynamic force coefficients calculated by predicted pressure coefficients in seconds with the correspond-ing CFD results obtained in hours,the accuracy highlights the development potential of deep transfer learning in aerodynamic evaluation.
文摘Many applications,including security systems,medical diagnostics,and human-computer interfaces,depend on eye gaze recognition.However,due to factors including individual variations,occlusions,and shifting illumination conditions,real-world scenarios continue to provide difficulties for accurate and consistent eye gaze recognition.This work is aimed at investigating the potential benefits of employing transfer learning to improve eye gaze detection ability and efficiency.Transfer learning is the process of fine-tuning pre-trained models on smaller,domain-specific datasets after they have been trained on larger datasets.We study several transfer learning algorithms and evaluate their effectiveness on eye gaze identification,including both Regression and Classification tasks,using a range of deep learning architectures,namely AlexNet,Visual Geometry Group(VGG),InceptionV3,and ResNet.In this study,we evaluate the effectiveness of transfer learning-basedmodels against models that were trained fromscratch using eye-gazing datasets on grounds of various performance and loss metrics such as Precision,Accuracy,and Mean Absolute Error.We investigate the effects of different pre-trainedmodels,dataset sizes,and domain gaps on the transfer learning process,and the findings of our study clarify the efficacy of transfer learning for eye gaze detection and offer suggestions for the most successful transfer learning strategies to apply in real-world situations.
基金funded by Advanced Research Project(30209040702).
文摘Named Entity Recognition(NER)is vital in natural language processing for the analysis of news texts,as it accurately identifies entities such as locations,persons,and organizations,which is crucial for applications like news summarization and event tracking.However,NER in the news domain faces challenges due to insufficient annotated data,complex entity structures,and strong context dependencies.To address these issues,we propose a new Chinesenamed entity recognition method that integrates transfer learning with word embeddings.Our approach leverages the ERNIE pre-trained model for transfer learning and obtaining general language representations and incorporates the Soft-lexicon word embedding technique to handle varied entity structures.This dual-strategy enhances the model’s understanding of context and boosts its ability to process complex texts.Experimental results show that our method achieves an F1 score of 94.72% on a news dataset,surpassing baseline methods by 3%–4%,thereby confirming its effectiveness for Chinese-named entity recognition in the news domain.
基金supported by the National Natural Science Foundation of China(Grant No.52002031)the Shaanxi Province Key R&D Plan Project(No.2024GX-YBXM-002).
文摘Traffic flow prediction is a key component of intelligent transportation systems,particularly in datascarce regions where traditional models relying on complete datasets often fail to provide accurate forecasts.These regions are characterized by limited sensor coverage and sparse data collection,pose significant challenges for existing prediction methods.To address this,we propose a novel transfer learning framework called transfer learning with deep knowledge distillation(TL-DKD),which combines graph neural network(GNN)with deep knowledge distillation to enable effective knowledge transfer from data-rich to data-scarce domains.Our contributions are three-fold:(1)We introduce,for the first time,a unique integration of deep knowledge distillation and transfer learning,enhancing feature adaptability across diverse traffic datasets while addressing data scarcity.(2)We design an encoder-decoder architecture where the encoder retains generalized spatiotemporal patterns fromsource domains,and the decoder finetunes predictions for target domains,ensuring minimal information loss during transfer.(3)Extensive experiments on five real-world datasets(METR-LA,PeMS-Bay,PeMS03/04/08)demonstrate the framework’s robustness.The TL-DKD model achieves significant improvements in prediction accuracy,especially in data-scarce scenarios.For example,the PEMSD4 dataset in multi-region experiments,it achieves a mean absolute error(MAE)of 20.08,a mean absolute percentage error(MAPE)of 13.59%,and a root mean squared error(RMSE)of 31.75 for 30-min forecasts.Additionally,noise-augmented experiments show improved adaptability under perturbed data conditions.These results highlight the framework’s practical impact,offering a scalable solution for accurate traffic predictions in resource-constrained environments.
基金funded by Sponsorship of Science and Technology Project of State Grid Xinjiang Electric Power Co.,Ltd.,grant number SGXJ0000TKJS2400168.
文摘This study presents an emergency control method for sub-synchronous oscillations in wind power gridconnected systems based on transfer learning,addressing the issue of insufficient generalization ability of traditional methods in complex real-world scenarios.By combining deep reinforcement learning with a transfer learning framework,cross-scenario knowledge transfer is achieved,significantly enhancing the adaptability of the control strategy.First,a sub-synchronous oscillation emergency control model for the wind power grid integration system is constructed under fixed scenarios based on deep reinforcement learning.A reward evaluation system based on the active power oscillation pattern of the system is proposed,introducing penalty functions for the number of machine-shedding rounds and the number of machines shed.This avoids the economic losses and grid security risks caused by the excessive one-time shedding of wind turbines.Furthermore,transfer learning is introduced into model training to enhance the model’s generalization capability in dealing with complex scenarios of actual wind power grid integration systems.By introducing the Maximum Mean Discrepancy(MMD)algorithm to calculate the distribution differences between source data and target data,the online decision-making reliability of the emergency control model is improved.Finally,the effectiveness of the proposed emergency control method for multi-scenario sub-synchronous oscillation in wind power grid integration systems based on transfer learning is analyzed using the New England 39-bus system.
文摘With the advancements in parameter-efficient transfer learning techniques,it has become feasible to leverage large pre-trained language models for downstream tasks under low-cost and low-resource conditions.However,applying this technique to multimodal knowledge transfer introduces a significant challenge:ensuring alignment across modalities while minimizing the number of additional parameters required for downstream task adaptation.This paper introduces UniTrans,a framework aimed at facilitating efficient knowledge transfer across multiple modalities.UniTrans leverages Vector-based Cross-modal Random Matrix Adaptation to enable fine-tuning with minimal parameter overhead.To further enhance modality alignment,we introduce two key components:the Multimodal Consistency Alignment Module and the Query-Augmentation Side Network,specifically optimized for scenarios with extremely limited trainable parameters.Extensive evaluations on various cross-modal downstream tasks demonstrate that our approach surpasses state-of-the-art methods while using just 5%of their trainable parameters.Additionally,it achieves superior performance compared to fully fine-tuned models on certain benchmarks.
基金the Deanship of Graduate Studies and Scientific Research at Najran University for supporting the research project through the Group Research,with the project code NU/GP/SERC/13/712。
文摘The Internet of Things(IoT)is an innovation that combines imagined space with the actual world on a single platform.Because of the recent rapid rise of IoT devices,there has been a lack of standards,leading to a massive increase in unprotected devices connecting to networks.Consequently,cyberattacks on IoT are becoming more common,particularly keylogging attacks,which are often caused by security vulnerabilities on IoT networks.This research focuses on the role of transfer learning and ensemble classifiers in enhancing the detection of keylogging attacks within small,imbalanced IoT datasets.The authors propose a model that combines transfer learning with ensemble classification methods,leading to improved detection accuracy.By leveraging the BoT-IoT and keylogger_detection datasets,they facilitate the transfer of knowledge across various domains.The results reveal that the integration of transfer learning and ensemble classifiers significantly improves detection capabilities,even in scenarios with limited data availability.The proposed TRANS-ENS model showcases exceptional accuracy and a minimal false positive rate,outperforming current deep learning approaches.The primary objectives include:(i)introducing an ensemble feature selection technique to identify common features across models,(ii)creating a pre-trained deep learning model through transfer learning for the detection of keylogging attacks,and(iii)developing a transfer learning-ensemble model dedicated to keylogging detection.Experimental findings indicate that the TRANS-ENS model achieves a detection accuracy of 96.06%and a false alarm rate of 0.12%,surpassing existing models such as CNN,RNN,and LSTM.
基金supported by the National Natural Science Foundation of China(12272259)the Key Research and Development Fund of Universities in Hebei Province(2510800601A).
文摘Effective fault identification is crucial for bearings, which are critical components of mechanical systems and play a pivotal role in ensuring overall safety and operational efficiency. Bearings operate under variable service conditions, and their diagnostic environments are complex and dynamic. In the process of bearing diagnosis, fault datasets are relatively scarce compared with datasets representing normal operating conditions. These challenges frequently cause the practicality of fault detection to decline, the extraction of fault features to be incomplete, and the diagnostic accuracy of many existing models to decrease. In this work, a transfer-learning framework, designated DSCNN-HA-TL, is introduced to address the enduring challenge of cross-condition diagnosis in rolling-bearing fault detection. The framework integrates a window global mixed attention mechanism with a deep separable convolutional network, thereby enabling adaptation to fault detection tasks under diverse operating conditions. First, a Convolutional Neural Network (CNN) is employed as the foundational architecture, where the original convolutional layers are enhanced through the incorporation of depthwise separable convolutions, resulting in a Depthwise Separable Convolutional Neural Network (DSCNN) architecture. Subsequently, the extraction of fault characteristics is further refined through a dual-branch network that integrates hybrid attention mechanisms, specifically windowed and global attention mechanisms. This approach enables the acquisition of multi-level feature fusion information, thereby enhancing the accuracy of fault classification. The integration of these features not only optimizes the characteristic extraction process but also yields improvements in accuracy, representational capacity, and robustness in fault feature recognition. In conclusion, the proposed method achieved average precisions of 99.93% and 99.55% in transfer learning tasks, as demonstrated by the experimental results obtained from the CWRU public dataset and the bearing fault detection platform dataset. The experimental findings further provided a detailed comparison between the diagnostic models before and after the enhancement, thereby substantiating the pronounced advantages of the DSCNN-HA-TL approach in accurately identifying faults in critical mechanical components under diverse operating conditions.