Developing machine learning frameworks with predictive power,interpretability,and transferability is crucial,yet it faces challenges in the field of electrocatalysis.To achieve this,we employed rigorous feature engine...Developing machine learning frameworks with predictive power,interpretability,and transferability is crucial,yet it faces challenges in the field of electrocatalysis.To achieve this,we employed rigorous feature engineering to establish a finely tuned gradient boosting regressor(GBR)model,which adeptly captures the physical complexity from feature space to target variables.We demonstrated that environmental electron effects and atomic number significantly govern the success of the mapping process via global and local explanations.The finely tuned GBR model exhibits exceptional robustness in predicting CO adsorption energies(R_(ave)^(2)=0.937,RMSE=0.153 eV).Moreover,the model demonstrated remarkable transfer learning ability,showing excellent predictive power for OH,NO,and N_(2) adsorption.Importantly,the GBR model exhibits exceptional predictive capability across an extensive search space,thereby demonstrating profound adaptability and versatility.Our research framework significantly enhances the interpretability and transferability of machine learning in electrocatalysis,offering vital insights for further advancements.展开更多
Cardiovascular disease(CVD)remains a leading global health challenge due to its high mortality rate and the complexity of early diagnosis,driven by risk factors such as hypertension,high cholesterol,and irregular puls...Cardiovascular disease(CVD)remains a leading global health challenge due to its high mortality rate and the complexity of early diagnosis,driven by risk factors such as hypertension,high cholesterol,and irregular pulse rates.Traditional diagnostic methods often struggle with the nuanced interplay of these risk factors,making early detection difficult.In this research,we propose a novel artificial intelligence-enabled(AI-enabled)framework for CVD risk prediction that integrates machine learning(ML)with eXplainable AI(XAI)to provide both high-accuracy predictions and transparent,interpretable insights.Compared to existing studies that typically focus on either optimizing ML performance or using XAI separately for local or global explanations,our approach uniquely combines both local and global interpretability using Local Interpretable Model-Agnostic Explanations(LIME)and SHapley Additive exPlanations(SHAP).This dual integration enhances the interpretability of the model and facilitates clinicians to comprehensively understand not just what the model predicts but also why those predictions are made by identifying the contribution of different risk factors,which is crucial for transparent and informed decision-making in healthcare.The framework uses ML techniques such as K-nearest neighbors(KNN),gradient boosting,random forest,and decision tree,trained on a cardiovascular dataset.Additionally,the integration of LIME and SHAP provides patient-specific insights alongside global trends,ensuring that clinicians receive comprehensive and actionable information.Our experimental results achieve 98%accuracy with the Random Forest model,with precision,recall,and F1-scores of 97%,98%,and 98%,respectively.The innovative combination of SHAP and LIME sets a new benchmark in CVD prediction by integrating advanced ML accuracy with robust interpretability,fills a critical gap in existing approaches.This framework paves the way for more explainable and transparent decision-making in healthcare,ensuring that the model is not only accurate but also trustworthy and actionable for clinicians.展开更多
The bandgap is a key parameter for understanding and designing hybrid perovskite material properties,as well as developing photovoltaic devices.Traditional bandgap calculation methods like ultravioletvisible spectrosc...The bandgap is a key parameter for understanding and designing hybrid perovskite material properties,as well as developing photovoltaic devices.Traditional bandgap calculation methods like ultravioletvisible spectroscopy and first-principles calculations are time-and power-consuming,not to mention capturing bandgap change mechanisms for hybrid perovskite materials across a wide range of unknown space.In the present work,an artificial intelligence ensemble comprising two classifiers(with F1 scores of 0.9125 and 0.925)and a regressor(with mean squared error of 0.0014 eV)is constructed to achieve high-precision prediction of the bandgap.The bandgap perovskite dataset is established through highthroughput prediction of bandgaps by the ensemble.Based on the self-built dataset,partial dependence analysis(PDA)is developed to interpret the bandgap influential mechanism.Meanwhile,an interpretable mathematical model with an R^(2)of 0.8417 is generated using the genetic programming symbolic regression(GPSR)technique.The constructed PDA maps agree well with the Shapley Additive exPlanations,the GPSR model,and experiment verification.Through PDA,we reveal the boundary effect,the bowing effect,and their evolution trends with key descriptors.展开更多
Soil desiccation cracking is ubiquitous in nature and has significantpotential impacts on the engineering geological properties of soils.Previous studies have extensively examined various factors affecting soil cracki...Soil desiccation cracking is ubiquitous in nature and has significantpotential impacts on the engineering geological properties of soils.Previous studies have extensively examined various factors affecting soil cracking behavior through a numerous small-sample experiments.However,experimental studies alone cannot accurately describe soil cracking behavior.In this study,we firstly propose a modeling framework for predicting the surface crack ratio of soil desiccation cracking based on machine learning and interpretable analysis.The framework utilizes 1040 sets of soil cracking experimental data and employs random forest(RF),extreme gradient boosting(XGBoost),and artificialneural network(ANN)models to predict the surface crack ratio of soil desiccation cracking.To clarify the influenceof input features on soil cracking behavior,feature importance and Shapley additive explanations(SHAP)are applied for interpretability analysis.The results reveal that ensemble methods(RF and XGBoost)provide better predictive performance than the deep learning model(ANN).The feature importance analysis shows that soil desiccation cracking is primarily influencedby initial water content,plasticity index,finalwater content,liquid limit,sand content,clay content and thickness.Moreover,SHAP-based interpretability analysis further explores how soil cracking responds to various input variables.This study provides new insight into the evolution of soil cracking behavior,enhancing the understanding of its physical mechanisms and facilitating the assessment of potential regional development of soil desiccation cracking.展开更多
Accurately mapping the spatial distribution of soil organic carbon(SOC)is crucial for guiding agricultural management and improving soil carbon sequestration,especially in fragmented agricultural landscapes.Although r...Accurately mapping the spatial distribution of soil organic carbon(SOC)is crucial for guiding agricultural management and improving soil carbon sequestration,especially in fragmented agricultural landscapes.Although remote sensing provides spatially continuous environmental information about heterogeneous agricultural landscapes,its relationship with SOC remains unclear.In this study,we hypothesized that multi-category remote sensing-derived variables can enhance our understanding of SOC variation within complex landscape conditions.Taking the Qilu Lake watershed in Yunnan,China,as a case study area and based on 216 topsoil samples collected from irrigation areas,we applied the extreme gradient boosting(XGBoost)model to investigate the contributions of vegetation indices(VI),brightness indices(BI),moisture indices(MI),and spectral transformations(ST,principal component analysis and tasseled cap transformation)to SOC mapping.The results showed that ST contributed the most to SOC prediction accuracy,followed by MI,VI,and BI,with improvements in R2 of 29.27,26.83,19.51,and 14.43%,respectively.The dominance of ST can be attributed to the fact that it contains richer remote sensing spectral information.The optimal SOC prediction model integrated soil properties,topographic factors,location factors,and landscape metrics,as well as remote sensing-derived variables,and achieved RMSE and MAE of 15.05 and 11.42 g kg-1,and R2 and CCC of 0.57 and 0.72,respectively.The Shapley additive explanations deciphered the nonlinear and threshold effects that exist between soil moisture,vegetation status,soil brightness and SOC.Compared with traditional linear regression models,interpretable machine learning has advantages in prediction accuracy and revealing the influences of variables that reflect landscape characteristics on SOC.Overall,this study not only reveals how remote sensing-derived variables contribute to our understanding of SOC distribution in fragmented agricultural landscapes but also clarifies their efficacy.Through interpretable machine learning,we can further elucidate the causes of SOC variation,which is important for sustainable soil management and agricultural practices.展开更多
The belief rule-based(BRB)system has been popular in complexity system modeling due to its good interpretability.However,the current mainstream optimization methods of the BRB systems only focus on modeling accuracy b...The belief rule-based(BRB)system has been popular in complexity system modeling due to its good interpretability.However,the current mainstream optimization methods of the BRB systems only focus on modeling accuracy but ignore the interpretability.The single-objective optimization strategy has been applied in the interpretability-accuracy trade-off by inte-grating accuracy and interpretability into an optimization objec-tive.But the integration has a greater impact on optimization results with strong subjectivity.Thus,a multi-objective optimiza-tion framework in the modeling of BRB systems with inter-pretability-accuracy trade-off is proposed in this paper.Firstly,complexity and accuracy are taken as two independent opti-mization goals,and uniformity as a constraint to give the mathe-matical description.Secondly,a classical multi-objective opti-mization algorithm,nondominated sorting genetic algorithm II(NSGA-II),is utilized as an optimization tool to give a set of BRB systems with different accuracy and complexity.Finally,a pipeline leakage detection case is studied to verify the feasibility and effectiveness of the developed multi-objective optimization.The comparison illustrates that the proposed multi-objective optimization framework can effectively avoid the subjectivity of single-objective optimization,and has capability of joint optimiz-ing the structure and parameters of BRB systems with inter-pretability-accuracy trade-off.展开更多
Fault diagnosis plays an irreplaceable role in the normal operation of equipment.A fault diagnosis model is often required to be interpretable for increasing the trust between humans and the model.Due to the understan...Fault diagnosis plays an irreplaceable role in the normal operation of equipment.A fault diagnosis model is often required to be interpretable for increasing the trust between humans and the model.Due to the understandable knowledge expression and transparent reasoning process,the belief rule base(BRB)has extensive applications as an interpretable expert system in fault diagnosis.Optimization is an effective means to weaken the subjectivity of experts in BRB,where the interpretability of BRB may be weakened.Hence,to obtain a credible result,the weakening factors of interpretability in the BRB-based fault diagnosis model are firstly analyzed,which are manifested in deviation from the initial judgement of experts and over-optimization of parameters.For these two factors,three indexes are proposed,namely the consistency index of rules,consistency index of the rule base and over-optimization index,tomeasure the interpretability of the optimizedmodel.Considering both the accuracy and interpretability of amodel,an improved coordinate ascent(I-CA)algorithmis proposed to fine-tune the parameters of the fault diagnosis model based on BRB.In I-CA,the algorithm combined with the advance and retreat method and the golden section method is employed to be one-dimensional search algorithm.Furthermore,the random optimization sequence and adaptive step size are proposed to improve the accuracy of the model.Finally,a case study of fault diagnosis in aerospace relays based on BRB is carried out to verify the effectiveness of the proposed method.展开更多
Objective Clinical medical record data associated with hepatitis B-related acute-on-chronic liver failure(HBV-ACLF)generally have small sample sizes and a class imbalance.However,most machine learning models are desig...Objective Clinical medical record data associated with hepatitis B-related acute-on-chronic liver failure(HBV-ACLF)generally have small sample sizes and a class imbalance.However,most machine learning models are designed based on balanced data and lack interpretability.This study aimed to propose a traditional Chinese medicine(TCM)diagnostic model for HBV-ACLF based on the TCM syndrome differentiation and treatment theory,which is clinically interpretable and highly accurate.Methods We collected medical records from 261 patients diagnosed with HBV-ACLF,including three syndromes:Yang jaundice(214 cases),Yang-Yin jaundice(41 cases),and Yin jaundice(6 cases).To avoid overfitting of the machine learning model,we excluded the cases of Yin jaundice.After data standardization and cleaning,we obtained 255 relevant medical records of Yang jaundice and Yang-Yin jaundice.To address the class imbalance issue,we employed the oversampling method and five machine learning methods,including logistic regression(LR),support vector machine(SVM),decision tree(DT),random forest(RF),and extreme gradient boosting(XGBoost)to construct the syndrome diagnosis models.This study used precision,F1 score,the area under the receiver operating characteristic(ROC)curve(AUC),and accuracy as model evaluation metrics.The model with the best classification performance was selected to extract the diagnostic rule,and its clinical significance was thoroughly analyzed.Furthermore,we proposed a novel multiple-round stable rule extraction(MRSRE)method to obtain a stable rule set of features that can exhibit the model’s clinical interpretability.Results The precision of the five machine learning models built using oversampled balanced data exceeded 0.90.Among these models,the accuracy of RF classification of syndrome types was 0.92,and the mean F1 scores of the two categories of Yang jaundice and Yang-Yin jaundice were 0.93 and 0.94,respectively.Additionally,the AUC was 0.98.The extraction rules of the RF syndrome differentiation model based on the MRSRE method revealed that the common features of Yang jaundice and Yang-Yin jaundice were wiry pulse,yellowing of the urine,skin,and eyes,normal tongue body,healthy sublingual vessel,nausea,oil loathing,and poor appetite.The main features of Yang jaundice were a red tongue body and thickened sublingual vessels,whereas those of Yang-Yin jaundice were a dark tongue body,pale white tongue body,white tongue coating,lack of strength,slippery pulse,light red tongue body,slimy tongue coating,and abdominal distension.This is aligned with the classifications made by TCM experts based on TCM syndrome differentiation and treatment theory.Conclusion Our model can be utilized for differentiating HBV-ACLF syndromes,which has the potential to be applied to generate other clinically interpretable models with high accuracy on clinical data characterized by small sample sizes and a class imbalance.展开更多
With the development of Fintech, applying artificial intelligence (AI) technologies to the financial field is a general trend. However, there are some inappropriate conditions, for instance, the AI model is always tre...With the development of Fintech, applying artificial intelligence (AI) technologies to the financial field is a general trend. However, there are some inappropriate conditions, for instance, the AI model is always treated as a black box and cannot be interpreted. This paper studies the AI model interpretability when the models are applied in the financial field. We analyze the reasons of black box problem and explore the effective solutions. We propose a new kind of automatic Regtech tool—LIMER, and put forward policy suggestions, thereby continuously promoting the development of Fintech to a higher level.展开更多
Early fault diagnosis of bearings is crucial for ensuring safe and reliable operations.Convolutional neural networks(CNNs)have achieved significant breakthroughs in machinery fault diagnosis.However,complex and varyin...Early fault diagnosis of bearings is crucial for ensuring safe and reliable operations.Convolutional neural networks(CNNs)have achieved significant breakthroughs in machinery fault diagnosis.However,complex and varying working conditions can lead to inter-class similarity and intra-class variability in datasets,making it more challenging for CNNs to learn discriminative features.Furthermore,CNNs are often considered“black boxes”and lack sufficient interpretability in the fault diagnosis field.To address these issues,this paper introduces a residual mixed domain attention CNN method,referred to as RMA-CNN.This method comprises multiple residual mixed domain attention modules(RMAMs),each employing one attention mechanism to emphasize meaningful features in both time and channel domains.This significantly enhances the network’s ability to learn fault-related features.Moreover,we conduct an in-depth analysis of the inherent feature learning mechanism of the attention module RMAM to improve the interpretability of CNNs in fault diagnosis applications.Experiments conducted on two datasets—a high-speed aeronautical bearing dataset and a motor bearing dataset—demonstrate that the RMA-CNN achieves remarkable results in diagnostic tasks.展开更多
The accurate and reliable interpretation of regional land cover data is very important for natural resource monitoring and environmental assessment.At present,refined land cover data are mainly obtained by manual visu...The accurate and reliable interpretation of regional land cover data is very important for natural resource monitoring and environmental assessment.At present,refined land cover data are mainly obtained by manual visual interpretation,which has the problems of heavy workload and inconsistent interpretation scales.Deep learning has greatly improved the automatic processing and analysis of remote sensing data.However,the accurate interpretation of feature information from massive datasets remains a difficult problem in wide regional land cover classification.To improve the efficiency of deep learning-based remote sensing image interpretation,we selected multisource remote sensing data,assessed the interpretability of the U-Net model based on surface spatial scenes with different levels of complexity,and proposed a new method of stereoscopic accuracy verification(SAV)to evaluate the reliability of the classification result.The results show that classification accuracy is more highly correlated with terrain and landscape than with other factors related to image data,such as platform and spatial resolution.As the complexity of surface spatial scenes increases,the accuracy of the classification results mainly shows a fluctuating declining trend.We also find the distribution characteristics from the SAV evaluation results of different land cover types in each surface spatial scene.Based on the results observed in this study,we consider the distinction of interpretability and reliability in diverse ground object types and design targeted classification strategies for different surface scenes,which can greatly improve the classification efficiency.The key achievement of this study is to provide the theoretical basis for remote sensing information analysis and an accuracy evaluation method for regional land cover classification,and the proposed method can help improve the likelihood that intelligent interpretation can replace manual acquisition.展开更多
Predicting the motion of other road agents enables autonomous vehicles to perform safe and efficient path planning.This task is very complex,as the behaviour of road agents depends on many factors and the number of po...Predicting the motion of other road agents enables autonomous vehicles to perform safe and efficient path planning.This task is very complex,as the behaviour of road agents depends on many factors and the number of possible future trajectories can be consid-erable(multi-modal).Most prior approaches proposed to address multi-modal motion prediction are based on complex machine learning systems that have limited interpret-ability.Moreover,the metrics used in current benchmarks do not evaluate all aspects of the problem,such as the diversity and admissibility of the output.The authors aim to advance towards the design of trustworthy motion prediction systems,based on some of the re-quirements for the design of Trustworthy Artificial Intelligence.The focus is on evaluation criteria,robustness,and interpretability of outputs.First,the evaluation metrics are comprehensively analysed,the main gaps of current benchmarks are identified,and a new holistic evaluation framework is proposed.Then,a method for the assessment of spatial and temporal robustness is introduced by simulating noise in the perception system.To enhance the interpretability of the outputs and generate more balanced results in the proposed evaluation framework,an intent prediction layer that can be attached to multi-modal motion prediction models is proposed.The effectiveness of this approach is assessed through a survey that explores different elements in the visualisation of the multi-modal trajectories and intentions.The proposed approach and findings make a significant contribution to the development of trustworthy motion prediction systems for autono-mous vehicles,advancing the field towards greater safety and reliability.展开更多
Association rule learning(ARL)is a widely used technique for discovering relationships within datasets.However,it often generates excessive irrelevant or ambiguous rules.Therefore,post-processing is crucial not only f...Association rule learning(ARL)is a widely used technique for discovering relationships within datasets.However,it often generates excessive irrelevant or ambiguous rules.Therefore,post-processing is crucial not only for removing irrelevant or redundant rules but also for uncovering hidden associations that impact other factors.Recently,several post-processing methods have been proposed,each with its own strengths and weaknesses.In this paper,we propose THAPE(Tunable Hybrid Associative Predictive Engine),which combines descriptive and predictive techniques.By leveraging both techniques,our aim is to enhance the quality of analyzing generated rules.This includes removing irrelevant or redundant rules,uncovering interesting and useful rules,exploring hidden association rules that may affect other factors,and providing backtracking ability for a given product.The proposed approach offers a tailored method that suits specific goals for retailers,enabling them to gain a better understanding of customer behavior based on factual transactions in the target market.We applied THAPE to a real dataset as a case study in this paper to demonstrate its effectiveness.Through this application,we successfully mined a concise set of highly interesting and useful association rules.Out of the 11,265 rules generated,we identified 125 rules that are particularly relevant to the business context.These identified rules significantly improve the interpretability and usefulness of association rules for decision-making purposes.展开更多
Prediction systems are an important aspect of intelligent decisions.In engineering practice,the complex system structure and the external environment cause many uncertain factors in the model,which influence the model...Prediction systems are an important aspect of intelligent decisions.In engineering practice,the complex system structure and the external environment cause many uncertain factors in the model,which influence the modeling accuracy of the model.The belief rule base(BRB)can implement nonlinear modeling and express a variety of uncertain information,including fuzziness,ignorance,randomness,etc.However,the BRB system also has two main problems:Firstly,modeling methods based on expert knowledge make it difficult to guarantee the model’s accuracy.Secondly,interpretability is not considered in the optimization process of current research,resulting in the destruction of the interpretability of BRB.To balance the accuracy and interpretability of the model,a self-growth belief rule basewith interpretability constraints(SBRB-I)is proposed.The reasoning process of the SBRB-I model is based on the evidence reasoning(ER)approach.Moreover,the self-growth learning strategy ensures effective cooperation between the datadriven model and the expert system.A case study showed that the accuracy and interpretability of the model could be guaranteed.The SBRB-I model has good application prospects in prediction systems.展开更多
With the development of information technology,radio communication technology has made rapid progress.Many radio signals that have appeared in space are difficult to classify without manually labeling.Unsupervised rad...With the development of information technology,radio communication technology has made rapid progress.Many radio signals that have appeared in space are difficult to classify without manually labeling.Unsupervised radio signal clustering methods have recently become an urgent need for this situation.Meanwhile,the high complexity of deep learning makes it difficult to understand the decision results of the clustering models,making it essential to conduct interpretable analysis.This paper proposed a combined loss function for unsupervised clustering based on autoencoder.The combined loss function includes reconstruction loss and deep clustering loss.Deep clustering loss is added based on reconstruction loss,which makes similar deep features converge more in feature space.In addition,a features visualization method for signal clustering was proposed to analyze the interpretability of autoencoder utilizing Saliency Map.Extensive experiments have been conducted on a modulated signal dataset,and the results indicate the superior performance of our proposed method over other clustering algorithms.In particular,for the simulated dataset containing six modulation modes,when the SNR is 20dB,the clustering accuracy of the proposed method is greater than 78%.The interpretability analysis of the clustering model was performed to visualize the significant features of different modulated signals and verified the high separability of the features extracted by clustering model.展开更多
The balancing market in the energy sector plays a critical role in physically and financially balancing the supply and demand.Modeling dynamics in the balancing market can provide valuable insights and prognosis for p...The balancing market in the energy sector plays a critical role in physically and financially balancing the supply and demand.Modeling dynamics in the balancing market can provide valuable insights and prognosis for power grid stability and secure energy supply.While complex machine learning models can achieve high accuracy,their“blackbox”nature severely limits the model interpretability.In this paper,we explore the trade-off between model accuracy and interpretability for the energy balancing market.Particularly,we take the example of forecasting manual frequency restoration reserve(mFRR)activation price in the balancing market using real market data from different energy price zones.We explore the interpretability of mFRR forecasting using two models:extreme gradient boosting(XGBoost)machine and explainable boosting machine(EBM).We also integrate the two models,and we benchmark all the models against a baseline naive model.Our results show that EBM provides forecasting accuracy comparable to XGBoost while yielding a considerable level of interpretability.Our analysis also underscores the challenge of accurately predicting the mFRR price for the instances when the activation price deviates significantly from the spot price.Importantly,EBM's interpretability features reveal insights into non-linear mFRR price drivers and regional market dynamics.Our study demonstrates that EBM is a viable and valuable interpretable alternative to complex black-box AI models in the forecast for the balancing market.展开更多
With the rapid advancement of natural language processing(NLP),large language models(LLMs)have demonstrated excep-tional performance across tasks(Xu et al.,2024;Lee et al.,2024;Tan et al.,2023)like machine translation...With the rapid advancement of natural language processing(NLP),large language models(LLMs)have demonstrated excep-tional performance across tasks(Xu et al.,2024;Lee et al.,2024;Tan et al.,2023)like machine translation,text summarization,and question-answering,significantly accelerating NLP research.Furthermore,LLMs have also facilitated advancements across di-verse fields.In robotics,for example,LLMs enhance the interpre-tation and translation of user voice commands,enabling precise planning and execution of robotic arm movements(Driess et al.,2023).展开更多
In the rolling production of steel,predicting the performance of new products is challenging due to the low variety of data distributions resulting from standardized manufacturing processes and fixed product categorie...In the rolling production of steel,predicting the performance of new products is challenging due to the low variety of data distributions resulting from standardized manufacturing processes and fixed product categories.This scenario poses a significant hurdle for machine learning models,leading to what is commonly known as the“cold-start problem”.To address this issue,we propose a knowledge graph attention neural network for steel manufacturing(SteelKGAT).By leveraging expert knowledge and a multi-head attention mechanism,SteelKGAT aims to enhance prediction accuracy.Our experimental results demonstrate that the SteelKGAT model outperforms existing methods when generalizing to previously unseen products.Only the SteelKGAT model accurately captures the feature trend,thereby offering correct guidance in product tuning,which is of practical significance for new product development(NPD).Additionally,we employ the Integrated Gradients(IG)method to shed light on the model's predictions,revealing the relative importance of each feature within the knowledge graph.Notably,this work represents the first application of knowledge graph attention neural networks to address the cold-start problem in steel rolling production.By combining domain expertise and interpretable predictions,our knowledge-informed SteelKGAT model provides accurate insights into the mechanical properties of products even in cold-start scenarios.展开更多
This paper proposes a new approach for online power system transient security assessment(TSA)and preventive control based on XGBoost and DC optimal power flow(DCOPF).The novelty of this proposal is that it applies the...This paper proposes a new approach for online power system transient security assessment(TSA)and preventive control based on XGBoost and DC optimal power flow(DCOPF).The novelty of this proposal is that it applies the XGBoost and data selection method based on the 1-norm distance in local feature importance evaluation which can provide a certain model interpretability.The method of SMOTE+ENN is adopted for data rebalancing.The contingency-oriented XGBoost model is trained with databases generated by time domain simulations to represent the transient security constraint in the DCOPF model,which has a relatively fast speed of calculation.The transient security constrained generation rescheduling is implemented with the differential evolution algorithm,which is utilized to optimize the rescheduled generation in the preventive control.Feasibility and effectiveness of the proposed approach are demonstrated on an IEEE 39-bus test system and a 500-bus operational model for South Carolina,USA.展开更多
The prediction of structural performance plays a significant role in damage assessment of glass fiber reinforcement polymer(GFRP)elastic gridshell structures.Machine learning(ML)approaches are implemented in this stud...The prediction of structural performance plays a significant role in damage assessment of glass fiber reinforcement polymer(GFRP)elastic gridshell structures.Machine learning(ML)approaches are implemented in this study,to predict maximum stress and displacement of GFRP elastic gridshell structures.Several ML algorithms,including linear regression(LR),ridge regression(RR),support vector regression(SVR),K-nearest neighbors(KNN),decision tree(DT),random forest(RF),adaptive boosting(AdaBoost),extreme gradient boosting(XGBoost),category boosting(CatBoost),and light gradient boosting machine(LightGBM),are implemented in this study.Output features of structural performance considered in this study are the maximum stress as f1(x)and the maximum displacement to self-weight ratio as f2(x).A comparative study is conducted and the Catboost model presents the highest prediction accuracy.Finally,interpretable ML approaches,including shapely additive explanations(SHAP),partial dependence plot(PDP),and accumulated local effects(ALE),are applied to explain the predictions.SHAP is employed to describe the importance of each variable to structural performance both locally and globally.The results of sensitivity analysis(SA),feature importance of the CatBoost model and SHAP approach indicate the same parameters as the most significant variables for f1(x)and f2(x).展开更多
基金supported by the Research Grants Council of Hong Kong(CityU 11305919 and 11308620)and NSFC/RGC Joint Research Scheme N_CityU104/19Hong Kong Research Grant Council Collaborative Research Fund:C1002-21G and C1017-22Gsupported by the Hong Kong Research Grant Council Collaborative Research Fund:C6021-19E.
文摘Developing machine learning frameworks with predictive power,interpretability,and transferability is crucial,yet it faces challenges in the field of electrocatalysis.To achieve this,we employed rigorous feature engineering to establish a finely tuned gradient boosting regressor(GBR)model,which adeptly captures the physical complexity from feature space to target variables.We demonstrated that environmental electron effects and atomic number significantly govern the success of the mapping process via global and local explanations.The finely tuned GBR model exhibits exceptional robustness in predicting CO adsorption energies(R_(ave)^(2)=0.937,RMSE=0.153 eV).Moreover,the model demonstrated remarkable transfer learning ability,showing excellent predictive power for OH,NO,and N_(2) adsorption.Importantly,the GBR model exhibits exceptional predictive capability across an extensive search space,thereby demonstrating profound adaptability and versatility.Our research framework significantly enhances the interpretability and transferability of machine learning in electrocatalysis,offering vital insights for further advancements.
基金funded by Researchers Supporting Project Number(RSPD2025R947),King Saud University,Riyadh,Saudi Arabia.
文摘Cardiovascular disease(CVD)remains a leading global health challenge due to its high mortality rate and the complexity of early diagnosis,driven by risk factors such as hypertension,high cholesterol,and irregular pulse rates.Traditional diagnostic methods often struggle with the nuanced interplay of these risk factors,making early detection difficult.In this research,we propose a novel artificial intelligence-enabled(AI-enabled)framework for CVD risk prediction that integrates machine learning(ML)with eXplainable AI(XAI)to provide both high-accuracy predictions and transparent,interpretable insights.Compared to existing studies that typically focus on either optimizing ML performance or using XAI separately for local or global explanations,our approach uniquely combines both local and global interpretability using Local Interpretable Model-Agnostic Explanations(LIME)and SHapley Additive exPlanations(SHAP).This dual integration enhances the interpretability of the model and facilitates clinicians to comprehensively understand not just what the model predicts but also why those predictions are made by identifying the contribution of different risk factors,which is crucial for transparent and informed decision-making in healthcare.The framework uses ML techniques such as K-nearest neighbors(KNN),gradient boosting,random forest,and decision tree,trained on a cardiovascular dataset.Additionally,the integration of LIME and SHAP provides patient-specific insights alongside global trends,ensuring that clinicians receive comprehensive and actionable information.Our experimental results achieve 98%accuracy with the Random Forest model,with precision,recall,and F1-scores of 97%,98%,and 98%,respectively.The innovative combination of SHAP and LIME sets a new benchmark in CVD prediction by integrating advanced ML accuracy with robust interpretability,fills a critical gap in existing approaches.This framework paves the way for more explainable and transparent decision-making in healthcare,ensuring that the model is not only accurate but also trustworthy and actionable for clinicians.
基金supported by the National Research Foundation of Korea(NRF)funded by the Korean government(MSIT)(Grant number:RS-2025-02316700,and RS-2025-00522430)the China Scholarship Council Program。
文摘The bandgap is a key parameter for understanding and designing hybrid perovskite material properties,as well as developing photovoltaic devices.Traditional bandgap calculation methods like ultravioletvisible spectroscopy and first-principles calculations are time-and power-consuming,not to mention capturing bandgap change mechanisms for hybrid perovskite materials across a wide range of unknown space.In the present work,an artificial intelligence ensemble comprising two classifiers(with F1 scores of 0.9125 and 0.925)and a regressor(with mean squared error of 0.0014 eV)is constructed to achieve high-precision prediction of the bandgap.The bandgap perovskite dataset is established through highthroughput prediction of bandgaps by the ensemble.Based on the self-built dataset,partial dependence analysis(PDA)is developed to interpret the bandgap influential mechanism.Meanwhile,an interpretable mathematical model with an R^(2)of 0.8417 is generated using the genetic programming symbolic regression(GPSR)technique.The constructed PDA maps agree well with the Shapley Additive exPlanations,the GPSR model,and experiment verification.Through PDA,we reveal the boundary effect,the bowing effect,and their evolution trends with key descriptors.
基金supported by the National Key Research and Development Program of China(Grant Nos.2023YFC3707900 and 2024YFC3012700)the National Natural Science Foundation of China(Grant No.42230710).
文摘Soil desiccation cracking is ubiquitous in nature and has significantpotential impacts on the engineering geological properties of soils.Previous studies have extensively examined various factors affecting soil cracking behavior through a numerous small-sample experiments.However,experimental studies alone cannot accurately describe soil cracking behavior.In this study,we firstly propose a modeling framework for predicting the surface crack ratio of soil desiccation cracking based on machine learning and interpretable analysis.The framework utilizes 1040 sets of soil cracking experimental data and employs random forest(RF),extreme gradient boosting(XGBoost),and artificialneural network(ANN)models to predict the surface crack ratio of soil desiccation cracking.To clarify the influenceof input features on soil cracking behavior,feature importance and Shapley additive explanations(SHAP)are applied for interpretability analysis.The results reveal that ensemble methods(RF and XGBoost)provide better predictive performance than the deep learning model(ANN).The feature importance analysis shows that soil desiccation cracking is primarily influencedby initial water content,plasticity index,finalwater content,liquid limit,sand content,clay content and thickness.Moreover,SHAP-based interpretability analysis further explores how soil cracking responds to various input variables.This study provides new insight into the evolution of soil cracking behavior,enhancing the understanding of its physical mechanisms and facilitating the assessment of potential regional development of soil desiccation cracking.
基金supported by the National Key Research and Development Program of China(2022YFB3903302).
文摘Accurately mapping the spatial distribution of soil organic carbon(SOC)is crucial for guiding agricultural management and improving soil carbon sequestration,especially in fragmented agricultural landscapes.Although remote sensing provides spatially continuous environmental information about heterogeneous agricultural landscapes,its relationship with SOC remains unclear.In this study,we hypothesized that multi-category remote sensing-derived variables can enhance our understanding of SOC variation within complex landscape conditions.Taking the Qilu Lake watershed in Yunnan,China,as a case study area and based on 216 topsoil samples collected from irrigation areas,we applied the extreme gradient boosting(XGBoost)model to investigate the contributions of vegetation indices(VI),brightness indices(BI),moisture indices(MI),and spectral transformations(ST,principal component analysis and tasseled cap transformation)to SOC mapping.The results showed that ST contributed the most to SOC prediction accuracy,followed by MI,VI,and BI,with improvements in R2 of 29.27,26.83,19.51,and 14.43%,respectively.The dominance of ST can be attributed to the fact that it contains richer remote sensing spectral information.The optimal SOC prediction model integrated soil properties,topographic factors,location factors,and landscape metrics,as well as remote sensing-derived variables,and achieved RMSE and MAE of 15.05 and 11.42 g kg-1,and R2 and CCC of 0.57 and 0.72,respectively.The Shapley additive explanations deciphered the nonlinear and threshold effects that exist between soil moisture,vegetation status,soil brightness and SOC.Compared with traditional linear regression models,interpretable machine learning has advantages in prediction accuracy and revealing the influences of variables that reflect landscape characteristics on SOC.Overall,this study not only reveals how remote sensing-derived variables contribute to our understanding of SOC distribution in fragmented agricultural landscapes but also clarifies their efficacy.Through interpretable machine learning,we can further elucidate the causes of SOC variation,which is important for sustainable soil management and agricultural practices.
基金supported by the National Natural Science Foundation of China(71901212)the Science and Technology Innovation Program of Hunan Province(2020RC4046).
文摘The belief rule-based(BRB)system has been popular in complexity system modeling due to its good interpretability.However,the current mainstream optimization methods of the BRB systems only focus on modeling accuracy but ignore the interpretability.The single-objective optimization strategy has been applied in the interpretability-accuracy trade-off by inte-grating accuracy and interpretability into an optimization objec-tive.But the integration has a greater impact on optimization results with strong subjectivity.Thus,a multi-objective optimiza-tion framework in the modeling of BRB systems with inter-pretability-accuracy trade-off is proposed in this paper.Firstly,complexity and accuracy are taken as two independent opti-mization goals,and uniformity as a constraint to give the mathe-matical description.Secondly,a classical multi-objective opti-mization algorithm,nondominated sorting genetic algorithm II(NSGA-II),is utilized as an optimization tool to give a set of BRB systems with different accuracy and complexity.Finally,a pipeline leakage detection case is studied to verify the feasibility and effectiveness of the developed multi-objective optimization.The comparison illustrates that the proposed multi-objective optimization framework can effectively avoid the subjectivity of single-objective optimization,and has capability of joint optimiz-ing the structure and parameters of BRB systems with inter-pretability-accuracy trade-off.
基金supported by the Natural Science Foundation of China (No.61833016)the Shaanxi Outstanding Youth Science Foundation (No.2020JC-34)the Shaanxi Science and Technology Innovation Team (No.2022TD-24).
文摘Fault diagnosis plays an irreplaceable role in the normal operation of equipment.A fault diagnosis model is often required to be interpretable for increasing the trust between humans and the model.Due to the understandable knowledge expression and transparent reasoning process,the belief rule base(BRB)has extensive applications as an interpretable expert system in fault diagnosis.Optimization is an effective means to weaken the subjectivity of experts in BRB,where the interpretability of BRB may be weakened.Hence,to obtain a credible result,the weakening factors of interpretability in the BRB-based fault diagnosis model are firstly analyzed,which are manifested in deviation from the initial judgement of experts and over-optimization of parameters.For these two factors,three indexes are proposed,namely the consistency index of rules,consistency index of the rule base and over-optimization index,tomeasure the interpretability of the optimizedmodel.Considering both the accuracy and interpretability of amodel,an improved coordinate ascent(I-CA)algorithmis proposed to fine-tune the parameters of the fault diagnosis model based on BRB.In I-CA,the algorithm combined with the advance and retreat method and the golden section method is employed to be one-dimensional search algorithm.Furthermore,the random optimization sequence and adaptive step size are proposed to improve the accuracy of the model.Finally,a case study of fault diagnosis in aerospace relays based on BRB is carried out to verify the effectiveness of the proposed method.
基金Key research project of Hunan Provincial Administration of Traditional Chinese Medicine(A2023048)Key Research Foundation of Education Bureau of Hunan Province,China(23A0273).
文摘Objective Clinical medical record data associated with hepatitis B-related acute-on-chronic liver failure(HBV-ACLF)generally have small sample sizes and a class imbalance.However,most machine learning models are designed based on balanced data and lack interpretability.This study aimed to propose a traditional Chinese medicine(TCM)diagnostic model for HBV-ACLF based on the TCM syndrome differentiation and treatment theory,which is clinically interpretable and highly accurate.Methods We collected medical records from 261 patients diagnosed with HBV-ACLF,including three syndromes:Yang jaundice(214 cases),Yang-Yin jaundice(41 cases),and Yin jaundice(6 cases).To avoid overfitting of the machine learning model,we excluded the cases of Yin jaundice.After data standardization and cleaning,we obtained 255 relevant medical records of Yang jaundice and Yang-Yin jaundice.To address the class imbalance issue,we employed the oversampling method and five machine learning methods,including logistic regression(LR),support vector machine(SVM),decision tree(DT),random forest(RF),and extreme gradient boosting(XGBoost)to construct the syndrome diagnosis models.This study used precision,F1 score,the area under the receiver operating characteristic(ROC)curve(AUC),and accuracy as model evaluation metrics.The model with the best classification performance was selected to extract the diagnostic rule,and its clinical significance was thoroughly analyzed.Furthermore,we proposed a novel multiple-round stable rule extraction(MRSRE)method to obtain a stable rule set of features that can exhibit the model’s clinical interpretability.Results The precision of the five machine learning models built using oversampled balanced data exceeded 0.90.Among these models,the accuracy of RF classification of syndrome types was 0.92,and the mean F1 scores of the two categories of Yang jaundice and Yang-Yin jaundice were 0.93 and 0.94,respectively.Additionally,the AUC was 0.98.The extraction rules of the RF syndrome differentiation model based on the MRSRE method revealed that the common features of Yang jaundice and Yang-Yin jaundice were wiry pulse,yellowing of the urine,skin,and eyes,normal tongue body,healthy sublingual vessel,nausea,oil loathing,and poor appetite.The main features of Yang jaundice were a red tongue body and thickened sublingual vessels,whereas those of Yang-Yin jaundice were a dark tongue body,pale white tongue body,white tongue coating,lack of strength,slippery pulse,light red tongue body,slimy tongue coating,and abdominal distension.This is aligned with the classifications made by TCM experts based on TCM syndrome differentiation and treatment theory.Conclusion Our model can be utilized for differentiating HBV-ACLF syndromes,which has the potential to be applied to generate other clinically interpretable models with high accuracy on clinical data characterized by small sample sizes and a class imbalance.
文摘With the development of Fintech, applying artificial intelligence (AI) technologies to the financial field is a general trend. However, there are some inappropriate conditions, for instance, the AI model is always treated as a black box and cannot be interpreted. This paper studies the AI model interpretability when the models are applied in the financial field. We analyze the reasons of black box problem and explore the effective solutions. We propose a new kind of automatic Regtech tool—LIMER, and put forward policy suggestions, thereby continuously promoting the development of Fintech to a higher level.
基金The authors would like to acknowledge the support of the China Scholarship Council,the Flemish Government under the“Onderzoeksprogramma Artificiële Intelligentie(AI)Vlaanderen”Program and the Research Foundation–Flanders(FWO)under the ROBUSTIFY research grant no.S006119N.
文摘Early fault diagnosis of bearings is crucial for ensuring safe and reliable operations.Convolutional neural networks(CNNs)have achieved significant breakthroughs in machinery fault diagnosis.However,complex and varying working conditions can lead to inter-class similarity and intra-class variability in datasets,making it more challenging for CNNs to learn discriminative features.Furthermore,CNNs are often considered“black boxes”and lack sufficient interpretability in the fault diagnosis field.To address these issues,this paper introduces a residual mixed domain attention CNN method,referred to as RMA-CNN.This method comprises multiple residual mixed domain attention modules(RMAMs),each employing one attention mechanism to emphasize meaningful features in both time and channel domains.This significantly enhances the network’s ability to learn fault-related features.Moreover,we conduct an in-depth analysis of the inherent feature learning mechanism of the attention module RMAM to improve the interpretability of CNNs in fault diagnosis applications.Experiments conducted on two datasets—a high-speed aeronautical bearing dataset and a motor bearing dataset—demonstrate that the RMA-CNN achieves remarkable results in diagnostic tasks.
基金Under the auspices of National Natural Science Foundation of China(No.41971352)Key Research and Development Project of Shaanxi Province(No.2022ZDLSF06-01)。
文摘The accurate and reliable interpretation of regional land cover data is very important for natural resource monitoring and environmental assessment.At present,refined land cover data are mainly obtained by manual visual interpretation,which has the problems of heavy workload and inconsistent interpretation scales.Deep learning has greatly improved the automatic processing and analysis of remote sensing data.However,the accurate interpretation of feature information from massive datasets remains a difficult problem in wide regional land cover classification.To improve the efficiency of deep learning-based remote sensing image interpretation,we selected multisource remote sensing data,assessed the interpretability of the U-Net model based on surface spatial scenes with different levels of complexity,and proposed a new method of stereoscopic accuracy verification(SAV)to evaluate the reliability of the classification result.The results show that classification accuracy is more highly correlated with terrain and landscape than with other factors related to image data,such as platform and spatial resolution.As the complexity of surface spatial scenes increases,the accuracy of the classification results mainly shows a fluctuating declining trend.We also find the distribution characteristics from the SAV evaluation results of different land cover types in each surface spatial scene.Based on the results observed in this study,we consider the distinction of interpretability and reliability in diverse ground object types and design targeted classification strategies for different surface scenes,which can greatly improve the classification efficiency.The key achievement of this study is to provide the theoretical basis for remote sensing information analysis and an accuracy evaluation method for regional land cover classification,and the proposed method can help improve the likelihood that intelligent interpretation can replace manual acquisition.
基金European Commission,Joint Research Center,Grant/Award Number:HUMAINTMinisterio de Ciencia e Innovación,Grant/Award Number:PID2020‐114924RB‐I00Comunidad de Madrid,Grant/Award Number:S2018/EMT‐4362 SEGVAUTO 4.0‐CM。
文摘Predicting the motion of other road agents enables autonomous vehicles to perform safe and efficient path planning.This task is very complex,as the behaviour of road agents depends on many factors and the number of possible future trajectories can be consid-erable(multi-modal).Most prior approaches proposed to address multi-modal motion prediction are based on complex machine learning systems that have limited interpret-ability.Moreover,the metrics used in current benchmarks do not evaluate all aspects of the problem,such as the diversity and admissibility of the output.The authors aim to advance towards the design of trustworthy motion prediction systems,based on some of the re-quirements for the design of Trustworthy Artificial Intelligence.The focus is on evaluation criteria,robustness,and interpretability of outputs.First,the evaluation metrics are comprehensively analysed,the main gaps of current benchmarks are identified,and a new holistic evaluation framework is proposed.Then,a method for the assessment of spatial and temporal robustness is introduced by simulating noise in the perception system.To enhance the interpretability of the outputs and generate more balanced results in the proposed evaluation framework,an intent prediction layer that can be attached to multi-modal motion prediction models is proposed.The effectiveness of this approach is assessed through a survey that explores different elements in the visualisation of the multi-modal trajectories and intentions.The proposed approach and findings make a significant contribution to the development of trustworthy motion prediction systems for autono-mous vehicles,advancing the field towards greater safety and reliability.
文摘Association rule learning(ARL)is a widely used technique for discovering relationships within datasets.However,it often generates excessive irrelevant or ambiguous rules.Therefore,post-processing is crucial not only for removing irrelevant or redundant rules but also for uncovering hidden associations that impact other factors.Recently,several post-processing methods have been proposed,each with its own strengths and weaknesses.In this paper,we propose THAPE(Tunable Hybrid Associative Predictive Engine),which combines descriptive and predictive techniques.By leveraging both techniques,our aim is to enhance the quality of analyzing generated rules.This includes removing irrelevant or redundant rules,uncovering interesting and useful rules,exploring hidden association rules that may affect other factors,and providing backtracking ability for a given product.The proposed approach offers a tailored method that suits specific goals for retailers,enabling them to gain a better understanding of customer behavior based on factual transactions in the target market.We applied THAPE to a real dataset as a case study in this paper to demonstrate its effectiveness.Through this application,we successfully mined a concise set of highly interesting and useful association rules.Out of the 11,265 rules generated,we identified 125 rules that are particularly relevant to the business context.These identified rules significantly improve the interpretability and usefulness of association rules for decision-making purposes.
基金This work was supported in part by the Postdoctoral Science Foundation of China under Grant No.2020M683736in part by the Natural Science Foundation of Heilongjiang Province of China under Grant No.LH2021F038+2 种基金in part by the innovation practice project of college students in Heilongjiang Province under Grant Nos.202010231009,202110231024,and 202110231155in part by the basic scientific research business expenses scientific research projects of provincial universities in Heilongjiang Province Grant Nos.XJGZ2021001in part by the Education and teaching reform program of 2021 in Heilongjiang Province under Grant No.SJGY20210457.
文摘Prediction systems are an important aspect of intelligent decisions.In engineering practice,the complex system structure and the external environment cause many uncertain factors in the model,which influence the modeling accuracy of the model.The belief rule base(BRB)can implement nonlinear modeling and express a variety of uncertain information,including fuzziness,ignorance,randomness,etc.However,the BRB system also has two main problems:Firstly,modeling methods based on expert knowledge make it difficult to guarantee the model’s accuracy.Secondly,interpretability is not considered in the optimization process of current research,resulting in the destruction of the interpretability of BRB.To balance the accuracy and interpretability of the model,a self-growth belief rule basewith interpretability constraints(SBRB-I)is proposed.The reasoning process of the SBRB-I model is based on the evidence reasoning(ER)approach.Moreover,the self-growth learning strategy ensures effective cooperation between the datadriven model and the expert system.A case study showed that the accuracy and interpretability of the model could be guaranteed.The SBRB-I model has good application prospects in prediction systems.
基金supported in part by the National Natural Science Foundation of China(No.62276206)the Key Research and Development Program of Shaanxi under Grant S2022-YF-YBGY-0921+2 种基金the State Key Program of National Natural Science of China(No.62231027)supported by the Science and Technology on Communication Information Security Control Laboratory。
文摘With the development of information technology,radio communication technology has made rapid progress.Many radio signals that have appeared in space are difficult to classify without manually labeling.Unsupervised radio signal clustering methods have recently become an urgent need for this situation.Meanwhile,the high complexity of deep learning makes it difficult to understand the decision results of the clustering models,making it essential to conduct interpretable analysis.This paper proposed a combined loss function for unsupervised clustering based on autoencoder.The combined loss function includes reconstruction loss and deep clustering loss.Deep clustering loss is added based on reconstruction loss,which makes similar deep features converge more in feature space.In addition,a features visualization method for signal clustering was proposed to analyze the interpretability of autoencoder utilizing Saliency Map.Extensive experiments have been conducted on a modulated signal dataset,and the results indicate the superior performance of our proposed method over other clustering algorithms.In particular,for the simulated dataset containing six modulation modes,when the SNR is 20dB,the clustering accuracy of the proposed method is greater than 78%.The interpretability analysis of the clustering model was performed to visualize the significant features of different modulated signals and verified the high separability of the features extracted by clustering model.
基金PriTEM project funded by UiO:Energy Convergence Environments
文摘The balancing market in the energy sector plays a critical role in physically and financially balancing the supply and demand.Modeling dynamics in the balancing market can provide valuable insights and prognosis for power grid stability and secure energy supply.While complex machine learning models can achieve high accuracy,their“blackbox”nature severely limits the model interpretability.In this paper,we explore the trade-off between model accuracy and interpretability for the energy balancing market.Particularly,we take the example of forecasting manual frequency restoration reserve(mFRR)activation price in the balancing market using real market data from different energy price zones.We explore the interpretability of mFRR forecasting using two models:extreme gradient boosting(XGBoost)machine and explainable boosting machine(EBM).We also integrate the two models,and we benchmark all the models against a baseline naive model.Our results show that EBM provides forecasting accuracy comparable to XGBoost while yielding a considerable level of interpretability.Our analysis also underscores the challenge of accurately predicting the mFRR price for the instances when the activation price deviates significantly from the spot price.Importantly,EBM's interpretability features reveal insights into non-linear mFRR price drivers and regional market dynamics.Our study demonstrates that EBM is a viable and valuable interpretable alternative to complex black-box AI models in the forecast for the balancing market.
文摘With the rapid advancement of natural language processing(NLP),large language models(LLMs)have demonstrated excep-tional performance across tasks(Xu et al.,2024;Lee et al.,2024;Tan et al.,2023)like machine translation,text summarization,and question-answering,significantly accelerating NLP research.Furthermore,LLMs have also facilitated advancements across di-verse fields.In robotics,for example,LLMs enhance the interpre-tation and translation of user voice commands,enabling precise planning and execution of robotic arm movements(Driess et al.,2023).
基金supported by the National Key R&D Program(No.2021YFB3702404)National Natural Science Foundation of China(Nos.52311530082 and U22A20106)support provided by“Xingliao Talent Plan”project(Grant No.XLYC2203027)is gratefully acknowledged.
文摘In the rolling production of steel,predicting the performance of new products is challenging due to the low variety of data distributions resulting from standardized manufacturing processes and fixed product categories.This scenario poses a significant hurdle for machine learning models,leading to what is commonly known as the“cold-start problem”.To address this issue,we propose a knowledge graph attention neural network for steel manufacturing(SteelKGAT).By leveraging expert knowledge and a multi-head attention mechanism,SteelKGAT aims to enhance prediction accuracy.Our experimental results demonstrate that the SteelKGAT model outperforms existing methods when generalizing to previously unseen products.Only the SteelKGAT model accurately captures the feature trend,thereby offering correct guidance in product tuning,which is of practical significance for new product development(NPD).Additionally,we employ the Integrated Gradients(IG)method to shed light on the model's predictions,revealing the relative importance of each feature within the knowledge graph.Notably,this work represents the first application of knowledge graph attention neural networks to address the cold-start problem in steel rolling production.By combining domain expertise and interpretable predictions,our knowledge-informed SteelKGAT model provides accurate insights into the mechanical properties of products even in cold-start scenarios.
基金supported in part by the National Key Research and Development Program of China under Grant 2020YFB0905900.
文摘This paper proposes a new approach for online power system transient security assessment(TSA)and preventive control based on XGBoost and DC optimal power flow(DCOPF).The novelty of this proposal is that it applies the XGBoost and data selection method based on the 1-norm distance in local feature importance evaluation which can provide a certain model interpretability.The method of SMOTE+ENN is adopted for data rebalancing.The contingency-oriented XGBoost model is trained with databases generated by time domain simulations to represent the transient security constraint in the DCOPF model,which has a relatively fast speed of calculation.The transient security constrained generation rescheduling is implemented with the differential evolution algorithm,which is utilized to optimize the rescheduled generation in the preventive control.Feasibility and effectiveness of the proposed approach are demonstrated on an IEEE 39-bus test system and a 500-bus operational model for South Carolina,USA.
基金The research work was supported by the National Natural Science Foundation of China(Grant No.51978400)the National Key Research and Development Program of China(No.2021YFE0107800).The support is gratefully acknowledged.
文摘The prediction of structural performance plays a significant role in damage assessment of glass fiber reinforcement polymer(GFRP)elastic gridshell structures.Machine learning(ML)approaches are implemented in this study,to predict maximum stress and displacement of GFRP elastic gridshell structures.Several ML algorithms,including linear regression(LR),ridge regression(RR),support vector regression(SVR),K-nearest neighbors(KNN),decision tree(DT),random forest(RF),adaptive boosting(AdaBoost),extreme gradient boosting(XGBoost),category boosting(CatBoost),and light gradient boosting machine(LightGBM),are implemented in this study.Output features of structural performance considered in this study are the maximum stress as f1(x)and the maximum displacement to self-weight ratio as f2(x).A comparative study is conducted and the Catboost model presents the highest prediction accuracy.Finally,interpretable ML approaches,including shapely additive explanations(SHAP),partial dependence plot(PDP),and accumulated local effects(ALE),are applied to explain the predictions.SHAP is employed to describe the importance of each variable to structural performance both locally and globally.The results of sensitivity analysis(SA),feature importance of the CatBoost model and SHAP approach indicate the same parameters as the most significant variables for f1(x)and f2(x).