Slag viscosity plays a crucial role in the smelting process.A slag viscosity prediction model was developed by integrating hyperparameter optimization algorithms,machine learning,and SHapley Additive exPlanations(SHAP...Slag viscosity plays a crucial role in the smelting process.A slag viscosity prediction model was developed by integrating hyperparameter optimization algorithms,machine learning,and SHapley Additive exPlanations(SHAP)analysis.The developed slag viscosity prediction models were evaluated using multiple statistical metrics,leading to the identification of the optimal model—Bayesian optimization-based categorical boosting(BO-CatBoost).And this model was further compared with existing models,including NPL model,FactSage+Roscoe-Einstein(RE)equation,artificial neural network model+RE equation,Riboud model+RE equation,and Zhang model.The results indicate that the slag viscosity prediction model based on BO-CatBoost outperforms all other models,achieving a coefficient of determination of 0.9897,a root mean square error of 1.0619,a mean absolute error of 0.6133,and a hit ratio of 95.1%.The global interpretability analysis of SHAP analysis was used to reveal the importance degree of different features on slag viscosity.The local interpretability analysis of SHAP analysis was used to obtain the quantitative influence of different features on slag viscosity in specific samples.The high-accuracy and interpretable slag viscosity prediction model developed is beneficial to the intelligent design of slag composition.展开更多
Achieving the simultaneous enhancement of strength and ductility in laser powder bed fused (LPBF-ed) titanium (Ti) is challenging due to the complex, high-dimensional parameter space and interactions between parameter...Achieving the simultaneous enhancement of strength and ductility in laser powder bed fused (LPBF-ed) titanium (Ti) is challenging due to the complex, high-dimensional parameter space and interactions between parameters and powders. Herein, a hybrid intelligent framework for process parameter optimization of LPBF-ed Ti with improved ultimate tensile strength (UTS) and elongation (EL) was proposed. It combines the data augmentation method (AVG ± EC × SD), the multi-model fusion stacking ensemble learning model (GBDT-BPNN-XGBoost), the interpretable machine learning method and the non-dominated ranking genetic algorithm (NSGA-Ⅱ). The GBDT-BPNN-XGBoost outperforms single models in predicting UTS and EL across the accuracy, generalization ability and stability. The SHAP analysis reveals that laser power (P) is the most important feature affecting both UTS and EL, and it has a positive impact on them when P < 220 W. The UTS and EL of samples fabricated by the optimal process parameters were 718 ± 5 MPa and 27.9 % ± 0.1 %, respectively. The outstanding strength-ductility balance is attributable to the forward stresses in hard α'-martensite and back stresses in soft αm'-martensite induced by the strain gradients of hetero-microstructure. The back stresses strengthen the soft αm'-martensite, improving the overall UTS. The forward stresses stimulate the activation of dislocations in hard α'-martensite and the generation of 〈c + a〉 dislocations, allowing the plastic strain to occur in hard regions and enhancing the overall ductility. This work provides a feasible strategy for multi-objective optimization and valuable insights into tailoring the microstructure for improving mechanical properties.展开更多
Although lithium-ion batteries(LIBs)currently dominate a wide spectrum of energy storage applications,they face challenges such as fast cycle life decay and poor stability that hinder their further application.To addr...Although lithium-ion batteries(LIBs)currently dominate a wide spectrum of energy storage applications,they face challenges such as fast cycle life decay and poor stability that hinder their further application.To address these limitations,element doping has emerged as a prevalent strategy to enhance the discharge capacity and extend the durability of Li-Ni-Co-Mn(LNCM)ternary compounds.This study utilized a machine learning-driven feature screening method to effectively pinpoint four key features crucially impacting the initial discharge capacity(IC)of Li-Ni-Co-Mn(LNCM)ternary cathode materials.These features were also proved highly predictive for the 50^(th)cycle discharge capacity(EC).Additionally,the application of SHAP value analysis yielded an in-depth understanding of the interplay between these features and discharge performance.This insight offers valuable direction for future advancements in the development of LNCM cathode materials,effectively promoting this field toward greater efficiency and sustainability.展开更多
This study aims to develop an accurate and robust machine learning model to predict the carbonation depth of fly ash concrete,overcoming the limitations of traditional predictive methods.Five ensemble-based models,suc...This study aims to develop an accurate and robust machine learning model to predict the carbonation depth of fly ash concrete,overcoming the limitations of traditional predictive methods.Five ensemble-based models,such as adaptive boosting(AdaBoost),categorical boosting(CatBoost),gradient boosting regressor(GBR),hist gradient boosting regressor(HistGBR),and extreme gradient boosting(XGBoost),were developed and optimized using 729 high-quality dataset points incorporating seven input parameters,including cement,CO_(2),exposure time,water-binder ratio,fly ash,curing time,and compressive strength.Several performance evaluation metrics were used to compare the models.The GBR model emerged as the best-performing model,based on high coefficient of determination(R^(2))values and balanced error metrics across both validation and testing datasets.While all models performed exceptionally well on the training data,GBR demonstrated superior generalization capability,with R^(2) values of 0.9438 on the validation set and 0.9310 on the testing set.Furthermore,its low mean squared error(MSE),root mean square error(RMSE),mean absolute error(MAE),and median absolute error(MdAE)confirmed its robustness and accuracy.Moreover,shapley additive explanations(SHAP)analysis enhanced the interpretability of predictions,highlighting the curing time and exposure time as the most critical drivers of carbonation depth.展开更多
Unconfined Compressive Strength(UCS)is a key parameter for the assessment of the stability and performance of stabilized soils,yet traditional laboratory testing is both time and resource intensive.In this study,an in...Unconfined Compressive Strength(UCS)is a key parameter for the assessment of the stability and performance of stabilized soils,yet traditional laboratory testing is both time and resource intensive.In this study,an interpretable machine learning approach to UCS prediction is presented,pairing five models(Random Forest(RF),Gradient Boosting(GB),Extreme Gradient Boosting(XGB),CatBoost,and K-Nearest Neighbors(KNN))with SHapley Additive exPlanations(SHAP)for enhanced interpretability and to guide feature removal.A complete dataset of 12 geotechnical and chemical parameters,i.e.,Atterberg limits,compaction properties,stabilizer chemistry,dosage,curing time,was used to train and test the models.R2,RMSE,MSE,and MAE were used to assess performance.Initial results with all 12 features indicated that boosting-based models(GB,XGB,CatBoost)exhibited the highest predictive accuracy(R^(2)=0.93)with satisfactory generalization on test data,followed by RF and KNN.SHAP analysis consistently picked CaO content,curing time,stabilizer dosage,and compaction parameters as the most important features,aligning with established soil stabilization mechanisms.Models were then re-trained on the top 8 and top 5 SHAP-ranked features.Interestingly,GB,XGB,and CatBoost maintained comparable accuracy with reduced input sets,while RF was moderately sensitive and KNN was somewhat better owing to reduced dimensionality.The findings confirm that feature reduction through SHAP enables cost-effective UCS prediction through the reduction of laboratory test requirements without significant accuracy loss.The suggested hybrid approach offers an explainable,interpretable,and cost-effective tool for geotechnical engineering practice.展开更多
The global shift towards sustainable and environmentally friendly transportation options has led to the increasing adoption of electric buses(Ebuses).To optimize the deployment and operational strategies of Ebuses,it ...The global shift towards sustainable and environmentally friendly transportation options has led to the increasing adoption of electric buses(Ebuses).To optimize the deployment and operational strategies of Ebuses,it is imperative to accurately predict their energy consumption under varying conditions,particularly in cold climates where battery life is typically degraded.The exploration of this aspect within the Canadian context has been limited.In addition,we have found that existing models in the literature perform poorly in the Canadian environment,giving rise to the need for new models using Canadian data.This paper focuses on the development,comparison,and evaluation of various data-driven models designed to predict the energy consumption of different Ebuses with different heating technologies under a wide range of climate conditions.We specifically use Canadian data as a good representative of cold climates in general.The results show that the performance of the different bus types varies substantially under the exact same conditions.In addition,tree-based family of models proves to be the most suitable approach for predicting the Ebus consumption rate.The results indicate that the Random Forest method emerges as the superior choice for predicting the energy consumption rate,with a resulting mean absolute error of 0.09–0.1 kWh/km observed across the different models.Furthermore,SHAP analysis shows that the main variables influencing the energy consumption rate depend on the type of heating system(using the battery for heating or using an auxiliary system that utilizes diesel for heating)adopted.展开更多
This study constructs 196 transition metals(TM)@S_(x)N_(y) single-atom catalysts(SACs)(x=0-4 and y=0-4)and employs the eXtreme Gradient Boosting(XGBoost)classification model in machine learning(ML)for effectively dist...This study constructs 196 transition metals(TM)@S_(x)N_(y) single-atom catalysts(SACs)(x=0-4 and y=0-4)and employs the eXtreme Gradient Boosting(XGBoost)classification model in machine learning(ML)for effectively distinguishing qualified and unqualified catalysts.The prediction accuracy rate is high,up to 95%.The SHapley Additive exPlanations(SHAP)analysis reveals that the N≡N bond length and the number of outermost d electrons(N_(d))can well describe the nitrogen(N2)reduction reaction(NRR)activity.The relationships between N≡N,N_(d),the adsorption energies of different intermediates(ΔE_(*N_(2)),ΔE_(*N_(2)H),and ΔE_(*NH_(2))),the general descriptor(φ),and the Gibbs free energy of key steps(ΔG_(*N_(2)),ΔG_(*N_(2)-*N_(2)H),and ΔG_(*N_H(2)-*NH_(3)))indicate that moderate nitrogen activation can enhance the reaction activity.Among the 17 screened SACs,Mo@S3N1,and W@S_(3)N_(1) demonstrate the best catalytic performance,with limiting potential(U_(L))values of only-0.26 and-0.25 V under implicit solvation conditions.The electronic properties and variations in N≡N and TM-N bond lengths are investigated to reveal the origin of NRR activity.This study provides the decisive features and NRR dataset for ML research,as well as a feasible strategy for rational design of NRR SACs.展开更多
基金funded by the National Natural Science Foundation of China(No.52374321)the National Key Research and Development Program of China(No.2024YFB3713602)the Youth Science and Technology Innovation Fund of Jianlong Group-University of Science and Technology Beijing(No.20231235).
文摘Slag viscosity plays a crucial role in the smelting process.A slag viscosity prediction model was developed by integrating hyperparameter optimization algorithms,machine learning,and SHapley Additive exPlanations(SHAP)analysis.The developed slag viscosity prediction models were evaluated using multiple statistical metrics,leading to the identification of the optimal model—Bayesian optimization-based categorical boosting(BO-CatBoost).And this model was further compared with existing models,including NPL model,FactSage+Roscoe-Einstein(RE)equation,artificial neural network model+RE equation,Riboud model+RE equation,and Zhang model.The results indicate that the slag viscosity prediction model based on BO-CatBoost outperforms all other models,achieving a coefficient of determination of 0.9897,a root mean square error of 1.0619,a mean absolute error of 0.6133,and a hit ratio of 95.1%.The global interpretability analysis of SHAP analysis was used to reveal the importance degree of different features on slag viscosity.The local interpretability analysis of SHAP analysis was used to obtain the quantitative influence of different features on slag viscosity in specific samples.The high-accuracy and interpretable slag viscosity prediction model developed is beneficial to the intelligent design of slag composition.
基金supported by the National Natural Sci-ence Foundation of China(Nos.52274359 and 52304379)the China National Postdoctoral Program for Innovative Talents(No.BX20220034)+2 种基金the China Postdoctoral Science Foundation(No.2022M720403)the AECC University Research Cooperation Project(No.HFZL2021CXY021)the Interdisciplinary Research Project for Young Teachers of USTB(Fundamental Research Funds for the Central Universities)(No.FRF-IDRY-23-025).
文摘Achieving the simultaneous enhancement of strength and ductility in laser powder bed fused (LPBF-ed) titanium (Ti) is challenging due to the complex, high-dimensional parameter space and interactions between parameters and powders. Herein, a hybrid intelligent framework for process parameter optimization of LPBF-ed Ti with improved ultimate tensile strength (UTS) and elongation (EL) was proposed. It combines the data augmentation method (AVG ± EC × SD), the multi-model fusion stacking ensemble learning model (GBDT-BPNN-XGBoost), the interpretable machine learning method and the non-dominated ranking genetic algorithm (NSGA-Ⅱ). The GBDT-BPNN-XGBoost outperforms single models in predicting UTS and EL across the accuracy, generalization ability and stability. The SHAP analysis reveals that laser power (P) is the most important feature affecting both UTS and EL, and it has a positive impact on them when P < 220 W. The UTS and EL of samples fabricated by the optimal process parameters were 718 ± 5 MPa and 27.9 % ± 0.1 %, respectively. The outstanding strength-ductility balance is attributable to the forward stresses in hard α'-martensite and back stresses in soft αm'-martensite induced by the strain gradients of hetero-microstructure. The back stresses strengthen the soft αm'-martensite, improving the overall UTS. The forward stresses stimulate the activation of dislocations in hard α'-martensite and the generation of 〈c + a〉 dislocations, allowing the plastic strain to occur in hard regions and enhancing the overall ductility. This work provides a feasible strategy for multi-objective optimization and valuable insights into tailoring the microstructure for improving mechanical properties.
基金supported by the National Natural Science Foundation of China(Nos.52122408,52071023)the Program for Science&Technology Innovation Talents in the University of Henan Province(No.22HASTIT1006)+2 种基金the Program for Central Plains Talents(No.ZYYCYU202012172)the Ministry of Education,Singapore(No.RG70/20)the Opening Project of National Joint Engineering Research Center for Abrasion Control and Molding of Metal Materials,Henan University of Science and Technology(No.HKDNM201906).
文摘Although lithium-ion batteries(LIBs)currently dominate a wide spectrum of energy storage applications,they face challenges such as fast cycle life decay and poor stability that hinder their further application.To address these limitations,element doping has emerged as a prevalent strategy to enhance the discharge capacity and extend the durability of Li-Ni-Co-Mn(LNCM)ternary compounds.This study utilized a machine learning-driven feature screening method to effectively pinpoint four key features crucially impacting the initial discharge capacity(IC)of Li-Ni-Co-Mn(LNCM)ternary cathode materials.These features were also proved highly predictive for the 50^(th)cycle discharge capacity(EC).Additionally,the application of SHAP value analysis yielded an in-depth understanding of the interplay between these features and discharge performance.This insight offers valuable direction for future advancements in the development of LNCM cathode materials,effectively promoting this field toward greater efficiency and sustainability.
文摘This study aims to develop an accurate and robust machine learning model to predict the carbonation depth of fly ash concrete,overcoming the limitations of traditional predictive methods.Five ensemble-based models,such as adaptive boosting(AdaBoost),categorical boosting(CatBoost),gradient boosting regressor(GBR),hist gradient boosting regressor(HistGBR),and extreme gradient boosting(XGBoost),were developed and optimized using 729 high-quality dataset points incorporating seven input parameters,including cement,CO_(2),exposure time,water-binder ratio,fly ash,curing time,and compressive strength.Several performance evaluation metrics were used to compare the models.The GBR model emerged as the best-performing model,based on high coefficient of determination(R^(2))values and balanced error metrics across both validation and testing datasets.While all models performed exceptionally well on the training data,GBR demonstrated superior generalization capability,with R^(2) values of 0.9438 on the validation set and 0.9310 on the testing set.Furthermore,its low mean squared error(MSE),root mean square error(RMSE),mean absolute error(MAE),and median absolute error(MdAE)confirmed its robustness and accuracy.Moreover,shapley additive explanations(SHAP)analysis enhanced the interpretability of predictions,highlighting the curing time and exposure time as the most critical drivers of carbonation depth.
文摘Unconfined Compressive Strength(UCS)is a key parameter for the assessment of the stability and performance of stabilized soils,yet traditional laboratory testing is both time and resource intensive.In this study,an interpretable machine learning approach to UCS prediction is presented,pairing five models(Random Forest(RF),Gradient Boosting(GB),Extreme Gradient Boosting(XGB),CatBoost,and K-Nearest Neighbors(KNN))with SHapley Additive exPlanations(SHAP)for enhanced interpretability and to guide feature removal.A complete dataset of 12 geotechnical and chemical parameters,i.e.,Atterberg limits,compaction properties,stabilizer chemistry,dosage,curing time,was used to train and test the models.R2,RMSE,MSE,and MAE were used to assess performance.Initial results with all 12 features indicated that boosting-based models(GB,XGB,CatBoost)exhibited the highest predictive accuracy(R^(2)=0.93)with satisfactory generalization on test data,followed by RF and KNN.SHAP analysis consistently picked CaO content,curing time,stabilizer dosage,and compaction parameters as the most important features,aligning with established soil stabilization mechanisms.Models were then re-trained on the top 8 and top 5 SHAP-ranked features.Interestingly,GB,XGB,and CatBoost maintained comparable accuracy with reduced input sets,while RF was moderately sensitive and KNN was somewhat better owing to reduced dimensionality.The findings confirm that feature reduction through SHAP enables cost-effective UCS prediction through the reduction of laboratory test requirements without significant accuracy loss.The suggested hybrid approach offers an explainable,interpretable,and cost-effective tool for geotechnical engineering practice.
文摘The global shift towards sustainable and environmentally friendly transportation options has led to the increasing adoption of electric buses(Ebuses).To optimize the deployment and operational strategies of Ebuses,it is imperative to accurately predict their energy consumption under varying conditions,particularly in cold climates where battery life is typically degraded.The exploration of this aspect within the Canadian context has been limited.In addition,we have found that existing models in the literature perform poorly in the Canadian environment,giving rise to the need for new models using Canadian data.This paper focuses on the development,comparison,and evaluation of various data-driven models designed to predict the energy consumption of different Ebuses with different heating technologies under a wide range of climate conditions.We specifically use Canadian data as a good representative of cold climates in general.The results show that the performance of the different bus types varies substantially under the exact same conditions.In addition,tree-based family of models proves to be the most suitable approach for predicting the Ebus consumption rate.The results indicate that the Random Forest method emerges as the superior choice for predicting the energy consumption rate,with a resulting mean absolute error of 0.09–0.1 kWh/km observed across the different models.Furthermore,SHAP analysis shows that the main variables influencing the energy consumption rate depend on the type of heating system(using the battery for heating or using an auxiliary system that utilizes diesel for heating)adopted.
基金supported by National Natural Science Foundation of China(Nos.52271136 and 22373063)the Natural Science Foundation of Shaanxi Province in China(Nos.2021JC-06 and 2019TD-020)Fundamental Research Funds for the Central Universities of China(No.GK202203002).
文摘This study constructs 196 transition metals(TM)@S_(x)N_(y) single-atom catalysts(SACs)(x=0-4 and y=0-4)and employs the eXtreme Gradient Boosting(XGBoost)classification model in machine learning(ML)for effectively distinguishing qualified and unqualified catalysts.The prediction accuracy rate is high,up to 95%.The SHapley Additive exPlanations(SHAP)analysis reveals that the N≡N bond length and the number of outermost d electrons(N_(d))can well describe the nitrogen(N2)reduction reaction(NRR)activity.The relationships between N≡N,N_(d),the adsorption energies of different intermediates(ΔE_(*N_(2)),ΔE_(*N_(2)H),and ΔE_(*NH_(2))),the general descriptor(φ),and the Gibbs free energy of key steps(ΔG_(*N_(2)),ΔG_(*N_(2)-*N_(2)H),and ΔG_(*N_H(2)-*NH_(3)))indicate that moderate nitrogen activation can enhance the reaction activity.Among the 17 screened SACs,Mo@S3N1,and W@S_(3)N_(1) demonstrate the best catalytic performance,with limiting potential(U_(L))values of only-0.26 and-0.25 V under implicit solvation conditions.The electronic properties and variations in N≡N and TM-N bond lengths are investigated to reveal the origin of NRR activity.This study provides the decisive features and NRR dataset for ML research,as well as a feasible strategy for rational design of NRR SACs.