BACKGROUND Ischemic heart disease(IHD)impacts the quality of life and has the highest mortality rate of cardiovascular diseases globally.AIM To compare variations in the parameters of the single-lead electrocardiogram...BACKGROUND Ischemic heart disease(IHD)impacts the quality of life and has the highest mortality rate of cardiovascular diseases globally.AIM To compare variations in the parameters of the single-lead electrocardiogram(ECG)during resting conditions and physical exertion in individuals diagnosed with IHD and those without the condition using vasodilator-induced stress computed tomography(CT)myocardial perfusion imaging as the diagnostic reference standard.METHODS This single center observational study included 80 participants.The participants were aged≥40 years and given an informed written consent to participate in the study.Both groups,G1(n=31)with and G2(n=49)without post stress induced myocardial perfusion defect,passed cardiologist consultation,anthropometric measurements,blood pressure and pulse rate measurement,echocardiography,cardio-ankle vascular index,bicycle ergometry,recording 3-min single-lead ECG(Cardio-Qvark)before and just after bicycle ergometry followed by performing CT myocardial perfusion.The LASSO regression with nested cross-validation was used to find the association between Cardio-Qvark parameters and the existence of the perfusion defect.Statistical processing was performed with the R programming language v4.2,Python v.3.10[^R],and Statistica 12 program.RESULTS Bicycle ergometry yielded an area under the receiver operating characteristic curve of 50.7%[95%confidence interval(CI):0.388-0.625],specificity of 53.1%(95%CI:0.392-0.673),and sensitivity of 48.4%(95%CI:0.306-0.657).In contrast,the Cardio-Qvark test performed notably better with an area under the receiver operating characteristic curve of 67%(95%CI:0.530-0.801),specificity of 75.5%(95%CI:0.628-0.88),and sensitivity of 51.6%(95%CI:0.333-0.695).CONCLUSION The single-lead ECG has a relatively higher diagnostic accuracy compared with bicycle ergometry by using machine learning models,but the difference was not statistically significant.However,further investigations are required to uncover the hidden capabilities of single-lead ECG in IHD diagnosis.展开更多
The high porosity and tunable chemical functionality of metal-organic frameworks(MOFs)make it a promising catalyst design platform.High-throughput screening of catalytic performance is feasible since the large MOF str...The high porosity and tunable chemical functionality of metal-organic frameworks(MOFs)make it a promising catalyst design platform.High-throughput screening of catalytic performance is feasible since the large MOF structure database is available.In this study,we report a machine learning model for high-throughput screening of MOF catalysts for the CO_(2) cycloaddition reaction.The descriptors for model training were judiciously chosen according to the reaction mechanism,which leads to high accuracy up to 97%for the 75%quantile of the training set as the classification criterion.The feature contribution was further evaluated with SHAP and PDP analysis to provide a certain physical understanding.12,415 hypothetical MOF structures and 100 reported MOFs were evaluated under 100℃ and 1 bar within one day using the model,and 239 potentially efficient catalysts were discovered.Among them,MOF-76(Y)achieved the top performance experimentally among reported MOFs,in good agreement with the prediction.展开更多
Geared-rotor systems are critical components in mechanical applications,and their performance can be severely affected by faults,such as profile errors,wear,pitting,spalling,flaking,and cracks.Profile errors in gear t...Geared-rotor systems are critical components in mechanical applications,and their performance can be severely affected by faults,such as profile errors,wear,pitting,spalling,flaking,and cracks.Profile errors in gear teeth are inevitable in manufacturing and subsequently accumulate during operations.This work aims to predict the status of gear profile deviations based on gear dynamics response using the digital model of an experimental rig setup.The digital model comprises detailed CAD models and has been validated against the expected physical behavior using commercial finite element analysis software.The different profile deviations are then modeled using gear charts,and the dynamic response is captured through simulations.The various features are then obtained by signal processing,and various ML models are then evaluated to predict the fault/no-fault condition for the gear.The best performance is achieved by an artificial neural network with a prediction accuracy of 97.5%,which concludes a strong influence on the dynamics of the gear rotor system due to profile deviations.展开更多
Background Cotton is one of the most important commercial crops after food crops,especially in countries like India,where it’s grown extensively under rainfed conditions.Because of its usage in multiple industries,su...Background Cotton is one of the most important commercial crops after food crops,especially in countries like India,where it’s grown extensively under rainfed conditions.Because of its usage in multiple industries,such as textile,medicine,and automobile industries,it has greater commercial importance.The crop’s performance is greatly influenced by prevailing weather dynamics.As climate changes,assessing how weather changes affect crop performance is essential.Among various techniques that are available,crop models are the most effective and widely used tools for predicting yields.Results This study compares statistical and machine learning models to assess their ability to predict cotton yield across major producing districts of Karnataka,India,utilizing a long-term dataset spanning from 1990 to 2023 that includes yield and weather factors.The artificial neural networks(ANNs)performed superiorly with acceptable yield deviations ranging within±10%during both vegetative stage(F1)and mid stage(F2)for cotton.The model evaluation metrics such as root mean square error(RMSE),normalized root mean square error(nRMSE),and modelling efficiency(EF)were also within the acceptance limits in most districts.Furthermore,the tested ANN model was used to assess the importance of the dominant weather factors influencing crop yield in each district.Specifically,the use of morning relative humidity as an individual parameter and its interaction with maximum and minimum tempera-ture had a major influence on cotton yield in most of the yield predicted districts.These differences highlighted the differential interactions of weather factors in each district for cotton yield formation,highlighting individual response of each weather factor under different soils and management conditions over the major cotton growing districts of Karnataka.Conclusions Compared with statistical models,machine learning models such as ANNs proved higher efficiency in forecasting the cotton yield due to their ability to consider the interactive effects of weather factors on yield forma-tion at different growth stages.This highlights the best suitability of ANNs for yield forecasting in rainfed conditions and for the study on relative impacts of weather factors on yield.Thus,the study aims to provide valuable insights to support stakeholders in planning effective crop management strategies and formulating relevant policies.展开更多
BACKGROUND Delayed wound healing is a common clinical complication following gastric cancer radical surgery,adversely affecting patient prognosis.With advances in artificial intelligence,machine learning offers a prom...BACKGROUND Delayed wound healing is a common clinical complication following gastric cancer radical surgery,adversely affecting patient prognosis.With advances in artificial intelligence,machine learning offers a promising approach for developing predictive models that can identify high-risk patients and support early clinical intervention.AIM To construct machine learning-based risk prediction models for delayed wound healing after gastric cancer surgery to support clinical decision-making.METHODS We reviewed a total of 514 patients who underwent gastric cancer radical surgery under general anesthesia from January 1,2014 to December 30,2023.Seventy percent of the dataset was selected as the training set and 30%as the validation set.Decision trees,support vector machines,and logistic regression were used to construct a risk prediction model.The performance of the model was evaluated using accuracy,recall,precision,F1 index,and area under the receiver operating characteristic curve and decision curve.RESULTS This study included five variables:Sex,elderly,duration of abdominal drainage,preoperative white blood cell(WBC)count,and absolute value of neutrophils.These variables were selected based on their clinical relevance and statistical significance in predicting delayed wound healing.The results showed that the decision tree model outperformed the logistic regression and support vector machine models in both the training and validation sets.Specifically,the decision tree model achieved higher accuracy,F1 index,recall,and area under the curve(AUC)values.The support vector machine model also demonstrated better performance than logistic regression,with higher accuracy,recall,and F1 index,but a slightly lower AUC.The key variables of sex,elderly,duration of abdominal drainage,preoperative WBC count,and absolute value of neutrophils were found to be strong predictors of delayed wound healing.Patients with longer duration of abdominal drainage had a significantly higher risk of delayed wound healing,with a risk ratio of 1.579 compared to those with shorter duration of abdominal drainage.Similarly,preoperative WBC count,sex,elderly,and absolute value of neutrophils were associated with a higher risk of delayed wound healing,highlighting the importance of these variables in the model.CONCLUSION The model is able to identify high-risk patients based on sex,elderly,duration of abdominal drainage,preoperative WBC count,and absolute value of neutrophils can provide valuable insights for clinical decision-making.展开更多
Weather forecasts from numerical weather prediction models play a central role in solar energy forecasting,where a cascade of physics-based models is used in a model chain approach to convert forecasts of solar irradi...Weather forecasts from numerical weather prediction models play a central role in solar energy forecasting,where a cascade of physics-based models is used in a model chain approach to convert forecasts of solar irradiance to solar power production.Ensemble simulations from such weather models aim to quantify uncertainty in the future development of the weather,and can be used to propagate this uncertainty through the model chain to generate probabilistic solar energy predictions.However,ensemble prediction systems are known to exhibit systematic errors,and thus require post-processing to obtain accurate and reliable probabilistic forecasts.The overarching aim of our study is to systematically evaluate different strategies to apply post-processing in model chain approaches with a specific focus on solar energy:not applying any post-processing at all;post-processing only the irradiance predictions before the conversion;post-processing only the solar power predictions obtained from the model chain;or applying post-processing in both steps.In a case study based on a benchmark dataset for the Jacumba solar plant in the U.S.,we develop statistical and machine learning methods for postprocessing ensemble predictions of global horizontal irradiance(GHI)and solar power generation.Further,we propose a neural-network-based model for direct solar power forecasting that bypasses the model chain.Our results indicate that postprocessing substantially improves the solar power generation forecasts,in particular when post-processing is applied to the power predictions.The machine learning methods for post-processing slightly outperform the statistical methods,and the direct forecasting approach performs comparably to the post-processing strategies.展开更多
Machine learning(ML)techniques have emerged as powerful tools for improving the predictive capabilities of Reynolds-averaged Navier-Stokes(RANS)turbulence models in separated flows.This improvement is achieved by leve...Machine learning(ML)techniques have emerged as powerful tools for improving the predictive capabilities of Reynolds-averaged Navier-Stokes(RANS)turbulence models in separated flows.This improvement is achieved by leveraging complex ML models,such as those developed using field inversion and machine learning(FIML),to dynamically adjust the constants within the baseline RANS model.However,the ML models often overlook the fundamental calibrations of the RANS turbulence model.Consequently,the basic calibration of the baseline RANS model is disrupted,leading to a degradation in the accuracy,particularly in basic wall-attached flows outside of the training set.To address this issue,a modified version of the Spalart-Allmaras(SA)turbulence model,known as Rubber-band SA(RBSA),has been proposed recently.This modification involves identifying and embedding constraints related to basic wall-attached flows directly into the model.It is shown that no matter how the parameters of the RBSA model are adjusted as constants throughout the flow field,its accuracy in wall-attached flows remains unaffected.In this paper,we propose a new constraint for the RBSA model,which better safeguards the law of wall in extreme conditions where the model parameter is adjusted dramatically.The resultant model is called the RBSA-poly model.We then show that when combined with FIML augmentation,the RBSA-poly model effectively preserves the accuracy of simple wall-attached flows,even when the adjusted parameters become functions of local flow variables rather than constants.A comparative analysis with the FIML-augmented original SA model reveals that the augmented RBSA-poly model reduces error in basic wall-attached flows by 50%while maintaining comparable accuracy in trained separated flows.These findings confirm the effectiveness of utilizing FIML in conjunction with the RBSA model,offering superior accuracy retention in cardinal flows.展开更多
Driven by rapid technological advancements and economic growth,mineral extraction and metal refining have increased dramatically,generating huge volumes of tailings and mine waste(TMWs).Investigating the morphological...Driven by rapid technological advancements and economic growth,mineral extraction and metal refining have increased dramatically,generating huge volumes of tailings and mine waste(TMWs).Investigating the morphological fractions of heavy metals and metalloids(HMMs)in TMWs is key to evaluating their leaching potential into the environment;however,traditional experiments are time-consuming and labor-intensive.In this study,10 machine learning(ML)algorithms were used and compared for rapidly predicting the morphological fractions of HMMs in TMWs.A dataset comprising 2376 data points was used,with mineral composition,elemental properties,and total concentration used as inputs and concentration of morphological fraction used as output.After grid search optimization,the extra tree model performed the best,achieving coefficient of determination(R2)of 0.946 and 0.942 on the validation and test sets,respectively.Electronegativity was found to have the greatest impact on the morphological fraction.The models’performance was enhanced by applying an ensemble method to the top three optimal ML models,including gradient boosting decision tree,extra trees and categorical boosting.Overall,the proposed framework can accurately predict the concentrations of different morphological fractions of HMMs in TMWs.This approach can minimize detection time,aid in the safe management and recovery of TMWs.展开更多
Objective:To analyze the risk factors of anxiety in young hypertensive patients and build a prediction model to provide a scientific basis for clinical diagnosis and treatment.Methods:According to the research content...Objective:To analyze the risk factors of anxiety in young hypertensive patients and build a prediction model to provide a scientific basis for clinical diagnosis and treatment.Methods:According to the research content,young hypertensive patients admitted to the hospital from January 2022 to December 2024 were selected as the research object and at least 950 patients were included according to the sample size calculation.According to the existence of anxiety,950 patients were divided into control group(n=650)and observation group(n=300),and the clinical data of all patients were collected for univariate analysis and multivariate Logistic regression analysis to get the risk factors of hypertension patients complicated with anxiety in.All patients were randomly divided into a training set(n=665)and a test set(n=285)according to the ratio of 7:3,and the evaluation efficiency of different prediction models was obtained by using machine learning algorithm.To evaluate the clinical application effect of the prediction model.Results:(1)Univariate analysis showed that age,BMI,education background,marital status,smoking,drinking,sleep disorder,family history of hypertension,history of diabetes,history of hyperlipidemia,history of cerebral infarction,and TC were important risk factors for young hypertensive patients complicated with anxiety.(2)Multivariate Logistic regression analysis showed that hypertension history,drinking history,coronary heart disease history,diabetes history,BMI,TC,and TG are important independent risk factors for young hypertensive patients complicated with anxiety.(3)Extra Trees has the highest predictive power for young people with hypertension complicated with anxiety,while Decision-Tree has the lowest predictive power.Conclusion:Hypertension history,drinking history,coronary heart disease history,diabetes history,BMI,TC,and TG are important independent risk factors that affect the anxiety of young hypertensive patients.Extra Trees model has the best prediction efficiency among different groups of models.展开更多
The aim of this paper is to explore the effect of geometrical parameters on ultimate load-carrying capacity of a circular hollow section(CHS)X-joint under axial compression of the brace end.First of all,finite element...The aim of this paper is to explore the effect of geometrical parameters on ultimate load-carrying capacity of a circular hollow section(CHS)X-joint under axial compression of the brace end.First of all,finite element(FE)model to calculate ultimate load-carrying capacity of the CHS X-joint subjected to uniaxial load of the brace is constructed,and the calculated load–displacement curves are compared to the experimental ones.After validation of the FE model,46080 groups of FE calculation models with different geometrical parameters are generated by means of parametric modeling.Subsequently,eight variables including gusset thickness and chord thickness are set as input to predict load-carrying capacity of the CHS X-joint by four machine learning(ML)algorithms,i.e.,Generalized Regression Neural Network,Support Vector Machine,random forest(RF),and Extreme Gradient Boosting(XGBoost).Finally,the constructed ML prediction models are interpreted by SHapley Additive exPlanations,to explore the impact weight of each factor on ultimate load-carrying capacity of the joint.The results show that all the four models can predict the load-carrying capacity of the subject accurately,with all the R2 values greater than 0.97.In addition,RF model yields the minimum mean-square error,Root Mean Squared Error,Mean Absolute Error,and Mean Absolute Percentage Error values,and the greatest R2 value,while the prediction accuracy of XGBoost is relatively worse.Among all the eight considered geometrical parameters,brace diameter has the strongest impact on load-carrying capacity of the joint,followed by chord thickness,chord ring width,chord ring thickness,brace ring width,and brace thickness,while the thicknesses of the gusset plate and brace have marginal influence on load-carrying capacity.The study of the current paper can provide guidelines for dimension design of CHS X-joints.展开更多
Objective:To construct a clinical prediction model of acupuncture treatment for diminished ovarian reserve(DOR)based on a machine learning algorithm to provide a clinical prediction of acupuncture for ameliorating pre...Objective:To construct a clinical prediction model of acupuncture treatment for diminished ovarian reserve(DOR)based on a machine learning algorithm to provide a clinical prediction of acupuncture for ameliorating pregnancy outcomes in DOR.Methods:We enrolled 377 DOR patients treated with acupuncture and with records of pregnancy outcomes(139 cases of pregnancy and 238 cases failed)exported from the International Patient Registry Platform of Acupuncture-moxibustion(IPRPAM).The predictive variables were determined using Spearman’s correlation analysis and feature engineering methods.The model was constructed by adopting logistic regression,naïve Bayes,random forest,support vector machine,extreme gradient boosting,the knearest neighbor algorithm,linear discriminant analysis,and neural network methods.The models were validated by the area under the curve(AUC),accuracy(ACC),and importance sequencing,and individual pregnancy prediction was conducted for the best-performing model.Results:The key factors determining pregnancy after acupuncture in patients with DOR were age,luteinizing hormone(LH)level after treatment,follicle-stimulating hormone(FSH)level after treatment,the ratio of FSH to LH(FSH/LH)after treatment,and history of acupuncture treatment.Random forest model ACC was 0.95,Fβwas 0.93,Logloss was 0.30,Logloss value was the lowest,the model variables exhibited the highest accuracy and precision.Conclusion:The random forest model for the effects of acupuncture on pregnancy outcomes in patients with DOR,constructed based on the IPRPAM,presents a favorable value for clinical application.展开更多
BACKGROUND Relieving pain is central to the early management of knee osteoarthritis,with a plethora of pharmacological agents licensed for this purpose.Intra-articular corticosteroid injections are a widely used optio...BACKGROUND Relieving pain is central to the early management of knee osteoarthritis,with a plethora of pharmacological agents licensed for this purpose.Intra-articular corticosteroid injections are a widely used option,albeit with variable efficacy.AIM To develop a machine learning(ML)model that predicts which patients will benefit from corticosteroid injections.METHODS Data from two prospective cohort studies[Osteoarthritis(OA)Initiative and Multicentre OA Study]was combined.The primary outcome was patientreported pain score following corticosteroid injection,assessed using the Western Ontario and McMaster Universities OA pain scale,with significant change defined using minimally clinically important difference and meaningful within person change.A ML algorithm was developed,utilizing linear discriminant analysis,to predict symptomatic improvement,and examine the association between pain scores and patient factors by calculating the sensitivity,specificity,positive predictive value,negative predictive value,accuracy,and F2 score.RESULTS A total of 330 patients were included,with a mean age of 63.4(SD:8.3).The mean Western Ontario and McMaster Universities OA pain score was 5.2(SD:4.1),with only 25.5%of patients achieving significant improvement in pain following corticosteroid injection.The ML model generated an accuracy of 67.8%(95%confidence interval:64.6%-70.9%),F1 score of 30.8%,and an area under the curve score of 0.60.CONCLUSION The model demonstrated feasibility to assist clinicians with decision-making in patient selection for corticosteroid injections.Further studies are required to improve the model prior to testing in clinical settings.展开更多
Assessing the stability of slopes is one of the crucial tasks of geotechnical engineering for assessing and managing risks related to natural hazards,directly affecting safety and sustainable development.This study pr...Assessing the stability of slopes is one of the crucial tasks of geotechnical engineering for assessing and managing risks related to natural hazards,directly affecting safety and sustainable development.This study primarily focuses on developing robust and practical hybrid models to predict the slope stability status of circular failure mode.For this purpose,three robust models were developed using a database including 627 case histories of slope stability status.The models were developed using the random forest(RF),support vector machine(SVM),and extreme gradient boosting(XGB)techniques,employing 5-fold cross validation approach.To enhance the performance of models,this study employs Bayesian optimizer(BO)to fine-tuning their hyperparameters.The results indicate that the performance order of the three developed models is RF-BO>SVM-BO>XGB-BO.Furthermore,comparing the developed models with previous models,it was found that the RF-BO model can effectively determine the slope stability status with outstanding performance.This implies that the RF-BO model could serve as a dependable tool for project managers,assisting in the evaluation of slope stability during both the design and operational phases of projects,despite the inherent challenges in this domain.The results regarding the importance of influencing parameters indicate that cohesion,friction angle,and slope height exert the most significant impact on slope stability status.This suggests that concentrating on these parameters and employing the RF-BO model can effectively mitigate the severity of geohazards in the short-term and contribute to the attainment of long-term sustainable development objectives.展开更多
The inelastic behavior of thermoplastic polymers may involve shearing and crazing,and both depend on temperature and strain rate.Traditional constitutive models account for temperature and strain rate through phenomen...The inelastic behavior of thermoplastic polymers may involve shearing and crazing,and both depend on temperature and strain rate.Traditional constitutive models account for temperature and strain rate through phenomenological or empirical formulas.In this study,we present a physics-guided machine learning(ML)framework to model shear and craze in polymeric materials.The effects of all three principal stresses for the craze initiation are considered other than the maximum tensile principal stress solely in previous works.We implemented a finite element framework through a user-defined material subroutine and applied the constitutive model to the deformation in three polymers(PLA 4060D,PLA 3051D,and HIPS).The result shows that our ML-based model can predict the stress-strain and volume-strain responses at different strain rates with high accuracy.Notably,the ML-based approach needs no assumptions about yield criteria or hardening laws.This work highlights the potential of hybrid physics-ML paradigms to overcome the trade-offs between model complexity and accuracy in polymer mechanics,paving the way for computationally efficient and generalizable constitutive models for thermoplastic materials.展开更多
BACKGROUND Colorectal polyps are precancerous diseases of colorectal cancer.Early detection and resection of colorectal polyps can effectively reduce the mortality of colorectal cancer.Endoscopic mucosal resection(EMR...BACKGROUND Colorectal polyps are precancerous diseases of colorectal cancer.Early detection and resection of colorectal polyps can effectively reduce the mortality of colorectal cancer.Endoscopic mucosal resection(EMR)is a common polypectomy proce-dure in clinical practice,but it has a high postoperative recurrence rate.Currently,there is no predictive model for the recurrence of colorectal polyps after EMR.AIM To construct and validate a machine learning(ML)model for predicting the risk of colorectal polyp recurrence one year after EMR.METHODS This study retrospectively collected data from 1694 patients at three medical centers in Xuzhou.Additionally,a total of 166 patients were collected to form a prospective validation set.Feature variable screening was conducted using uni-variate and multivariate logistic regression analyses,and five ML algorithms were used to construct the predictive models.The optimal models were evaluated based on different performance metrics.Decision curve analysis(DCA)and SHapley Additive exPlanation(SHAP)analysis were performed to assess clinical applicability and predictor importance.RESULTS Multivariate logistic regression analysis identified 8 independent risk factors for colorectal polyp recurrence one year after EMR(P<0.05).Among the models,eXtreme Gradient Boosting(XGBoost)demonstrated the highest area under the curve(AUC)in the training set,internal validation set,and prospective validation set,with AUCs of 0.909(95%CI:0.89-0.92),0.921(95%CI:0.90-0.94),and 0.963(95%CI:0.94-0.99),respectively.DCA indicated favorable clinical utility for the XGBoost model.SHAP analysis identified smoking history,family history,and age as the top three most important predictors in the model.CONCLUSION The XGBoost model has the best predictive performance and can assist clinicians in providing individualized colonoscopy follow-up recommendations.展开更多
The investigation by Zhu et al on the assessment of cellular proliferation markers to assist clinical decision-making in patients with hepatocellular carcinoma(HCC)using a machine learning model-based approach is a sc...The investigation by Zhu et al on the assessment of cellular proliferation markers to assist clinical decision-making in patients with hepatocellular carcinoma(HCC)using a machine learning model-based approach is a scientific approach.This study looked into the possibilities of using a Ki-67(a marker for cell proliferation)expression-based machine learning model to help doctors make decisions about treatment options for patients with HCC before surgery.The study used reconstructed tomography images of 164 patients with confirmed HCC from the intratumoral and peritumoral regions.The features were chosen using various statistical methods,including least absolute shrinkage and selection operator regression.Also,a nomogram was made using Radscore and clinical risk factors.It was tested for its ability to predict receiver operating characteristic curves and calibration curves,and its clinical benefits were found using decision curve analysis.The calibration curve demonstrated excellent consistency between predicted and actual probability,and the decision curve confirmed its clinical benefit.The proposed model is helpful for treating patients with HCC because the predicted and actual probabilities are very close to each other,as shown by the decision curve analysis.Further prospective studies are required,incorporating a multicenter and large sample size design,additional relevant exclusion criteria,information on tumors(size,number,and grade),and cancer stage to strengthen the clinical benefit in patients with HCC.展开更多
The objective of this study is to develop an advanced approach to variogram modelling by integrating genetic algorithms(GA)with machine learning-based linear regression,aiming to improve the accuracy and efficiency of...The objective of this study is to develop an advanced approach to variogram modelling by integrating genetic algorithms(GA)with machine learning-based linear regression,aiming to improve the accuracy and efficiency of geostatistical analysis,particularly in mineral exploration.The study combines GA and machine learning to optimise variogram parameters,including range,sill,and nugget,by minimising the root mean square error(RMSE)and maximising the coefficient of determination(R^(2)).The experimental variograms were computed and modelled using theoretical models,followed by optimisation via evolutionary algorithms.The method was applied to gravity data from the Ngoura-Batouri-Kette mining district in Eastern Cameroon,covering 141 data points.Sequential Gaussian Simulations(SGS)were employed for predictive mapping to validate simulated results against true values.Key findings show variograms with ranges between 24.71 km and 49.77 km,opti-mised RMSE and R^(2) values of 11.21 mGal^(2) and 0.969,respectively,after 42 generations of GA optimisation.Predictive mapping using SGS demonstrated that simulated values closely matched true values,with the simu-lated mean at 21.75 mGal compared to the true mean of 25.16 mGal,and variances of 465.70 mGal^(2) and 555.28 mGal^(2),respectively.The results confirmed spatial variability and anisotropies in the N170-N210 directions,consistent with prior studies.This work presents a novel integration of GA and machine learning for variogram modelling,offering an automated,efficient approach to parameter estimation.The methodology significantly enhances predictive geostatistical models,contributing to the advancement of mineral exploration and improving the precision and speed of decision-making in the petroleum and mining industries.展开更多
The backwater effect caused by tributary inflow can significantly elevate the water level profile upstream of a confluence point.However,the influence of mainstream and confluence discharges on the backwater effect in...The backwater effect caused by tributary inflow can significantly elevate the water level profile upstream of a confluence point.However,the influence of mainstream and confluence discharges on the backwater effect in a river reach remains unclear.In this study,various hydrological data collected from the Jingjiang Reach of the Yangtze River in China were statistically analyzed to determine the backwater degree and range with three representative mainstream discharges.The results indicated that the backwater degree increased with mainstream discharge,and a positive relationship was observed between the runoff ratio and backwater degree at specific representative mainstream discharges.Following the operation of the Three Gorges Project,the backwater effect in the Jingjiang Reach diminished.For instance,mean backwater degrees for low,moderate,and high mainstream discharges were recorded as 0.83 m,1.61 m,and 2.41 m during the period from 1990 to 2002,whereas these values decreased to 0.30 m,0.95 m,and 2.08 m from 2009 to 2020.The backwater range extended upstream as mainstream discharge increased from 7000 m3/s to 30000 m3/s.Moreover,a random forest-based machine learning model was used to quantify the backwater effect with varying mainstream and confluence discharges,accounting for the impacts of mainstream discharge,confluence discharge,and channel degradation in the Jingjiang Reach.At the Jianli Hydrological Station,a decrease in mainstream discharge during flood seasons resulted in a 7%–15%increase in monthly mean backwater degree,while an increase in mainstream discharge during dry seasons led to a 1%–15%decrease in monthly mean backwater degree.Furthermore,increasing confluence discharge from Dongting Lake during June to July and September to November resulted in an 11%–42%increase in monthly mean backwater degree.Continuous channel degradation in the Jingjiang Reach contributed to a 6%–19%decrease in monthly mean backwater degree.Under the influence of these factors,the monthly mean backwater degree in 2017 varied from a decrease of 53%to an increase of 37%compared to corresponding values in 1991.展开更多
The authors regret that the original publication of this paper did not include Jawad Fayaz as a co-author.After further discussions and a thorough review of the research contributions,it was agreed that his significan...The authors regret that the original publication of this paper did not include Jawad Fayaz as a co-author.After further discussions and a thorough review of the research contributions,it was agreed that his significant contributions to the foundational aspects of the research warranted recognition,and he has now been added as a co-author.展开更多
Understanding spatial heterogeneity in groundwater responses to multiple factors is critical for water resource management in coastal cities.Daily groundwater depth(GWD)data from 43 wells(2018-2022)were collected in t...Understanding spatial heterogeneity in groundwater responses to multiple factors is critical for water resource management in coastal cities.Daily groundwater depth(GWD)data from 43 wells(2018-2022)were collected in three coastal cities in Jiangsu Province,China.Seasonal and Trend decomposition using Loess(STL)together with wavelet analysis and empirical mode decomposition were applied to identify tide-influenced wells while remaining wells were grouped by hierarchical clustering analysis(HCA).Machine learning models were developed to predict GWD,then their response to natural conditions and human activities was assessed by the Shapley Additive exPlanations(SHAP)method.Results showed that eXtreme Gradient Boosting(XGB)was superior to other models in terms of prediction performance and computational efficiency(R^(2)>0.95).GWD in Yancheng and southern Lianyungang were greater than those in Nantong,exhibiting larger fluctuations.Groundwater within 5 km of the coastline was affected by tides,with more pronounced effects in agricultural areas compared to urban areas.Shallow groundwater(3-7 m depth)responded immediately(0-1 day)to rainfall,primarily influenced by farmland and topography(slope and distance from rivers).Rainfall recharge to groundwater peaked at 50%farmland coverage,but this effect was suppressed by high temperatures(>30℃)which intensified as distance from rivers increased,especially in forest and grassland.Deep groundwater(>10 m)showed delayed responses to rainfall(1-4 days)and temperature(10-15 days),with GDP as the primary influence,followed by agricultural irrigation and population density.Farmland helped to maintain stable GWD in low population density regions,while excessive farmland coverage(>90%)led to overexploitation.In the early stages of GDP development,increased industrial and agricultural water demand led to GWD decline,but as GDP levels significantly improved,groundwater consumption pressure gradually eased.This methodological framework is applicable not only to coastal cities in China but also could be extended to coastal regions worldwide.展开更多
基金Supported by Government Assignment,No.1023022600020-6RSF Grant,No.24-15-00549Ministry of Science and Higher Education of the Russian Federation within the Framework of State Support for the Creation and Development of World-Class Research Center,No.075-15-2022-304.
文摘BACKGROUND Ischemic heart disease(IHD)impacts the quality of life and has the highest mortality rate of cardiovascular diseases globally.AIM To compare variations in the parameters of the single-lead electrocardiogram(ECG)during resting conditions and physical exertion in individuals diagnosed with IHD and those without the condition using vasodilator-induced stress computed tomography(CT)myocardial perfusion imaging as the diagnostic reference standard.METHODS This single center observational study included 80 participants.The participants were aged≥40 years and given an informed written consent to participate in the study.Both groups,G1(n=31)with and G2(n=49)without post stress induced myocardial perfusion defect,passed cardiologist consultation,anthropometric measurements,blood pressure and pulse rate measurement,echocardiography,cardio-ankle vascular index,bicycle ergometry,recording 3-min single-lead ECG(Cardio-Qvark)before and just after bicycle ergometry followed by performing CT myocardial perfusion.The LASSO regression with nested cross-validation was used to find the association between Cardio-Qvark parameters and the existence of the perfusion defect.Statistical processing was performed with the R programming language v4.2,Python v.3.10[^R],and Statistica 12 program.RESULTS Bicycle ergometry yielded an area under the receiver operating characteristic curve of 50.7%[95%confidence interval(CI):0.388-0.625],specificity of 53.1%(95%CI:0.392-0.673),and sensitivity of 48.4%(95%CI:0.306-0.657).In contrast,the Cardio-Qvark test performed notably better with an area under the receiver operating characteristic curve of 67%(95%CI:0.530-0.801),specificity of 75.5%(95%CI:0.628-0.88),and sensitivity of 51.6%(95%CI:0.333-0.695).CONCLUSION The single-lead ECG has a relatively higher diagnostic accuracy compared with bicycle ergometry by using machine learning models,but the difference was not statistically significant.However,further investigations are required to uncover the hidden capabilities of single-lead ECG in IHD diagnosis.
基金financial support from the National Key Research and Development Program of China(2021YFB 3501501)the National Natural Science Foundation of China(No.22225803,22038001,22108007 and 22278011)+1 种基金Beijing Natural Science Foundation(No.Z230023)Beijing Science and Technology Commission(No.Z211100004321001).
文摘The high porosity and tunable chemical functionality of metal-organic frameworks(MOFs)make it a promising catalyst design platform.High-throughput screening of catalytic performance is feasible since the large MOF structure database is available.In this study,we report a machine learning model for high-throughput screening of MOF catalysts for the CO_(2) cycloaddition reaction.The descriptors for model training were judiciously chosen according to the reaction mechanism,which leads to high accuracy up to 97%for the 75%quantile of the training set as the classification criterion.The feature contribution was further evaluated with SHAP and PDP analysis to provide a certain physical understanding.12,415 hypothetical MOF structures and 100 reported MOFs were evaluated under 100℃ and 1 bar within one day using the model,and 239 potentially efficient catalysts were discovered.Among them,MOF-76(Y)achieved the top performance experimentally among reported MOFs,in good agreement with the prediction.
文摘Geared-rotor systems are critical components in mechanical applications,and their performance can be severely affected by faults,such as profile errors,wear,pitting,spalling,flaking,and cracks.Profile errors in gear teeth are inevitable in manufacturing and subsequently accumulate during operations.This work aims to predict the status of gear profile deviations based on gear dynamics response using the digital model of an experimental rig setup.The digital model comprises detailed CAD models and has been validated against the expected physical behavior using commercial finite element analysis software.The different profile deviations are then modeled using gear charts,and the dynamic response is captured through simulations.The various features are then obtained by signal processing,and various ML models are then evaluated to predict the fault/no-fault condition for the gear.The best performance is achieved by an artificial neural network with a prediction accuracy of 97.5%,which concludes a strong influence on the dynamics of the gear rotor system due to profile deviations.
基金funded through India Meteorological Department,New Delhi,India under the Forecasting Agricultural output using Space,Agrometeorol ogy and Land based observations(FASAL)project and fund number:No.ASC/FASAL/KT-11/01/HQ-2010.
文摘Background Cotton is one of the most important commercial crops after food crops,especially in countries like India,where it’s grown extensively under rainfed conditions.Because of its usage in multiple industries,such as textile,medicine,and automobile industries,it has greater commercial importance.The crop’s performance is greatly influenced by prevailing weather dynamics.As climate changes,assessing how weather changes affect crop performance is essential.Among various techniques that are available,crop models are the most effective and widely used tools for predicting yields.Results This study compares statistical and machine learning models to assess their ability to predict cotton yield across major producing districts of Karnataka,India,utilizing a long-term dataset spanning from 1990 to 2023 that includes yield and weather factors.The artificial neural networks(ANNs)performed superiorly with acceptable yield deviations ranging within±10%during both vegetative stage(F1)and mid stage(F2)for cotton.The model evaluation metrics such as root mean square error(RMSE),normalized root mean square error(nRMSE),and modelling efficiency(EF)were also within the acceptance limits in most districts.Furthermore,the tested ANN model was used to assess the importance of the dominant weather factors influencing crop yield in each district.Specifically,the use of morning relative humidity as an individual parameter and its interaction with maximum and minimum tempera-ture had a major influence on cotton yield in most of the yield predicted districts.These differences highlighted the differential interactions of weather factors in each district for cotton yield formation,highlighting individual response of each weather factor under different soils and management conditions over the major cotton growing districts of Karnataka.Conclusions Compared with statistical models,machine learning models such as ANNs proved higher efficiency in forecasting the cotton yield due to their ability to consider the interactive effects of weather factors on yield forma-tion at different growth stages.This highlights the best suitability of ANNs for yield forecasting in rainfed conditions and for the study on relative impacts of weather factors on yield.Thus,the study aims to provide valuable insights to support stakeholders in planning effective crop management strategies and formulating relevant policies.
基金Supported by the Shandong Province Traditional Chinese Medicine Technology Project,No.Q-2023147the Weifang Health Commission Research Project,No.WFWSJK-2023-033+3 种基金the Weifang City Science and Technology Development Plan(Medical Category),No.2023YX057the Weifang Medical University 2022 Campus Level Education and Teaching Reform and Research Project,No.2022YB051Norman Bethune Public Welfare Foundation,No.ezmr2023-037Special Research Project on Optimized Management of Acute Pain,Wu Jieping Medical Foundation.
文摘BACKGROUND Delayed wound healing is a common clinical complication following gastric cancer radical surgery,adversely affecting patient prognosis.With advances in artificial intelligence,machine learning offers a promising approach for developing predictive models that can identify high-risk patients and support early clinical intervention.AIM To construct machine learning-based risk prediction models for delayed wound healing after gastric cancer surgery to support clinical decision-making.METHODS We reviewed a total of 514 patients who underwent gastric cancer radical surgery under general anesthesia from January 1,2014 to December 30,2023.Seventy percent of the dataset was selected as the training set and 30%as the validation set.Decision trees,support vector machines,and logistic regression were used to construct a risk prediction model.The performance of the model was evaluated using accuracy,recall,precision,F1 index,and area under the receiver operating characteristic curve and decision curve.RESULTS This study included five variables:Sex,elderly,duration of abdominal drainage,preoperative white blood cell(WBC)count,and absolute value of neutrophils.These variables were selected based on their clinical relevance and statistical significance in predicting delayed wound healing.The results showed that the decision tree model outperformed the logistic regression and support vector machine models in both the training and validation sets.Specifically,the decision tree model achieved higher accuracy,F1 index,recall,and area under the curve(AUC)values.The support vector machine model also demonstrated better performance than logistic regression,with higher accuracy,recall,and F1 index,but a slightly lower AUC.The key variables of sex,elderly,duration of abdominal drainage,preoperative WBC count,and absolute value of neutrophils were found to be strong predictors of delayed wound healing.Patients with longer duration of abdominal drainage had a significantly higher risk of delayed wound healing,with a risk ratio of 1.579 compared to those with shorter duration of abdominal drainage.Similarly,preoperative WBC count,sex,elderly,and absolute value of neutrophils were associated with a higher risk of delayed wound healing,highlighting the importance of these variables in the model.CONCLUSION The model is able to identify high-risk patients based on sex,elderly,duration of abdominal drainage,preoperative WBC count,and absolute value of neutrophils can provide valuable insights for clinical decision-making.
基金the Young Investigator Group“Artificial Intelligence for Probabilistic Weather Forecasting”funded by the Vector Stiftungfunding from the Federal Ministry of Education and Research(BMBF)and the Baden-Württemberg Ministry of Science as part of the Excellence Strategy of the German Federal and State Governments。
文摘Weather forecasts from numerical weather prediction models play a central role in solar energy forecasting,where a cascade of physics-based models is used in a model chain approach to convert forecasts of solar irradiance to solar power production.Ensemble simulations from such weather models aim to quantify uncertainty in the future development of the weather,and can be used to propagate this uncertainty through the model chain to generate probabilistic solar energy predictions.However,ensemble prediction systems are known to exhibit systematic errors,and thus require post-processing to obtain accurate and reliable probabilistic forecasts.The overarching aim of our study is to systematically evaluate different strategies to apply post-processing in model chain approaches with a specific focus on solar energy:not applying any post-processing at all;post-processing only the irradiance predictions before the conversion;post-processing only the solar power predictions obtained from the model chain;or applying post-processing in both steps.In a case study based on a benchmark dataset for the Jacumba solar plant in the U.S.,we develop statistical and machine learning methods for postprocessing ensemble predictions of global horizontal irradiance(GHI)and solar power generation.Further,we propose a neural-network-based model for direct solar power forecasting that bypasses the model chain.Our results indicate that postprocessing substantially improves the solar power generation forecasts,in particular when post-processing is applied to the power predictions.The machine learning methods for post-processing slightly outperform the statistical methods,and the direct forecasting approach performs comparably to the post-processing strategies.
基金supported by the National Natural Science Foundation of China(Grant Nos.12388101,12372288,U23A2069,and 92152301).
文摘Machine learning(ML)techniques have emerged as powerful tools for improving the predictive capabilities of Reynolds-averaged Navier-Stokes(RANS)turbulence models in separated flows.This improvement is achieved by leveraging complex ML models,such as those developed using field inversion and machine learning(FIML),to dynamically adjust the constants within the baseline RANS model.However,the ML models often overlook the fundamental calibrations of the RANS turbulence model.Consequently,the basic calibration of the baseline RANS model is disrupted,leading to a degradation in the accuracy,particularly in basic wall-attached flows outside of the training set.To address this issue,a modified version of the Spalart-Allmaras(SA)turbulence model,known as Rubber-band SA(RBSA),has been proposed recently.This modification involves identifying and embedding constraints related to basic wall-attached flows directly into the model.It is shown that no matter how the parameters of the RBSA model are adjusted as constants throughout the flow field,its accuracy in wall-attached flows remains unaffected.In this paper,we propose a new constraint for the RBSA model,which better safeguards the law of wall in extreme conditions where the model parameter is adjusted dramatically.The resultant model is called the RBSA-poly model.We then show that when combined with FIML augmentation,the RBSA-poly model effectively preserves the accuracy of simple wall-attached flows,even when the adjusted parameters become functions of local flow variables rather than constants.A comparative analysis with the FIML-augmented original SA model reveals that the augmented RBSA-poly model reduces error in basic wall-attached flows by 50%while maintaining comparable accuracy in trained separated flows.These findings confirm the effectiveness of utilizing FIML in conjunction with the RBSA model,offering superior accuracy retention in cardinal flows.
基金Project(2024JJ2074) supported by the Natural Science Foundation of Hunan Province,ChinaProject(22376221) supported by the National Natural Science Foundation of ChinaProject(2023QNRC001) supported by the Young Elite Scientists Sponsorship Program by CAST,China。
文摘Driven by rapid technological advancements and economic growth,mineral extraction and metal refining have increased dramatically,generating huge volumes of tailings and mine waste(TMWs).Investigating the morphological fractions of heavy metals and metalloids(HMMs)in TMWs is key to evaluating their leaching potential into the environment;however,traditional experiments are time-consuming and labor-intensive.In this study,10 machine learning(ML)algorithms were used and compared for rapidly predicting the morphological fractions of HMMs in TMWs.A dataset comprising 2376 data points was used,with mineral composition,elemental properties,and total concentration used as inputs and concentration of morphological fraction used as output.After grid search optimization,the extra tree model performed the best,achieving coefficient of determination(R2)of 0.946 and 0.942 on the validation and test sets,respectively.Electronegativity was found to have the greatest impact on the morphological fraction.The models’performance was enhanced by applying an ensemble method to the top three optimal ML models,including gradient boosting decision tree,extra trees and categorical boosting.Overall,the proposed framework can accurately predict the concentrations of different morphological fractions of HMMs in TMWs.This approach can minimize detection time,aid in the safe management and recovery of TMWs.
文摘Objective:To analyze the risk factors of anxiety in young hypertensive patients and build a prediction model to provide a scientific basis for clinical diagnosis and treatment.Methods:According to the research content,young hypertensive patients admitted to the hospital from January 2022 to December 2024 were selected as the research object and at least 950 patients were included according to the sample size calculation.According to the existence of anxiety,950 patients were divided into control group(n=650)and observation group(n=300),and the clinical data of all patients were collected for univariate analysis and multivariate Logistic regression analysis to get the risk factors of hypertension patients complicated with anxiety in.All patients were randomly divided into a training set(n=665)and a test set(n=285)according to the ratio of 7:3,and the evaluation efficiency of different prediction models was obtained by using machine learning algorithm.To evaluate the clinical application effect of the prediction model.Results:(1)Univariate analysis showed that age,BMI,education background,marital status,smoking,drinking,sleep disorder,family history of hypertension,history of diabetes,history of hyperlipidemia,history of cerebral infarction,and TC were important risk factors for young hypertensive patients complicated with anxiety.(2)Multivariate Logistic regression analysis showed that hypertension history,drinking history,coronary heart disease history,diabetes history,BMI,TC,and TG are important independent risk factors for young hypertensive patients complicated with anxiety.(3)Extra Trees has the highest predictive power for young people with hypertension complicated with anxiety,while Decision-Tree has the lowest predictive power.Conclusion:Hypertension history,drinking history,coronary heart disease history,diabetes history,BMI,TC,and TG are important independent risk factors that affect the anxiety of young hypertensive patients.Extra Trees model has the best prediction efficiency among different groups of models.
文摘The aim of this paper is to explore the effect of geometrical parameters on ultimate load-carrying capacity of a circular hollow section(CHS)X-joint under axial compression of the brace end.First of all,finite element(FE)model to calculate ultimate load-carrying capacity of the CHS X-joint subjected to uniaxial load of the brace is constructed,and the calculated load–displacement curves are compared to the experimental ones.After validation of the FE model,46080 groups of FE calculation models with different geometrical parameters are generated by means of parametric modeling.Subsequently,eight variables including gusset thickness and chord thickness are set as input to predict load-carrying capacity of the CHS X-joint by four machine learning(ML)algorithms,i.e.,Generalized Regression Neural Network,Support Vector Machine,random forest(RF),and Extreme Gradient Boosting(XGBoost).Finally,the constructed ML prediction models are interpreted by SHapley Additive exPlanations,to explore the impact weight of each factor on ultimate load-carrying capacity of the joint.The results show that all the four models can predict the load-carrying capacity of the subject accurately,with all the R2 values greater than 0.97.In addition,RF model yields the minimum mean-square error,Root Mean Squared Error,Mean Absolute Error,and Mean Absolute Percentage Error values,and the greatest R2 value,while the prediction accuracy of XGBoost is relatively worse.Among all the eight considered geometrical parameters,brace diameter has the strongest impact on load-carrying capacity of the joint,followed by chord thickness,chord ring width,chord ring thickness,brace ring width,and brace thickness,while the thicknesses of the gusset plate and brace have marginal influence on load-carrying capacity.The study of the current paper can provide guidelines for dimension design of CHS X-joints.
基金Supported by the Qihuang Scholars Program in 202114th Five-Year National Key R&D Program Project:2022YFC3500504。
文摘Objective:To construct a clinical prediction model of acupuncture treatment for diminished ovarian reserve(DOR)based on a machine learning algorithm to provide a clinical prediction of acupuncture for ameliorating pregnancy outcomes in DOR.Methods:We enrolled 377 DOR patients treated with acupuncture and with records of pregnancy outcomes(139 cases of pregnancy and 238 cases failed)exported from the International Patient Registry Platform of Acupuncture-moxibustion(IPRPAM).The predictive variables were determined using Spearman’s correlation analysis and feature engineering methods.The model was constructed by adopting logistic regression,naïve Bayes,random forest,support vector machine,extreme gradient boosting,the knearest neighbor algorithm,linear discriminant analysis,and neural network methods.The models were validated by the area under the curve(AUC),accuracy(ACC),and importance sequencing,and individual pregnancy prediction was conducted for the best-performing model.Results:The key factors determining pregnancy after acupuncture in patients with DOR were age,luteinizing hormone(LH)level after treatment,follicle-stimulating hormone(FSH)level after treatment,the ratio of FSH to LH(FSH/LH)after treatment,and history of acupuncture treatment.Random forest model ACC was 0.95,Fβwas 0.93,Logloss was 0.30,Logloss value was the lowest,the model variables exhibited the highest accuracy and precision.Conclusion:The random forest model for the effects of acupuncture on pregnancy outcomes in patients with DOR,constructed based on the IPRPAM,presents a favorable value for clinical application.
基金Supported by National Institute For Health and Care Research,No.NIHR302632.
文摘BACKGROUND Relieving pain is central to the early management of knee osteoarthritis,with a plethora of pharmacological agents licensed for this purpose.Intra-articular corticosteroid injections are a widely used option,albeit with variable efficacy.AIM To develop a machine learning(ML)model that predicts which patients will benefit from corticosteroid injections.METHODS Data from two prospective cohort studies[Osteoarthritis(OA)Initiative and Multicentre OA Study]was combined.The primary outcome was patientreported pain score following corticosteroid injection,assessed using the Western Ontario and McMaster Universities OA pain scale,with significant change defined using minimally clinically important difference and meaningful within person change.A ML algorithm was developed,utilizing linear discriminant analysis,to predict symptomatic improvement,and examine the association between pain scores and patient factors by calculating the sensitivity,specificity,positive predictive value,negative predictive value,accuracy,and F2 score.RESULTS A total of 330 patients were included,with a mean age of 63.4(SD:8.3).The mean Western Ontario and McMaster Universities OA pain score was 5.2(SD:4.1),with only 25.5%of patients achieving significant improvement in pain following corticosteroid injection.The ML model generated an accuracy of 67.8%(95%confidence interval:64.6%-70.9%),F1 score of 30.8%,and an area under the curve score of 0.60.CONCLUSION The model demonstrated feasibility to assist clinicians with decision-making in patient selection for corticosteroid injections.Further studies are required to improve the model prior to testing in clinical settings.
文摘Assessing the stability of slopes is one of the crucial tasks of geotechnical engineering for assessing and managing risks related to natural hazards,directly affecting safety and sustainable development.This study primarily focuses on developing robust and practical hybrid models to predict the slope stability status of circular failure mode.For this purpose,three robust models were developed using a database including 627 case histories of slope stability status.The models were developed using the random forest(RF),support vector machine(SVM),and extreme gradient boosting(XGB)techniques,employing 5-fold cross validation approach.To enhance the performance of models,this study employs Bayesian optimizer(BO)to fine-tuning their hyperparameters.The results indicate that the performance order of the three developed models is RF-BO>SVM-BO>XGB-BO.Furthermore,comparing the developed models with previous models,it was found that the RF-BO model can effectively determine the slope stability status with outstanding performance.This implies that the RF-BO model could serve as a dependable tool for project managers,assisting in the evaluation of slope stability during both the design and operational phases of projects,despite the inherent challenges in this domain.The results regarding the importance of influencing parameters indicate that cohesion,friction angle,and slope height exert the most significant impact on slope stability status.This suggests that concentrating on these parameters and employing the RF-BO model can effectively mitigate the severity of geohazards in the short-term and contribute to the attainment of long-term sustainable development objectives.
基金supported by the National Natural Science Foundation of China(NSFC)Excellent Research Group Program for“Multiscale Problems in Nonlinear Mechanics”(Grant No.12588201)。
文摘The inelastic behavior of thermoplastic polymers may involve shearing and crazing,and both depend on temperature and strain rate.Traditional constitutive models account for temperature and strain rate through phenomenological or empirical formulas.In this study,we present a physics-guided machine learning(ML)framework to model shear and craze in polymeric materials.The effects of all three principal stresses for the craze initiation are considered other than the maximum tensile principal stress solely in previous works.We implemented a finite element framework through a user-defined material subroutine and applied the constitutive model to the deformation in three polymers(PLA 4060D,PLA 3051D,and HIPS).The result shows that our ML-based model can predict the stress-strain and volume-strain responses at different strain rates with high accuracy.Notably,the ML-based approach needs no assumptions about yield criteria or hardening laws.This work highlights the potential of hybrid physics-ML paradigms to overcome the trade-offs between model complexity and accuracy in polymer mechanics,paving the way for computationally efficient and generalizable constitutive models for thermoplastic materials.
文摘BACKGROUND Colorectal polyps are precancerous diseases of colorectal cancer.Early detection and resection of colorectal polyps can effectively reduce the mortality of colorectal cancer.Endoscopic mucosal resection(EMR)is a common polypectomy proce-dure in clinical practice,but it has a high postoperative recurrence rate.Currently,there is no predictive model for the recurrence of colorectal polyps after EMR.AIM To construct and validate a machine learning(ML)model for predicting the risk of colorectal polyp recurrence one year after EMR.METHODS This study retrospectively collected data from 1694 patients at three medical centers in Xuzhou.Additionally,a total of 166 patients were collected to form a prospective validation set.Feature variable screening was conducted using uni-variate and multivariate logistic regression analyses,and five ML algorithms were used to construct the predictive models.The optimal models were evaluated based on different performance metrics.Decision curve analysis(DCA)and SHapley Additive exPlanation(SHAP)analysis were performed to assess clinical applicability and predictor importance.RESULTS Multivariate logistic regression analysis identified 8 independent risk factors for colorectal polyp recurrence one year after EMR(P<0.05).Among the models,eXtreme Gradient Boosting(XGBoost)demonstrated the highest area under the curve(AUC)in the training set,internal validation set,and prospective validation set,with AUCs of 0.909(95%CI:0.89-0.92),0.921(95%CI:0.90-0.94),and 0.963(95%CI:0.94-0.99),respectively.DCA indicated favorable clinical utility for the XGBoost model.SHAP analysis identified smoking history,family history,and age as the top three most important predictors in the model.CONCLUSION The XGBoost model has the best predictive performance and can assist clinicians in providing individualized colonoscopy follow-up recommendations.
文摘The investigation by Zhu et al on the assessment of cellular proliferation markers to assist clinical decision-making in patients with hepatocellular carcinoma(HCC)using a machine learning model-based approach is a scientific approach.This study looked into the possibilities of using a Ki-67(a marker for cell proliferation)expression-based machine learning model to help doctors make decisions about treatment options for patients with HCC before surgery.The study used reconstructed tomography images of 164 patients with confirmed HCC from the intratumoral and peritumoral regions.The features were chosen using various statistical methods,including least absolute shrinkage and selection operator regression.Also,a nomogram was made using Radscore and clinical risk factors.It was tested for its ability to predict receiver operating characteristic curves and calibration curves,and its clinical benefits were found using decision curve analysis.The calibration curve demonstrated excellent consistency between predicted and actual probability,and the decision curve confirmed its clinical benefit.The proposed model is helpful for treating patients with HCC because the predicted and actual probabilities are very close to each other,as shown by the decision curve analysis.Further prospective studies are required,incorporating a multicenter and large sample size design,additional relevant exclusion criteria,information on tumors(size,number,and grade),and cancer stage to strengthen the clinical benefit in patients with HCC.
文摘The objective of this study is to develop an advanced approach to variogram modelling by integrating genetic algorithms(GA)with machine learning-based linear regression,aiming to improve the accuracy and efficiency of geostatistical analysis,particularly in mineral exploration.The study combines GA and machine learning to optimise variogram parameters,including range,sill,and nugget,by minimising the root mean square error(RMSE)and maximising the coefficient of determination(R^(2)).The experimental variograms were computed and modelled using theoretical models,followed by optimisation via evolutionary algorithms.The method was applied to gravity data from the Ngoura-Batouri-Kette mining district in Eastern Cameroon,covering 141 data points.Sequential Gaussian Simulations(SGS)were employed for predictive mapping to validate simulated results against true values.Key findings show variograms with ranges between 24.71 km and 49.77 km,opti-mised RMSE and R^(2) values of 11.21 mGal^(2) and 0.969,respectively,after 42 generations of GA optimisation.Predictive mapping using SGS demonstrated that simulated values closely matched true values,with the simu-lated mean at 21.75 mGal compared to the true mean of 25.16 mGal,and variances of 465.70 mGal^(2) and 555.28 mGal^(2),respectively.The results confirmed spatial variability and anisotropies in the N170-N210 directions,consistent with prior studies.This work presents a novel integration of GA and machine learning for variogram modelling,offering an automated,efficient approach to parameter estimation.The methodology significantly enhances predictive geostatistical models,contributing to the advancement of mineral exploration and improving the precision and speed of decision-making in the petroleum and mining industries.
基金supported by the National Key Research and Development Program of China(Grant No.2023YFC3209504)the National Natural Science Foundation of China(Grants No.U2040215 and 52479075)the Natural Science Foundation of Hubei Province(Grant No.2021CFA029).
文摘The backwater effect caused by tributary inflow can significantly elevate the water level profile upstream of a confluence point.However,the influence of mainstream and confluence discharges on the backwater effect in a river reach remains unclear.In this study,various hydrological data collected from the Jingjiang Reach of the Yangtze River in China were statistically analyzed to determine the backwater degree and range with three representative mainstream discharges.The results indicated that the backwater degree increased with mainstream discharge,and a positive relationship was observed between the runoff ratio and backwater degree at specific representative mainstream discharges.Following the operation of the Three Gorges Project,the backwater effect in the Jingjiang Reach diminished.For instance,mean backwater degrees for low,moderate,and high mainstream discharges were recorded as 0.83 m,1.61 m,and 2.41 m during the period from 1990 to 2002,whereas these values decreased to 0.30 m,0.95 m,and 2.08 m from 2009 to 2020.The backwater range extended upstream as mainstream discharge increased from 7000 m3/s to 30000 m3/s.Moreover,a random forest-based machine learning model was used to quantify the backwater effect with varying mainstream and confluence discharges,accounting for the impacts of mainstream discharge,confluence discharge,and channel degradation in the Jingjiang Reach.At the Jianli Hydrological Station,a decrease in mainstream discharge during flood seasons resulted in a 7%–15%increase in monthly mean backwater degree,while an increase in mainstream discharge during dry seasons led to a 1%–15%decrease in monthly mean backwater degree.Furthermore,increasing confluence discharge from Dongting Lake during June to July and September to November resulted in an 11%–42%increase in monthly mean backwater degree.Continuous channel degradation in the Jingjiang Reach contributed to a 6%–19%decrease in monthly mean backwater degree.Under the influence of these factors,the monthly mean backwater degree in 2017 varied from a decrease of 53%to an increase of 37%compared to corresponding values in 1991.
文摘The authors regret that the original publication of this paper did not include Jawad Fayaz as a co-author.After further discussions and a thorough review of the research contributions,it was agreed that his significant contributions to the foundational aspects of the research warranted recognition,and he has now been added as a co-author.
基金supported by the Natural Science Foundation of Jiangsu province,China(BK20240937)the Belt and Road Special Foundation of the National Key Laboratory of Water Disaster Prevention(2022491411,2021491811)the Basal Research Fund of Central Public Welfare Scientific Institution of Nanjing Hydraulic Research Institute(Y223006).
文摘Understanding spatial heterogeneity in groundwater responses to multiple factors is critical for water resource management in coastal cities.Daily groundwater depth(GWD)data from 43 wells(2018-2022)were collected in three coastal cities in Jiangsu Province,China.Seasonal and Trend decomposition using Loess(STL)together with wavelet analysis and empirical mode decomposition were applied to identify tide-influenced wells while remaining wells were grouped by hierarchical clustering analysis(HCA).Machine learning models were developed to predict GWD,then their response to natural conditions and human activities was assessed by the Shapley Additive exPlanations(SHAP)method.Results showed that eXtreme Gradient Boosting(XGB)was superior to other models in terms of prediction performance and computational efficiency(R^(2)>0.95).GWD in Yancheng and southern Lianyungang were greater than those in Nantong,exhibiting larger fluctuations.Groundwater within 5 km of the coastline was affected by tides,with more pronounced effects in agricultural areas compared to urban areas.Shallow groundwater(3-7 m depth)responded immediately(0-1 day)to rainfall,primarily influenced by farmland and topography(slope and distance from rivers).Rainfall recharge to groundwater peaked at 50%farmland coverage,but this effect was suppressed by high temperatures(>30℃)which intensified as distance from rivers increased,especially in forest and grassland.Deep groundwater(>10 m)showed delayed responses to rainfall(1-4 days)and temperature(10-15 days),with GDP as the primary influence,followed by agricultural irrigation and population density.Farmland helped to maintain stable GWD in low population density regions,while excessive farmland coverage(>90%)led to overexploitation.In the early stages of GDP development,increased industrial and agricultural water demand led to GWD decline,but as GDP levels significantly improved,groundwater consumption pressure gradually eased.This methodological framework is applicable not only to coastal cities in China but also could be extended to coastal regions worldwide.