Landslides pose a formidable natural hazard across the Qinghai-Tibet Plateau(QTP),endangering both ecosystems and human life.Identifying the driving factors behind landslides and accurately assessing susceptibility ar...Landslides pose a formidable natural hazard across the Qinghai-Tibet Plateau(QTP),endangering both ecosystems and human life.Identifying the driving factors behind landslides and accurately assessing susceptibility are key to mitigating disaster risk.This study integrated multi-source historical landslide data with 15 predictive factors and used several machine learning models—Random Forest(RF),Gradient Boosting Regression Trees(GBRT),Extreme Gradient Boosting(XGBoost),and Categorical Boosting(CatBoost)—to generate susceptibility maps.The Shapley additive explanation(SHAP)method was applied to quantify factor importance and explore their nonlinear effects.The results showed that:(1)CatBoost was the best-performing model(CA=0.938,AUC=0.980)in assessing landslide susceptibility,with altitude emerging as the most significant factor,followed by distance to roads and earthquake sites,precipitation,and slope;(2)the SHAP method revealed critical nonlinear thresholds,demonstrating that historical landslides were concentrated at mid-altitudes(1400-4000 m)and decreased markedly above 4000 m,with a parallel reduction in probability beyond 700 m from roads;and(3)landslide-prone areas,comprising 13%of the QTP,were concentrated in the southeastern and northeastern parts of the plateau.By integrating machine learning and SHAP analysis,this study revealed landslide hazard-prone areas and their driving factors,providing insights to support disaster management strategies and sustainable regional planning.展开更多
Urban rainwater runoff is an important source of nonpoint source pollution due to its transport of diverse contaminants,including polycyclic aromatic hydrocarbons(PAHs)and chlorinated derivatives.Importantly,these chl...Urban rainwater runoff is an important source of nonpoint source pollution due to its transport of diverse contaminants,including polycyclic aromatic hydrocarbons(PAHs)and chlorinated derivatives.Importantly,these chlorinated polycyclic aromatic hydrocarbons(Cl-PAHs)exhibit elevated toxicological potential compared to their non-halogenated parent compounds.In this study,we proposed an approach that combined multivariate receptor model with integration of SHapley Additive exPlanations and Random Forest model.This method identifies the possible sources and reveals the impact of source apportionment results and environmental driving factors(such as geographical and meteorological data)on pollutant concentrations.Sixteen PAHs and nine ClPAHs were detected in 79 runoff samples from all three sites.TheΣ_(16)PAHs average concentration(2923.93 to 6071.83 ng/L)was significantly higher than theΣ_(9)Cl-PAHs(384.34 to 1314.73 ng/L).The source apportionment was conducted by positive matrix factorization(PMF),and six potential pollution sources for PAHs and three for Cl-PAHs were quantified.PAHs primarily originate from the combustion of fossil fuels such as traffic,industrial emissions and coal tar,while Cl-PAHs are mainly derived from atmospheric deposition and industrial emissions.Meanwhile,the self‑organizing map classified PAHs and Cl-PAHs into 2 and 3 groups,respectively.The k-means algorithm yielded 4 clusters for runoff samples.Among machine learning models,Random Forest(RF)demonstrated optimal predictive performance and integrated with SHapley Additive exPlanations(RF-SHAP)revealed the effects of driving factors on the predicted concentration of PAHs and Cl-PAHs in urban runoff samples.展开更多
Objective Accurately identifying the key influencing factors of psychological birth trauma in primiparous women is crucial for implementing effective preventive and intervention measures.This study aimed to develop an...Objective Accurately identifying the key influencing factors of psychological birth trauma in primiparous women is crucial for implementing effective preventive and intervention measures.This study aimed to develop and validate an interpretable machine learning prediction model for identifying the key influencing factors of psychological birth trauma in primiparous women.Methods A multicenter cross-sectional study was conducted on primiparous women in four tertiary hospitals in Sichuan Province,southwestern China,from December 2023 to March 2024.The Childbirth Trauma Index was used in assessing psychological birth trauma in primiparous women.Data were collected and randomly divided into a training set(80%,n=289)and a testing set(20%,n=73).Six different machine learning models were trained and tested.Training and prediction were conducted using six machine learning models included Linear Regression,Support Vector Regression,Multilayer Perceptron Regression,eXtreme Gradient Boosting Regression,Random Forest Regression,and Adaptive Boosting Regression.The optimal model was selected based on various performance metrics,and its predictive results were interpreted using SHapley Additive exPlanations(SHAP)and accumulated local effects(ALE).Results Among the six machine learning models,the Multilayer Perceptron Regression model exhibited the best overall performance in the testing set(MAE=3.977,MSE=24.832,R2=0.507,EVS=0.524,RMSE=4.983).In the testing set,the R2 and EVS of the Multilayer Perceptron Regression model increased by 8.3%and 1.2%,respectively,compared to the traditional linear regression model.Meanwhile,the MAE,MSE,and RMSE decreased by 0.4%,7.3%,and 3.7%,respectively,compared to the traditional linear regression model.The SHAP analysis indicated that intrapartum pain,anxiety,postpartum pain,resilience,and planned pregnancy are the most critical influencing factors of psychological birth trauma in primiparous women.The ALE analysis indicated that higher intrapartum pain,anxiety,and postpartum pain scores are risk factors,while higher resilience scores are protective factors.Conclusions Interpretable machine learning prediction models can identify the key influencing factors of psychological birth trauma in primiparous women.SHAP and ALE analyses based on the Multilayer Perceptron Regression model can help healthcare providers understand the complex decision-making logic within a prediction model.This study provides a scientific basis for the early prevention and personalized intervention of psychological birth trauma in primiparous women.展开更多
Research on the application of machine learning(ML)models to landslide susceptibility assessments has gained popularity in recent years,with a focus primarily on topographic factors derived from digital elevation mode...Research on the application of machine learning(ML)models to landslide susceptibility assessments has gained popularity in recent years,with a focus primarily on topographic factors derived from digital elevation models(DEMs).However,few studies have focused on the explanatory effects of these factors on different models,i.e.whether DEM-based factors affect different models in the same way.This study investigated whether different ML models could yield consistent interpretations of DEM-based factors using explanatory algorithms.Six ML models,including a support vector machine,a neural network,extreme gradient boosting,a random forest,linear regression,and K-nearest neighbors,were trained and evaluated on five geospatial datasets derived from different DEMs.Each dataset contained eight DEM-based and six non-DEM-based factors from 8912 landslide samples.Model performance was assessed using accuracy,precision,recall rate,F1-score,kappa coefficient,and receiver operating characteristic curves.Explanatory analyses,including Shapley additive explanations and partial dependence plots,were also employed to investigate the effects of topographic factors on landslide susceptibility.The results indicate that DEM-based factors consistently influenced different ML models across the datasets.Furthermore,tree-based models outperformed the other models in almost all datasets,while the most suitable DEMs were obtained from Copernicus and TanDEM-X.In addition,the concave surface without potholes on steep slopes are ideal topographic conditions for landslide formation in the study area.This study can benefit the wider landslide research community by clarifying how topographic factors affect ML models.展开更多
Ferroptosis,a type of cell death that mainly involves iron metabolism imbalance and lipid peroxidation,is strongly correlated with the phagocytic response caused by bleeding after spinal cord injury.Thus,in this study...Ferroptosis,a type of cell death that mainly involves iron metabolism imbalance and lipid peroxidation,is strongly correlated with the phagocytic response caused by bleeding after spinal cord injury.Thus,in this study,bulk RNA sequencing data(GSE47681 and GSE5296)and single-cell RNA sequencing data(GSE162610)were acquired from gene expression databases.We then conducted differential analysis and immune infiltration analysis.Atf3 and Piezo1 were identified as key ferroptosis genes through random forest and least absolute shrinkage and selection operator algorithms.Further analysis of single-cell RNA sequencing data revealed a close relationship between ferroptosis and cell types such as macrophages/microglia and their intrinsic state transition processes.Differences in transcription factor regulation and intercellular communication networks were found in ferroptosis-related cells,confirming the high expression of Atf3 and Piezo1 in these cells.Molecular docking analysis confirmed that the proteins encoded by these genes can bind cycloheximide.In a mouse model of T8 spinal cord injury,low-dose cycloheximide treatment was found to improve neurological function,decrease levels of the pro-inflammatory cytokine inducible nitric oxide synthase,and increase levels of the anti-inflammatory cytokine arginase 1.Correspondingly,the expression of the ferroptosis-related gene Gpx4 increased in macrophages/microglia,while the expression of Acsl4 decreased.Our findings reveal the important role of ferroptosis in the treatment of spinal cord injury,identify the key cell types and genes involved in ferroptosis after spinal cord injury,and validate the efficacy of potential drug therapies,pointing to new directions in the treatment of spinal cord injury.展开更多
Online interactive learning plays a crucial role in improving online education quality.This grounded theory study examines:(1)what key factors shape EFL learners’online interactive learning,(2)how these factors form ...Online interactive learning plays a crucial role in improving online education quality.This grounded theory study examines:(1)what key factors shape EFL learners’online interactive learning,(2)how these factors form an empirically validated model,and(3)how they interact within this model,through systematic analysis of 9,207 discussion forum posts from a Chinese University MOOC platform.Results demonstrate that learning drive,course structure,teaching competence,interaction behavior,expected outcomes,and online learning context significantly influence EFL online interactive learning.The analysis reveals two key mechanisms:expected outcomes mediate the effects of learning drive(β=0.45),course structure,teaching competence,and interaction behavior(β=0.35)on learning outcomes,while online learning context moderates these relationships(β=0.25).Specifically,learning drive provides intrinsic/extrinsic motivation,whereas course structure,teaching competence,interaction behavior,and expected outcomes collectively enhance interaction quality and sustainability.These findings,derived through rigorous grounded theory methodology involving open,axial,and selective coding of large-scale interaction data,yield three key contributions:(1)a comprehensive theoretical model of EFL online learning dynamics,(2)empirical validation of mediation/moderation mechanisms,and(3)practical strategies for designing scaffolded interaction protocols and adaptive feedback systems.The study establishes that its theoretically saturated model(achieved after analyzing 7,366 posts with 1,841 verification cases)offers educators evidence-based approaches to optimize collaborative interaction in digital EFL environments.展开更多
BACKGROUND Pancreatic fistula is the most common complication of pancreatic surgeries that causes more serious conditions,including bleeding due to visceral vessel erosion and peritonitis.AIM To develop a machine lear...BACKGROUND Pancreatic fistula is the most common complication of pancreatic surgeries that causes more serious conditions,including bleeding due to visceral vessel erosion and peritonitis.AIM To develop a machine learning(ML)model for postoperative pancreatic fistula and identify significant risk factors of the complication.METHODS A single-center retrospective clinical study was conducted which included 150 patients,who underwent pancreat-oduodenectomy.Logistic regression,random forest,and CatBoost were employed for modeling the biochemical leak(symptomless fistula)and fistula grade B/C(clinically significant complication).The performance was estimated by receiver operating characteristic(ROC)area under the curve(AUC)after 5-fold cross-validation(20%testing and 80%training data).The risk factors were evaluated with the most accurate algorithm,based on the parameter“Importance”(Im),and Kendall correlation,P<0.05.RESULTS The CatBoost algorithm was the most accurate with an AUC of 74%-86%.The study provided results of ML-based modeling and algorithm selection for pancreatic fistula prediction and risk factor evaluation.From 14 parameters we selected the main pre-and intraoperative prognostic factors of all the fistulas:Tumor vascular invasion(Im=24.8%),age(Im=18.6%),and body mass index(Im=16.4%),AUC=74%.The ML model showed that biochemical leak,blood and drain amylase level(Im=21.6%and 16.4%),and blood leukocytes(Im=11.2%)were crucial predictors for subsequent fistula B/C,AUC=86%.Surgical techniques,morphology,and pancreatic duct diameter less than 3 mm were insignificant(Im<5%and no correlations detected).The results were confirmed by correlation analysis.CONCLUSION This study highlights the key predictors of postoperative pancreatic fistula and establishes a robust ML-based model for individualized risk prediction.These findings contribute to the advancement of personalized periop-erative care and may guide targeted preventive strategies.展开更多
The new curriculum standard points out that affection is one of the most important goals of fundamental education. The non-target language environment is easier to cause the affective change of middle school students ...The new curriculum standard points out that affection is one of the most important goals of fundamental education. The non-target language environment is easier to cause the affective change of middle school students who are changeable in their affective state. Based on the affective filter hypothesis, this paper deals with the adjustment to affective factors in English learning by using Internet English Curriculum Resource, such as attitude and motivation, anxiety and inhibition, self-esteem and self-confidence. At last, some suggestions are offered to judge Internet English Curriculum Resource.展开更多
This article aims at the discussion of cultural interferences in foreign language teaching and learning for Chinese non-English majors. It also calls for more attention from teachers, textbook compilers, and students ...This article aims at the discussion of cultural interferences in foreign language teaching and learning for Chinese non-English majors. It also calls for more attention from teachers, textbook compilers, and students themselves.展开更多
Objective To study the self-consciousness of children with learning disabilities (LD) and to identify related factors. Methods Five hundred and sixty pupils graded from 1 to 6 in an elementary school were investigated...Objective To study the self-consciousness of children with learning disabilities (LD) and to identify related factors. Methods Five hundred and sixty pupils graded from 1 to 6 in an elementary school were investigated. According to the pupil rating scale revised screening for learning disabilities (PRS), combined Raven’s test (CRT) and achievement of main courses, 35 of 560 pupils were diagnosed as LD children. Thirty-five children were selected from the average children and 35 from advanced children in academic achievement equally matched in class, gender, and age with LD children as control groups. The three groups were tested by Piers-Harris children’s self-concept scale. Basic information of each subject was collected by self-made questionnaire. Results Compared with the average and advanced children, LD children got significantly lower scores in self-concept scale. Based on logistic regression analysis, 3 factors were identified, including family income per month, single child and delivery model. Conclusion The results suggest that self-consciousness of children with LD is lower than that of normal children.展开更多
BACKGROUND Colorectal polyps are precancerous diseases of colorectal cancer.Early detection and resection of colorectal polyps can effectively reduce the mortality of colorectal cancer.Endoscopic mucosal resection(EMR...BACKGROUND Colorectal polyps are precancerous diseases of colorectal cancer.Early detection and resection of colorectal polyps can effectively reduce the mortality of colorectal cancer.Endoscopic mucosal resection(EMR)is a common polypectomy proce-dure in clinical practice,but it has a high postoperative recurrence rate.Currently,there is no predictive model for the recurrence of colorectal polyps after EMR.AIM To construct and validate a machine learning(ML)model for predicting the risk of colorectal polyp recurrence one year after EMR.METHODS This study retrospectively collected data from 1694 patients at three medical centers in Xuzhou.Additionally,a total of 166 patients were collected to form a prospective validation set.Feature variable screening was conducted using uni-variate and multivariate logistic regression analyses,and five ML algorithms were used to construct the predictive models.The optimal models were evaluated based on different performance metrics.Decision curve analysis(DCA)and SHapley Additive exPlanation(SHAP)analysis were performed to assess clinical applicability and predictor importance.RESULTS Multivariate logistic regression analysis identified 8 independent risk factors for colorectal polyp recurrence one year after EMR(P<0.05).Among the models,eXtreme Gradient Boosting(XGBoost)demonstrated the highest area under the curve(AUC)in the training set,internal validation set,and prospective validation set,with AUCs of 0.909(95%CI:0.89-0.92),0.921(95%CI:0.90-0.94),and 0.963(95%CI:0.94-0.99),respectively.DCA indicated favorable clinical utility for the XGBoost model.SHAP analysis identified smoking history,family history,and age as the top three most important predictors in the model.CONCLUSION The XGBoost model has the best predictive performance and can assist clinicians in providing individualized colonoscopy follow-up recommendations.展开更多
BACKGROUND Intensive care unit-acquired weakness(ICU-AW)is a common complication that significantly impacts the patient's recovery process,even leading to adverse outcomes.Currently,there is a lack of effective pr...BACKGROUND Intensive care unit-acquired weakness(ICU-AW)is a common complication that significantly impacts the patient's recovery process,even leading to adverse outcomes.Currently,there is a lack of effective preventive measures.AIM To identify significant risk factors for ICU-AW through iterative machine learning techniques and offer recommendations for its prevention and treatment.METHODS Patients were categorized into ICU-AW and non-ICU-AW groups on the 14th day post-ICU admission.Relevant data from the initial 14 d of ICU stay,such as age,comorbidities,sedative dosage,vasopressor dosage,duration of mechanical ventilation,length of ICU stay,and rehabilitation therapy,were gathered.The relationships between these variables and ICU-AW were examined.Utilizing iterative machine learning techniques,a multilayer perceptron neural network model was developed,and its predictive performance for ICU-AW was assessed using the receiver operating characteristic curve.RESULTS Within the ICU-AW group,age,duration of mechanical ventilation,lorazepam dosage,adrenaline dosage,and length of ICU stay were significantly higher than in the non-ICU-AW group.Additionally,sepsis,multiple organ dysfunction syndrome,hypoalbuminemia,acute heart failure,respiratory failure,acute kidney injury,anemia,stress-related gastrointestinal bleeding,shock,hypertension,coronary artery disease,malignant tumors,and rehabilitation therapy ratios were significantly higher in the ICU-AW group,demonstrating statistical significance.The most influential factors contributing to ICU-AW were identified as the length of ICU stay(100.0%)and the duration of mechanical ventilation(54.9%).The neural network model predicted ICU-AW with an area under the curve of 0.941,sensitivity of 92.2%,and specificity of 82.7%.CONCLUSION The main factors influencing ICU-AW are the length of ICU stay and the duration of mechanical ventilation.A primary preventive strategy,when feasible,involves minimizing both ICU stay and mechanical ventilation duration.展开更多
This investigation assessed the efficacy of 10 widely used machine learning algorithms(MLA)comprising the least absolute shrinkage and selection operator(LASSO),generalized linear model(GLM),stepwise generalized linea...This investigation assessed the efficacy of 10 widely used machine learning algorithms(MLA)comprising the least absolute shrinkage and selection operator(LASSO),generalized linear model(GLM),stepwise generalized linear model(SGLM),elastic net(ENET),partial least square(PLS),ridge regression,support vector machine(SVM),classification and regression trees(CART),bagged CART,and random forest(RF)for gully erosion susceptibility mapping(GESM)in Iran.The location of 462 previously existing gully erosion sites were mapped through widespread field investigations,of which 70%(323)and 30%(139)of observations were arbitrarily divided for algorithm calibration and validation.Twelve controlling factors for gully erosion,namely,soil texture,annual mean rainfall,digital elevation model(DEM),drainage density,slope,lithology,topographic wetness index(TWI),distance from rivers,aspect,distance from roads,plan curvature,and profile curvature were ranked in terms of their importance using each MLA.The MLA were compared using a training dataset for gully erosion and statistical measures such as RMSE(root mean square error),MAE(mean absolute error),and R-squared.Based on the comparisons among MLA,the RF algorithm exhibited the minimum RMSE and MAE and the maximum value of R-squared,and was therefore selected as the best model.The variable importance evaluation using the RF model revealed that distance from rivers had the highest significance in influencing the occurrence of gully erosion whereas plan curvature had the least importance.According to the GESM generated using RF,most of the study area is predicted to have a low(53.72%)or moderate(29.65%)susceptibility to gully erosion,whereas only a small area is identified to have a high(12.56%)or very high(4.07%)susceptibility.The outcome generated by RF model is validated using the ROC(Receiver Operating Characteristics)curve approach,which returned an area under the curve(AUC)of 0.985,proving the excellent forecasting ability of the model.The GESM prepared using the RF algorithm can aid decision-makers in targeting remedial actions for minimizing the damage caused by gully erosion.展开更多
Objective To investigate prevalence rate of learning disabilities (LD) in Chinese children, and to explore related risk factors, and to provide theoretical basis for preventing such disabilities. Methods One thousand ...Objective To investigate prevalence rate of learning disabilities (LD) in Chinese children, and to explore related risk factors, and to provide theoretical basis for preventing such disabilities. Methods One thousand and one hundred fifty one children were randomly selected in primary schools. According to criteria set by ICD-10, 118 children diagnosed as LD were classified into the study group. Four hundred and ninety one children were classified into the normal control group. Five hundred and forty two children were classified into the excellent control group. The study instruments included PRS (The pupil rating scale revised screening for learning disabilities), Conners' children behavior check-list taken by parents and YG-WR character check-list. Results The prevalence rate of LD in Chinese children was 10.3%. Significant differences were observed between LD and normally learning children, and between the LD group and the excellent group, in terms of scores of Conners' behavior check-list (P<0.05). The study further showed that individual differences in character between the LD group and the control groups still existed even after controlling individual differences in age, IQ, and gender. Some possible causal explanations contributing to LD were improper teaching by parents, low educational level of the parents, and children's characteristics and social relationships. Conclusion These data underscore the fact that LD is a serious national public health problem in China. LD is resulted from a number of factors. Good studying and living environments should be created for LD children.展开更多
To perform landslide susceptibility prediction(LSP),it is important to select appropriate mapping unit and landslide-related conditioning factors.The efficient and automatic multi-scale segmentation(MSS)method propose...To perform landslide susceptibility prediction(LSP),it is important to select appropriate mapping unit and landslide-related conditioning factors.The efficient and automatic multi-scale segmentation(MSS)method proposed by the authors promotes the application of slope units.However,LSP modeling based on these slope units has not been performed.Moreover,the heterogeneity of conditioning factors in slope units is neglected,leading to incomplete input variables of LSP modeling.In this study,the slope units extracted by the MSS method are used to construct LSP modeling,and the heterogeneity of conditioning factors is represented by the internal variations of conditioning factors within slope unit using the descriptive statistics features of mean,standard deviation and range.Thus,slope units-based machine learning models considering internal variations of conditioning factors(variant slope-machine learning)are proposed.The Chongyi County is selected as the case study and is divided into 53,055 slope units.Fifteen original slope unit-based conditioning factors are expanded to 38 slope unit-based conditioning factors through considering their internal variations.Random forest(RF)and multi-layer perceptron(MLP)machine learning models are used to construct variant Slope-RF and Slope-MLP models.Meanwhile,the Slope-RF and Slope-MLP models without considering the internal variations of conditioning factors,and conventional grid units-based machine learning(Grid-RF and MLP)models are built for comparisons through the LSP performance assessments.Results show that the variant Slopemachine learning models have higher LSP performances than Slope-machine learning models;LSP results of variant Slope-machine learning models have stronger directivity and practical application than Grid-machine learning models.It is concluded that slope units extracted by MSS method can be appropriate for LSP modeling,and the heterogeneity of conditioning factors within slope units can more comprehensively reflect the relationships between conditioning factors and landslides.The research results have important reference significance for land use and landslide prevention.展开更多
Objective:To evaluate the human epidermal growth factor receptor 2(HER2)status in patients with breast cancer using multidetector computed tomography(MDCT)-based handcrafted and deep radiomics features.Methods:This re...Objective:To evaluate the human epidermal growth factor receptor 2(HER2)status in patients with breast cancer using multidetector computed tomography(MDCT)-based handcrafted and deep radiomics features.Methods:This retrospective study enrolled 339 female patients(primary cohort,n=177;validation cohort,n=162)with pathologically confirmed invasive breast cancer.Handcrafted and deep radiomics features were extracted from the MDCT images during the arterial phase.After the feature selection procedures,handcrafted and deep radiomics signatures and the combined model were built using multivariate logistic regression analysis.Performance was assessed by measures of discrimination,calibration,and clinical usefulness in the primary cohort and validated in the validation cohort.Results:The handcrafted radiomics signature had a discriminative ability with a C-index of 0.739[95%confidence interval(95%CI):0.661-0.818]in the primary cohort and 0.695(95%CI:0.609-0.781)in the validation cohort.The deep radiomics signature also had a discriminative ability with a C-index of 0.760(95%CI:0.690-0.831)in the primary cohort and 0.777(95%CI:0.696-0.857)in the validation cohort.The combined model,which incorporated both the handcrafted and deep radiomics signatures,showed good discriminative ability with a C-index of 0.829(95%CI:0.767-0.890)in the primary cohort and 0.809(95%CI:0.740-0.879)in the validation cohort.Conclusions:Handcrafted and deep radiomics features from MDCT images were associated with HER2 status in patients with breast cancer.Thus,these features could provide complementary aid for the radiological evaluation of HER2 status in breast cancer.展开更多
BACKGROUND Ischemic heart disease(IHD)impacts the quality of life and has the highest mortality rate of cardiovascular diseases globally.AIM To compare variations in the parameters of the single-lead electrocardiogram...BACKGROUND Ischemic heart disease(IHD)impacts the quality of life and has the highest mortality rate of cardiovascular diseases globally.AIM To compare variations in the parameters of the single-lead electrocardiogram(ECG)during resting conditions and physical exertion in individuals diagnosed with IHD and those without the condition using vasodilator-induced stress computed tomography(CT)myocardial perfusion imaging as the diagnostic reference standard.METHODS This single center observational study included 80 participants.The participants were aged≥40 years and given an informed written consent to participate in the study.Both groups,G1(n=31)with and G2(n=49)without post stress induced myocardial perfusion defect,passed cardiologist consultation,anthropometric measurements,blood pressure and pulse rate measurement,echocardiography,cardio-ankle vascular index,bicycle ergometry,recording 3-min single-lead ECG(Cardio-Qvark)before and just after bicycle ergometry followed by performing CT myocardial perfusion.The LASSO regression with nested cross-validation was used to find the association between Cardio-Qvark parameters and the existence of the perfusion defect.Statistical processing was performed with the R programming language v4.2,Python v.3.10[^R],and Statistica 12 program.RESULTS Bicycle ergometry yielded an area under the receiver operating characteristic curve of 50.7%[95%confidence interval(CI):0.388-0.625],specificity of 53.1%(95%CI:0.392-0.673),and sensitivity of 48.4%(95%CI:0.306-0.657).In contrast,the Cardio-Qvark test performed notably better with an area under the receiver operating characteristic curve of 67%(95%CI:0.530-0.801),specificity of 75.5%(95%CI:0.628-0.88),and sensitivity of 51.6%(95%CI:0.333-0.695).CONCLUSION The single-lead ECG has a relatively higher diagnostic accuracy compared with bicycle ergometry by using machine learning models,but the difference was not statistically significant.However,further investigations are required to uncover the hidden capabilities of single-lead ECG in IHD diagnosis.展开更多
BACKGROUND Colorectal cancer is a common digestive malignancy,and chemotherapy remains a cornerstone of treatment.Myelosuppression,a frequent hematologic toxicity,poses significant clinical challenges.However,no inter...BACKGROUND Colorectal cancer is a common digestive malignancy,and chemotherapy remains a cornerstone of treatment.Myelosuppression,a frequent hematologic toxicity,poses significant clinical challenges.However,no interpretable machine learning-based nomogram exists to predict chemotherapy-induced myelosuppression in colorectal cancer patients.This study aimed to develop and validate an inter-pretable clinic-machine learning nomogram integrating clinical predictors with multiple algorithms via a feature mapping algorithm.The model provides accurate risk estimation and clinical interpretability,supporting individualized prevention strategies and optimizing decision-making in patients receiving first-line chemotherapy.AIM To develop and validate an interpretable clinic-machine learning nomogram predicting chemotherapy-induced myelosuppression in colorectal cancer.METHODS This retrospective study enrolled 855 colorectal cancer patients receiving first-line chemotherapy.Data were split into training(n=612),validation(n=153),and testing(n=90)cohorts.Ten predictors were identified through least absolute shrinkage and selection operator,decision tree,random forest,and expert con-sensus.Ten machine learning algorithms were applied,with performance assessed by area under the receiver operating characteristic curve(AUC),area under the precision-recall curve(AUPRC),calibration,and decision curves.The optimal model was integrated into a clinic-machine learning nomogram via the feature mapping algorithm,which was internally validated for predictive accuracy and clinical utility.(AUPRC),calibration,and decision curves.The optimal model was integrated into a clinic-machine learning nomogram via the feature mapping algorithm,which was internally validated for predictive accuracy and clinical utility.RESULTS A total of 855 colorectal cancer patients were enrolled,with 765 cases(April 2020 to December 2023)used for model training and validation,and 90 cases(January 2024 to July 2024)for internal testing.Baseline clinical features did not differ significantly between training and validation cohorts(P>0.05).Ten predictors were identified through integrated feature selection and expert consensus,including age,body surface area,body mass index,tumor position,albumin,carcinoembryonic antigen,carbohydrate antigen(CA)19-9,CA125,chemotherapy regimen,and chemotherapy cycles.Among ten machine learning algorithms,extreme gradient boosting achieved the best validation performance(AUC=0.97,AUPRC=0.92,sensitivity=0.79,specificity=0.92,accuracy=0.88).Logistic regression confirmed extra trees and random forest as independent predictors,which were incorporated into a clinic-machine learning nomogram.The clinic-machine learning nomogram demonstrated superior discrimination(AUC=0.96,AUPRC=0.93,accuracy=0.90,specificity=0.95),good calibration,and greater net clinical benefit across a wide probability range(10%-90%).Internal testing further confirmed its robustness and generalizability(AUC=0.95).CONCLUSION The clinic-machine learning nomogram accurately predicts chemotherapy-induced myelosuppression in colorectal cancer,providing interpretability and clinical utility to support individualized risk assessment and treatment decision-making.展开更多
基金The National Key Research and Development Program of China,No.2023YFC3206601。
文摘Landslides pose a formidable natural hazard across the Qinghai-Tibet Plateau(QTP),endangering both ecosystems and human life.Identifying the driving factors behind landslides and accurately assessing susceptibility are key to mitigating disaster risk.This study integrated multi-source historical landslide data with 15 predictive factors and used several machine learning models—Random Forest(RF),Gradient Boosting Regression Trees(GBRT),Extreme Gradient Boosting(XGBoost),and Categorical Boosting(CatBoost)—to generate susceptibility maps.The Shapley additive explanation(SHAP)method was applied to quantify factor importance and explore their nonlinear effects.The results showed that:(1)CatBoost was the best-performing model(CA=0.938,AUC=0.980)in assessing landslide susceptibility,with altitude emerging as the most significant factor,followed by distance to roads and earthquake sites,precipitation,and slope;(2)the SHAP method revealed critical nonlinear thresholds,demonstrating that historical landslides were concentrated at mid-altitudes(1400-4000 m)and decreased markedly above 4000 m,with a parallel reduction in probability beyond 700 m from roads;and(3)landslide-prone areas,comprising 13%of the QTP,were concentrated in the southeastern and northeastern parts of the plateau.By integrating machine learning and SHAP analysis,this study revealed landslide hazard-prone areas and their driving factors,providing insights to support disaster management strategies and sustainable regional planning.
基金supported by Guangdong Basic and Applied Basic Research Foundation(Nos.2021B1515120055 and 2022A1515010499).
文摘Urban rainwater runoff is an important source of nonpoint source pollution due to its transport of diverse contaminants,including polycyclic aromatic hydrocarbons(PAHs)and chlorinated derivatives.Importantly,these chlorinated polycyclic aromatic hydrocarbons(Cl-PAHs)exhibit elevated toxicological potential compared to their non-halogenated parent compounds.In this study,we proposed an approach that combined multivariate receptor model with integration of SHapley Additive exPlanations and Random Forest model.This method identifies the possible sources and reveals the impact of source apportionment results and environmental driving factors(such as geographical and meteorological data)on pollutant concentrations.Sixteen PAHs and nine ClPAHs were detected in 79 runoff samples from all three sites.TheΣ_(16)PAHs average concentration(2923.93 to 6071.83 ng/L)was significantly higher than theΣ_(9)Cl-PAHs(384.34 to 1314.73 ng/L).The source apportionment was conducted by positive matrix factorization(PMF),and six potential pollution sources for PAHs and three for Cl-PAHs were quantified.PAHs primarily originate from the combustion of fossil fuels such as traffic,industrial emissions and coal tar,while Cl-PAHs are mainly derived from atmospheric deposition and industrial emissions.Meanwhile,the self‑organizing map classified PAHs and Cl-PAHs into 2 and 3 groups,respectively.The k-means algorithm yielded 4 clusters for runoff samples.Among machine learning models,Random Forest(RF)demonstrated optimal predictive performance and integrated with SHapley Additive exPlanations(RF-SHAP)revealed the effects of driving factors on the predicted concentration of PAHs and Cl-PAHs in urban runoff samples.
基金supported by the Sichuan Province Nursing Scientific Research Project Plan(H23022)the 2022 Municipal-University Science and Technology Strategic Cooperation Special Fund of Nanchong Science and Technology Bureau(22SXQT0222)。
文摘Objective Accurately identifying the key influencing factors of psychological birth trauma in primiparous women is crucial for implementing effective preventive and intervention measures.This study aimed to develop and validate an interpretable machine learning prediction model for identifying the key influencing factors of psychological birth trauma in primiparous women.Methods A multicenter cross-sectional study was conducted on primiparous women in four tertiary hospitals in Sichuan Province,southwestern China,from December 2023 to March 2024.The Childbirth Trauma Index was used in assessing psychological birth trauma in primiparous women.Data were collected and randomly divided into a training set(80%,n=289)and a testing set(20%,n=73).Six different machine learning models were trained and tested.Training and prediction were conducted using six machine learning models included Linear Regression,Support Vector Regression,Multilayer Perceptron Regression,eXtreme Gradient Boosting Regression,Random Forest Regression,and Adaptive Boosting Regression.The optimal model was selected based on various performance metrics,and its predictive results were interpreted using SHapley Additive exPlanations(SHAP)and accumulated local effects(ALE).Results Among the six machine learning models,the Multilayer Perceptron Regression model exhibited the best overall performance in the testing set(MAE=3.977,MSE=24.832,R2=0.507,EVS=0.524,RMSE=4.983).In the testing set,the R2 and EVS of the Multilayer Perceptron Regression model increased by 8.3%and 1.2%,respectively,compared to the traditional linear regression model.Meanwhile,the MAE,MSE,and RMSE decreased by 0.4%,7.3%,and 3.7%,respectively,compared to the traditional linear regression model.The SHAP analysis indicated that intrapartum pain,anxiety,postpartum pain,resilience,and planned pregnancy are the most critical influencing factors of psychological birth trauma in primiparous women.The ALE analysis indicated that higher intrapartum pain,anxiety,and postpartum pain scores are risk factors,while higher resilience scores are protective factors.Conclusions Interpretable machine learning prediction models can identify the key influencing factors of psychological birth trauma in primiparous women.SHAP and ALE analyses based on the Multilayer Perceptron Regression model can help healthcare providers understand the complex decision-making logic within a prediction model.This study provides a scientific basis for the early prevention and personalized intervention of psychological birth trauma in primiparous women.
基金supported by the National Key Research and Development Program of China(Grant No.2022YFC3003205)the Chengdu University of Technology Postgraduate Innovative Cultivation Program(Grant No.10800-000510-01-022)+1 种基金the Sichuan Science and Technology Program(Grant No.2025ZNSFSC1206)the State Key Laboratory of Geohazard Prevention and Geoenvironment Protection Independent Research Project(Grant No.SKLGP2023Z026).
文摘Research on the application of machine learning(ML)models to landslide susceptibility assessments has gained popularity in recent years,with a focus primarily on topographic factors derived from digital elevation models(DEMs).However,few studies have focused on the explanatory effects of these factors on different models,i.e.whether DEM-based factors affect different models in the same way.This study investigated whether different ML models could yield consistent interpretations of DEM-based factors using explanatory algorithms.Six ML models,including a support vector machine,a neural network,extreme gradient boosting,a random forest,linear regression,and K-nearest neighbors,were trained and evaluated on five geospatial datasets derived from different DEMs.Each dataset contained eight DEM-based and six non-DEM-based factors from 8912 landslide samples.Model performance was assessed using accuracy,precision,recall rate,F1-score,kappa coefficient,and receiver operating characteristic curves.Explanatory analyses,including Shapley additive explanations and partial dependence plots,were also employed to investigate the effects of topographic factors on landslide susceptibility.The results indicate that DEM-based factors consistently influenced different ML models across the datasets.Furthermore,tree-based models outperformed the other models in almost all datasets,while the most suitable DEMs were obtained from Copernicus and TanDEM-X.In addition,the concave surface without potholes on steep slopes are ideal topographic conditions for landslide formation in the study area.This study can benefit the wider landslide research community by clarifying how topographic factors affect ML models.
基金supported by the National Natural Science Foundation of China,No.81972073(to HZ)a grant from the Taishan Scholars Program ofShandong Province-Young Taishan Scholars,No.tsqn201909197(to HZ)+1 种基金a grant from Tianjin Key Medical Discipline(Specialty)Construct Project,No.TJYXZDXK-027A(to SF)a grant from Academic Expert International Innovation Summit,No.22JRRCRC00010(to SF).
文摘Ferroptosis,a type of cell death that mainly involves iron metabolism imbalance and lipid peroxidation,is strongly correlated with the phagocytic response caused by bleeding after spinal cord injury.Thus,in this study,bulk RNA sequencing data(GSE47681 and GSE5296)and single-cell RNA sequencing data(GSE162610)were acquired from gene expression databases.We then conducted differential analysis and immune infiltration analysis.Atf3 and Piezo1 were identified as key ferroptosis genes through random forest and least absolute shrinkage and selection operator algorithms.Further analysis of single-cell RNA sequencing data revealed a close relationship between ferroptosis and cell types such as macrophages/microglia and their intrinsic state transition processes.Differences in transcription factor regulation and intercellular communication networks were found in ferroptosis-related cells,confirming the high expression of Atf3 and Piezo1 in these cells.Molecular docking analysis confirmed that the proteins encoded by these genes can bind cycloheximide.In a mouse model of T8 spinal cord injury,low-dose cycloheximide treatment was found to improve neurological function,decrease levels of the pro-inflammatory cytokine inducible nitric oxide synthase,and increase levels of the anti-inflammatory cytokine arginase 1.Correspondingly,the expression of the ferroptosis-related gene Gpx4 increased in macrophages/microglia,while the expression of Acsl4 decreased.Our findings reveal the important role of ferroptosis in the treatment of spinal cord injury,identify the key cell types and genes involved in ferroptosis after spinal cord injury,and validate the efficacy of potential drug therapies,pointing to new directions in the treatment of spinal cord injury.
文摘Online interactive learning plays a crucial role in improving online education quality.This grounded theory study examines:(1)what key factors shape EFL learners’online interactive learning,(2)how these factors form an empirically validated model,and(3)how they interact within this model,through systematic analysis of 9,207 discussion forum posts from a Chinese University MOOC platform.Results demonstrate that learning drive,course structure,teaching competence,interaction behavior,expected outcomes,and online learning context significantly influence EFL online interactive learning.The analysis reveals two key mechanisms:expected outcomes mediate the effects of learning drive(β=0.45),course structure,teaching competence,and interaction behavior(β=0.35)on learning outcomes,while online learning context moderates these relationships(β=0.25).Specifically,learning drive provides intrinsic/extrinsic motivation,whereas course structure,teaching competence,interaction behavior,and expected outcomes collectively enhance interaction quality and sustainability.These findings,derived through rigorous grounded theory methodology involving open,axial,and selective coding of large-scale interaction data,yield three key contributions:(1)a comprehensive theoretical model of EFL online learning dynamics,(2)empirical validation of mediation/moderation mechanisms,and(3)practical strategies for designing scaffolded interaction protocols and adaptive feedback systems.The study establishes that its theoretically saturated model(achieved after analyzing 7,366 posts with 1,841 verification cases)offers educators evidence-based approaches to optimize collaborative interaction in digital EFL environments.
文摘BACKGROUND Pancreatic fistula is the most common complication of pancreatic surgeries that causes more serious conditions,including bleeding due to visceral vessel erosion and peritonitis.AIM To develop a machine learning(ML)model for postoperative pancreatic fistula and identify significant risk factors of the complication.METHODS A single-center retrospective clinical study was conducted which included 150 patients,who underwent pancreat-oduodenectomy.Logistic regression,random forest,and CatBoost were employed for modeling the biochemical leak(symptomless fistula)and fistula grade B/C(clinically significant complication).The performance was estimated by receiver operating characteristic(ROC)area under the curve(AUC)after 5-fold cross-validation(20%testing and 80%training data).The risk factors were evaluated with the most accurate algorithm,based on the parameter“Importance”(Im),and Kendall correlation,P<0.05.RESULTS The CatBoost algorithm was the most accurate with an AUC of 74%-86%.The study provided results of ML-based modeling and algorithm selection for pancreatic fistula prediction and risk factor evaluation.From 14 parameters we selected the main pre-and intraoperative prognostic factors of all the fistulas:Tumor vascular invasion(Im=24.8%),age(Im=18.6%),and body mass index(Im=16.4%),AUC=74%.The ML model showed that biochemical leak,blood and drain amylase level(Im=21.6%and 16.4%),and blood leukocytes(Im=11.2%)were crucial predictors for subsequent fistula B/C,AUC=86%.Surgical techniques,morphology,and pancreatic duct diameter less than 3 mm were insignificant(Im<5%and no correlations detected).The results were confirmed by correlation analysis.CONCLUSION This study highlights the key predictors of postoperative pancreatic fistula and establishes a robust ML-based model for individualized risk prediction.These findings contribute to the advancement of personalized periop-erative care and may guide targeted preventive strategies.
文摘The new curriculum standard points out that affection is one of the most important goals of fundamental education. The non-target language environment is easier to cause the affective change of middle school students who are changeable in their affective state. Based on the affective filter hypothesis, this paper deals with the adjustment to affective factors in English learning by using Internet English Curriculum Resource, such as attitude and motivation, anxiety and inhibition, self-esteem and self-confidence. At last, some suggestions are offered to judge Internet English Curriculum Resource.
文摘This article aims at the discussion of cultural interferences in foreign language teaching and learning for Chinese non-English majors. It also calls for more attention from teachers, textbook compilers, and students themselves.
文摘Objective To study the self-consciousness of children with learning disabilities (LD) and to identify related factors. Methods Five hundred and sixty pupils graded from 1 to 6 in an elementary school were investigated. According to the pupil rating scale revised screening for learning disabilities (PRS), combined Raven’s test (CRT) and achievement of main courses, 35 of 560 pupils were diagnosed as LD children. Thirty-five children were selected from the average children and 35 from advanced children in academic achievement equally matched in class, gender, and age with LD children as control groups. The three groups were tested by Piers-Harris children’s self-concept scale. Basic information of each subject was collected by self-made questionnaire. Results Compared with the average and advanced children, LD children got significantly lower scores in self-concept scale. Based on logistic regression analysis, 3 factors were identified, including family income per month, single child and delivery model. Conclusion The results suggest that self-consciousness of children with LD is lower than that of normal children.
文摘BACKGROUND Colorectal polyps are precancerous diseases of colorectal cancer.Early detection and resection of colorectal polyps can effectively reduce the mortality of colorectal cancer.Endoscopic mucosal resection(EMR)is a common polypectomy proce-dure in clinical practice,but it has a high postoperative recurrence rate.Currently,there is no predictive model for the recurrence of colorectal polyps after EMR.AIM To construct and validate a machine learning(ML)model for predicting the risk of colorectal polyp recurrence one year after EMR.METHODS This study retrospectively collected data from 1694 patients at three medical centers in Xuzhou.Additionally,a total of 166 patients were collected to form a prospective validation set.Feature variable screening was conducted using uni-variate and multivariate logistic regression analyses,and five ML algorithms were used to construct the predictive models.The optimal models were evaluated based on different performance metrics.Decision curve analysis(DCA)and SHapley Additive exPlanation(SHAP)analysis were performed to assess clinical applicability and predictor importance.RESULTS Multivariate logistic regression analysis identified 8 independent risk factors for colorectal polyp recurrence one year after EMR(P<0.05).Among the models,eXtreme Gradient Boosting(XGBoost)demonstrated the highest area under the curve(AUC)in the training set,internal validation set,and prospective validation set,with AUCs of 0.909(95%CI:0.89-0.92),0.921(95%CI:0.90-0.94),and 0.963(95%CI:0.94-0.99),respectively.DCA indicated favorable clinical utility for the XGBoost model.SHAP analysis identified smoking history,family history,and age as the top three most important predictors in the model.CONCLUSION The XGBoost model has the best predictive performance and can assist clinicians in providing individualized colonoscopy follow-up recommendations.
基金Supported by Science and Technology Support Program of Qiandongnan Prefecture,No.Qiandongnan Sci-Tech Support[2021]12Guizhou Province High-Level Innovative Talent Training Program,No.Qiannan Thousand Talents[2022]201701.
文摘BACKGROUND Intensive care unit-acquired weakness(ICU-AW)is a common complication that significantly impacts the patient's recovery process,even leading to adverse outcomes.Currently,there is a lack of effective preventive measures.AIM To identify significant risk factors for ICU-AW through iterative machine learning techniques and offer recommendations for its prevention and treatment.METHODS Patients were categorized into ICU-AW and non-ICU-AW groups on the 14th day post-ICU admission.Relevant data from the initial 14 d of ICU stay,such as age,comorbidities,sedative dosage,vasopressor dosage,duration of mechanical ventilation,length of ICU stay,and rehabilitation therapy,were gathered.The relationships between these variables and ICU-AW were examined.Utilizing iterative machine learning techniques,a multilayer perceptron neural network model was developed,and its predictive performance for ICU-AW was assessed using the receiver operating characteristic curve.RESULTS Within the ICU-AW group,age,duration of mechanical ventilation,lorazepam dosage,adrenaline dosage,and length of ICU stay were significantly higher than in the non-ICU-AW group.Additionally,sepsis,multiple organ dysfunction syndrome,hypoalbuminemia,acute heart failure,respiratory failure,acute kidney injury,anemia,stress-related gastrointestinal bleeding,shock,hypertension,coronary artery disease,malignant tumors,and rehabilitation therapy ratios were significantly higher in the ICU-AW group,demonstrating statistical significance.The most influential factors contributing to ICU-AW were identified as the length of ICU stay(100.0%)and the duration of mechanical ventilation(54.9%).The neural network model predicted ICU-AW with an area under the curve of 0.941,sensitivity of 92.2%,and specificity of 82.7%.CONCLUSION The main factors influencing ICU-AW are the length of ICU stay and the duration of mechanical ventilation.A primary preventive strategy,when feasible,involves minimizing both ICU stay and mechanical ventilation duration.
基金supported by the College of Agriculture,Shiraz University(Grant No.97GRC1M271143)funding from the UK Biotechnology and Biological Sciences Research Council(BBSRC)funded by BBSRC grant award BBS/E/C/000I0330–Soil to Nutrition project 3–Sustainable intensification:optimisation at multiple scales。
文摘This investigation assessed the efficacy of 10 widely used machine learning algorithms(MLA)comprising the least absolute shrinkage and selection operator(LASSO),generalized linear model(GLM),stepwise generalized linear model(SGLM),elastic net(ENET),partial least square(PLS),ridge regression,support vector machine(SVM),classification and regression trees(CART),bagged CART,and random forest(RF)for gully erosion susceptibility mapping(GESM)in Iran.The location of 462 previously existing gully erosion sites were mapped through widespread field investigations,of which 70%(323)and 30%(139)of observations were arbitrarily divided for algorithm calibration and validation.Twelve controlling factors for gully erosion,namely,soil texture,annual mean rainfall,digital elevation model(DEM),drainage density,slope,lithology,topographic wetness index(TWI),distance from rivers,aspect,distance from roads,plan curvature,and profile curvature were ranked in terms of their importance using each MLA.The MLA were compared using a training dataset for gully erosion and statistical measures such as RMSE(root mean square error),MAE(mean absolute error),and R-squared.Based on the comparisons among MLA,the RF algorithm exhibited the minimum RMSE and MAE and the maximum value of R-squared,and was therefore selected as the best model.The variable importance evaluation using the RF model revealed that distance from rivers had the highest significance in influencing the occurrence of gully erosion whereas plan curvature had the least importance.According to the GESM generated using RF,most of the study area is predicted to have a low(53.72%)or moderate(29.65%)susceptibility to gully erosion,whereas only a small area is identified to have a high(12.56%)or very high(4.07%)susceptibility.The outcome generated by RF model is validated using the ROC(Receiver Operating Characteristics)curve approach,which returned an area under the curve(AUC)of 0.985,proving the excellent forecasting ability of the model.The GESM prepared using the RF algorithm can aid decision-makers in targeting remedial actions for minimizing the damage caused by gully erosion.
文摘Objective To investigate prevalence rate of learning disabilities (LD) in Chinese children, and to explore related risk factors, and to provide theoretical basis for preventing such disabilities. Methods One thousand and one hundred fifty one children were randomly selected in primary schools. According to criteria set by ICD-10, 118 children diagnosed as LD were classified into the study group. Four hundred and ninety one children were classified into the normal control group. Five hundred and forty two children were classified into the excellent control group. The study instruments included PRS (The pupil rating scale revised screening for learning disabilities), Conners' children behavior check-list taken by parents and YG-WR character check-list. Results The prevalence rate of LD in Chinese children was 10.3%. Significant differences were observed between LD and normally learning children, and between the LD group and the excellent group, in terms of scores of Conners' behavior check-list (P<0.05). The study further showed that individual differences in character between the LD group and the control groups still existed even after controlling individual differences in age, IQ, and gender. Some possible causal explanations contributing to LD were improper teaching by parents, low educational level of the parents, and children's characteristics and social relationships. Conclusion These data underscore the fact that LD is a serious national public health problem in China. LD is resulted from a number of factors. Good studying and living environments should be created for LD children.
基金funded by the Natural Science Foundation of China(Grant Nos.41807285,41972280 and 52179103).
文摘To perform landslide susceptibility prediction(LSP),it is important to select appropriate mapping unit and landslide-related conditioning factors.The efficient and automatic multi-scale segmentation(MSS)method proposed by the authors promotes the application of slope units.However,LSP modeling based on these slope units has not been performed.Moreover,the heterogeneity of conditioning factors in slope units is neglected,leading to incomplete input variables of LSP modeling.In this study,the slope units extracted by the MSS method are used to construct LSP modeling,and the heterogeneity of conditioning factors is represented by the internal variations of conditioning factors within slope unit using the descriptive statistics features of mean,standard deviation and range.Thus,slope units-based machine learning models considering internal variations of conditioning factors(variant slope-machine learning)are proposed.The Chongyi County is selected as the case study and is divided into 53,055 slope units.Fifteen original slope unit-based conditioning factors are expanded to 38 slope unit-based conditioning factors through considering their internal variations.Random forest(RF)and multi-layer perceptron(MLP)machine learning models are used to construct variant Slope-RF and Slope-MLP models.Meanwhile,the Slope-RF and Slope-MLP models without considering the internal variations of conditioning factors,and conventional grid units-based machine learning(Grid-RF and MLP)models are built for comparisons through the LSP performance assessments.Results show that the variant Slopemachine learning models have higher LSP performances than Slope-machine learning models;LSP results of variant Slope-machine learning models have stronger directivity and practical application than Grid-machine learning models.It is concluded that slope units extracted by MSS method can be appropriate for LSP modeling,and the heterogeneity of conditioning factors within slope units can more comprehensively reflect the relationships between conditioning factors and landslides.The research results have important reference significance for land use and landslide prevention.
基金supported by the National Key R&D Program of China(No.2017YFC1309100)the National Science Fund for Distinguished Young Scholars(No.81925023)+1 种基金the National Natural Science Foundation of China(No.81771912,81701662,81701782,81601469,and 81702322)Science and Technology Planning Project of Guangdong Province(No.2017B020227012)。
文摘Objective:To evaluate the human epidermal growth factor receptor 2(HER2)status in patients with breast cancer using multidetector computed tomography(MDCT)-based handcrafted and deep radiomics features.Methods:This retrospective study enrolled 339 female patients(primary cohort,n=177;validation cohort,n=162)with pathologically confirmed invasive breast cancer.Handcrafted and deep radiomics features were extracted from the MDCT images during the arterial phase.After the feature selection procedures,handcrafted and deep radiomics signatures and the combined model were built using multivariate logistic regression analysis.Performance was assessed by measures of discrimination,calibration,and clinical usefulness in the primary cohort and validated in the validation cohort.Results:The handcrafted radiomics signature had a discriminative ability with a C-index of 0.739[95%confidence interval(95%CI):0.661-0.818]in the primary cohort and 0.695(95%CI:0.609-0.781)in the validation cohort.The deep radiomics signature also had a discriminative ability with a C-index of 0.760(95%CI:0.690-0.831)in the primary cohort and 0.777(95%CI:0.696-0.857)in the validation cohort.The combined model,which incorporated both the handcrafted and deep radiomics signatures,showed good discriminative ability with a C-index of 0.829(95%CI:0.767-0.890)in the primary cohort and 0.809(95%CI:0.740-0.879)in the validation cohort.Conclusions:Handcrafted and deep radiomics features from MDCT images were associated with HER2 status in patients with breast cancer.Thus,these features could provide complementary aid for the radiological evaluation of HER2 status in breast cancer.
基金Supported by Government Assignment,No.1023022600020-6RSF Grant,No.24-15-00549Ministry of Science and Higher Education of the Russian Federation within the Framework of State Support for the Creation and Development of World-Class Research Center,No.075-15-2022-304.
文摘BACKGROUND Ischemic heart disease(IHD)impacts the quality of life and has the highest mortality rate of cardiovascular diseases globally.AIM To compare variations in the parameters of the single-lead electrocardiogram(ECG)during resting conditions and physical exertion in individuals diagnosed with IHD and those without the condition using vasodilator-induced stress computed tomography(CT)myocardial perfusion imaging as the diagnostic reference standard.METHODS This single center observational study included 80 participants.The participants were aged≥40 years and given an informed written consent to participate in the study.Both groups,G1(n=31)with and G2(n=49)without post stress induced myocardial perfusion defect,passed cardiologist consultation,anthropometric measurements,blood pressure and pulse rate measurement,echocardiography,cardio-ankle vascular index,bicycle ergometry,recording 3-min single-lead ECG(Cardio-Qvark)before and just after bicycle ergometry followed by performing CT myocardial perfusion.The LASSO regression with nested cross-validation was used to find the association between Cardio-Qvark parameters and the existence of the perfusion defect.Statistical processing was performed with the R programming language v4.2,Python v.3.10[^R],and Statistica 12 program.RESULTS Bicycle ergometry yielded an area under the receiver operating characteristic curve of 50.7%[95%confidence interval(CI):0.388-0.625],specificity of 53.1%(95%CI:0.392-0.673),and sensitivity of 48.4%(95%CI:0.306-0.657).In contrast,the Cardio-Qvark test performed notably better with an area under the receiver operating characteristic curve of 67%(95%CI:0.530-0.801),specificity of 75.5%(95%CI:0.628-0.88),and sensitivity of 51.6%(95%CI:0.333-0.695).CONCLUSION The single-lead ECG has a relatively higher diagnostic accuracy compared with bicycle ergometry by using machine learning models,but the difference was not statistically significant.However,further investigations are required to uncover the hidden capabilities of single-lead ECG in IHD diagnosis.
基金Supported by the Beijing Municipal Natural Science Foundation,No.7252262High Level Chinese Medical Hospital Promotion Project,No.HLCMHPP2023085+2 种基金National Natural Science Foundation of China,No.82174463National Administration of Traditional Chinese Medicine,No.ZYYCXTD-C-C202205China Academy of Chinese Medical Sciences,No.CI2021A01804 and No.2022S469.
文摘BACKGROUND Colorectal cancer is a common digestive malignancy,and chemotherapy remains a cornerstone of treatment.Myelosuppression,a frequent hematologic toxicity,poses significant clinical challenges.However,no interpretable machine learning-based nomogram exists to predict chemotherapy-induced myelosuppression in colorectal cancer patients.This study aimed to develop and validate an inter-pretable clinic-machine learning nomogram integrating clinical predictors with multiple algorithms via a feature mapping algorithm.The model provides accurate risk estimation and clinical interpretability,supporting individualized prevention strategies and optimizing decision-making in patients receiving first-line chemotherapy.AIM To develop and validate an interpretable clinic-machine learning nomogram predicting chemotherapy-induced myelosuppression in colorectal cancer.METHODS This retrospective study enrolled 855 colorectal cancer patients receiving first-line chemotherapy.Data were split into training(n=612),validation(n=153),and testing(n=90)cohorts.Ten predictors were identified through least absolute shrinkage and selection operator,decision tree,random forest,and expert con-sensus.Ten machine learning algorithms were applied,with performance assessed by area under the receiver operating characteristic curve(AUC),area under the precision-recall curve(AUPRC),calibration,and decision curves.The optimal model was integrated into a clinic-machine learning nomogram via the feature mapping algorithm,which was internally validated for predictive accuracy and clinical utility.(AUPRC),calibration,and decision curves.The optimal model was integrated into a clinic-machine learning nomogram via the feature mapping algorithm,which was internally validated for predictive accuracy and clinical utility.RESULTS A total of 855 colorectal cancer patients were enrolled,with 765 cases(April 2020 to December 2023)used for model training and validation,and 90 cases(January 2024 to July 2024)for internal testing.Baseline clinical features did not differ significantly between training and validation cohorts(P>0.05).Ten predictors were identified through integrated feature selection and expert consensus,including age,body surface area,body mass index,tumor position,albumin,carcinoembryonic antigen,carbohydrate antigen(CA)19-9,CA125,chemotherapy regimen,and chemotherapy cycles.Among ten machine learning algorithms,extreme gradient boosting achieved the best validation performance(AUC=0.97,AUPRC=0.92,sensitivity=0.79,specificity=0.92,accuracy=0.88).Logistic regression confirmed extra trees and random forest as independent predictors,which were incorporated into a clinic-machine learning nomogram.The clinic-machine learning nomogram demonstrated superior discrimination(AUC=0.96,AUPRC=0.93,accuracy=0.90,specificity=0.95),good calibration,and greater net clinical benefit across a wide probability range(10%-90%).Internal testing further confirmed its robustness and generalizability(AUC=0.95).CONCLUSION The clinic-machine learning nomogram accurately predicts chemotherapy-induced myelosuppression in colorectal cancer,providing interpretability and clinical utility to support individualized risk assessment and treatment decision-making.