Tropospheric zenith wet delay(ZWD)plays a vital role in the analysis of space geodetic observations.In recent years,machine learning methods have been increasingly applied to improve the accuracy of ZWD calculations.H...Tropospheric zenith wet delay(ZWD)plays a vital role in the analysis of space geodetic observations.In recent years,machine learning methods have been increasingly applied to improve the accuracy of ZWD calculations.However,a single machine learning model has limited generalization capabilities.To address these limitations,this study introduces a novel machine learning fusion(MLF)algorithm with stronger generalization capabilities to enhance ZWD modeling and prediction accuracy.The MLF algorithm utilizes a two-layer structure integrating extra trees(ET),backpropagation neural network(BPNN),and linear regression models.By comparing the root mean square error(RMSE)of these models,we found that both ET-based and MLF-based models outperform RF-based and BPNN-based models in terms of internal and external accuracy,across both surface meteorological data-based and blind models.The improvement in exte rnal accuracy is particularly significant in the blind models.Our re sults show that the MLF(with an RMSE of 3.93 cm)and ET(3.99 cm)models outperform the traditional GPT3model(4.07 cm),while the RF(4.21 cm)and BPNN(4.14 cm)have worse external accuracies than the GPT3 model.It is worth noting that the BPNN suffered from overfitting during external accuracy tests,which was avoided by the MLF.In summary,regardless of the availability of surface meteorological data,the MLF-based empirical models demonstrate superior internal and external accuracy compared to the other tested models in this study.展开更多
BACKGROUND:This study aims to develop and validate a machine learning-based in-hospital mortality predictive model for acute aortic syndrome(AAS)in the emergency department(ED)and to derive a simplifi ed version suita...BACKGROUND:This study aims to develop and validate a machine learning-based in-hospital mortality predictive model for acute aortic syndrome(AAS)in the emergency department(ED)and to derive a simplifi ed version suitable for rapid clinical application.METHODS:In this multi-center retrospective cohort study,AAS patient data from three hospitals were analyzed.The modeling cohort included data from the First Affiliated Hospital of Zhengzhou University and the People’s Hospital of Xinjiang Uygur Autonomous Region,with Peking University Third Hospital data serving as the external test set.Four machine learning algorithms—logistic regression(LR),multilayer perceptron(MLP),Gaussian naive Bayes(GNB),and random forest(RF)—were used to develop predictive models based on 34 early-accessible clinical variables.A simplifi ed model was then derived based on fi ve key variables(Stanford type,pericardial eff usion,asymmetric peripheral arterial pulsation,decreased bowel sounds,and dyspnea)via Least Absolute Shrinkage and Selection Operator(LASSO)regression to improve ED applicability.RESULTS:A total of 929 patients were included in the modeling cohort,and 210 were included in the external test set.Four machine learning models based on 34 clinical variables were developed,achieving internal and external validation AUCs of 0.85-0.90 and 0.73-0.85,respectively.The simplifi ed model incorporating fi ve key variables demonstrated internal and external validation AUCs of 0.71-0.86 and 0.75-0.78,respectively.Both models showed robust calibration and predictive stability across datasets.CONCLUSION:Both kinds of models were built based on machine learning tools,and proved to have certain prediction performance and extrapolation.展开更多
The rapid advancement of machine learning based tight-binding Hamiltonian(MLTB)methods has opened new avenues for efficient and accurate electronic structure simulations,particularly in large-scale systems and long-ti...The rapid advancement of machine learning based tight-binding Hamiltonian(MLTB)methods has opened new avenues for efficient and accurate electronic structure simulations,particularly in large-scale systems and long-time scenarios.This review begins with a concise overview of traditional tight-binding(TB)models,including both(semi-)empirical and first-principles approaches,establishing the foundation for understanding MLTB developments.We then present a systematic classification of existing MLTB methodologies,grouped into two major categories:direct prediction of TB Hamiltonian elements and inference of empirical parameters.A comparative analysis with other ML-based electronic structure models is also provided,highlighting the advancement of MLTB approaches.Finally,we explore the emerging MLTB application ecosystem,highlighting how the integration of MLTB models with a diverse suite of post-processing tools from linear-scaling solvers to quantum transport frameworks and molecular dynamics interfaces is essential for tackling complex scientific problems across different domains.The continued advancement of this integrated paradigm promises to accelerate materials discovery and open new frontiers in the predictive simulation of complex quantum phenomena.展开更多
This study aims to develop an accurate and robust machine learning model to predict the carbonation depth of fly ash concrete,overcoming the limitations of traditional predictive methods.Five ensemble-based models,suc...This study aims to develop an accurate and robust machine learning model to predict the carbonation depth of fly ash concrete,overcoming the limitations of traditional predictive methods.Five ensemble-based models,such as adaptive boosting(AdaBoost),categorical boosting(CatBoost),gradient boosting regressor(GBR),hist gradient boosting regressor(HistGBR),and extreme gradient boosting(XGBoost),were developed and optimized using 729 high-quality dataset points incorporating seven input parameters,including cement,CO_(2),exposure time,water-binder ratio,fly ash,curing time,and compressive strength.Several performance evaluation metrics were used to compare the models.The GBR model emerged as the best-performing model,based on high coefficient of determination(R^(2))values and balanced error metrics across both validation and testing datasets.While all models performed exceptionally well on the training data,GBR demonstrated superior generalization capability,with R^(2) values of 0.9438 on the validation set and 0.9310 on the testing set.Furthermore,its low mean squared error(MSE),root mean square error(RMSE),mean absolute error(MAE),and median absolute error(MdAE)confirmed its robustness and accuracy.Moreover,shapley additive explanations(SHAP)analysis enhanced the interpretability of predictions,highlighting the curing time and exposure time as the most critical drivers of carbonation depth.展开更多
Oxide dispersion strengthened(ODS)alloys are extensively used owing to high thermostability and creep strength contributed from uniformly dispersed fine oxides particles.However,the existence of these strengthening pa...Oxide dispersion strengthened(ODS)alloys are extensively used owing to high thermostability and creep strength contributed from uniformly dispersed fine oxides particles.However,the existence of these strengthening particles also deteriorates the processability and it is of great importance to establish accurate processing maps to guide the thermomechanical processes to enhance the formability.In this study,we performed particle swarm optimization-based back propagation artificial neural network model to predict the high temperature flow behavior of 0.25wt%Al2O3 particle-reinforced Cu alloys,and compared the accuracy with that of derived by Arrhenius-type constitutive model and back propagation artificial neural network model.To train these models,we obtained the raw data by fabricating ODS Cu alloys using the internal oxidation and reduction method,and conducting systematic hot compression tests between 400 and800℃with strain rates of 10^(-2)-10 S^(-1).At last,processing maps for ODS Cu alloys were proposed by combining processing parameters,mechanical behavior,microstructure characterization,and the modeling results achieved a coefficient of determination higher than>99%.展开更多
Lithium-ion batteries(LIBs)are widely deployed,from grid-scale storage to electric vehicles.LIBs remain stationary most of their service life,where calendar aging degrades capacity.Understanding the mechanisms of LIB ...Lithium-ion batteries(LIBs)are widely deployed,from grid-scale storage to electric vehicles.LIBs remain stationary most of their service life,where calendar aging degrades capacity.Understanding the mechanisms of LIB calendar aging is crucial for extending battery lifespan.However,LIB calendar aging is influenced by multiple factors,including battery material,its state,and storage environment.Calendar aging experiments are also time-consuming,costly,and lack standardized testing conditions.This study employs a data-driven approach to establish a cross-scale database linking materials,side-reaction mechanisms,and calendar aging of LIBs.MELODI(Mechanism-informed,Explainable,Learning-based Optimization for Degradation Identification)is proposed to identify calendar aging mechanisms and quantify the effects of multi-scale factors.Results reveal that cathode material loss drives up to 91.42%of calendar aging degradation in high-nickel(Ni)batteries,while solid electrolyte interphase growth dominates in lithium iron phosphate(LFP)and low-Ni batteries,contributing up to 82.43%of degradation in LFP batteries and 99.10%of decay in low-Ni batteries,respectively.This study systematically quantifies calendar aging in commercial LIBs under varying materials,states of charge,and temperatures.These findings offer quantitative guidance for experimental design or battery use,and implications for emerging applications like aerial robotics,vehicle-to-grid,and embodied intelligence systems.展开更多
The efficient and safe delivery of messenger RNA(m RNA)therapeutics remains a critical challenge for clinical translation,driving the need for advanced carrier design.Ionizable amphiphilic Janus dendrimers(IAJDs)repre...The efficient and safe delivery of messenger RNA(m RNA)therapeutics remains a critical challenge for clinical translation,driving the need for advanced carrier design.Ionizable amphiphilic Janus dendrimers(IAJDs)represent a promising class of carriers;however,their structural complexity and limited available datasets hinder systematic exploration and optimization.In this study,we established a tailored machinelearning framework to investigate the structure-function relationships of IAJDs under a constrained data regime(n=231).Conventional molecular fingerprints were found to be suboptimal for representing these macromolecules,motivating the adoption of count-based descriptors and systematic ablation analyses to disentangle the contributions of the substructural features.These experiments identified key functional motifs underlying transfection performance and provided interpretable insights into the IAJD design principles.Complementing these handcrafted descriptors,we further applied deep learning-based molecular embeddings,which captured higher-order chemical semantics and significantly improved predictive accuracy.Collectively,these advances demonstrate that both refined fingerprinting and representation learning approaches can overcome data limitations,enabling the reliable prediction of IAJD activity while offering mechanistic interpretability.This study illustrates the potential of data-driven strategies as hypothesis-generation and prioritization tools for the design of next-generation m RNA delivery systems.展开更多
Landslides pose a significant threat in the mountainous regions of Nepal.Landslide susceptibility maps are commonly used to identify potential landslide zones by statistically analyzing geological,topographical,and hy...Landslides pose a significant threat in the mountainous regions of Nepal.Landslide susceptibility maps are commonly used to identify potential landslide zones by statistically analyzing geological,topographical,and hydrological factors,assuming that similar conditions may trigger future failures.While such maps provide valuable insights into landslide-triggering conditions,they are limited in assessing risk to settlements and infrastructure located downslope or in valley bottoms.This study integrates machine learning based landslide susceptibility with numerical runout modeling to provide a comprehensive landslide hazard assessment in the Bhotekoshi watershed,overcoming the limitations of traditional models that focus solely on statistical susceptibility.To conduct the susceptibility analysis,a total of 439 landslides were mapped from 2012 to 2021 using satellite images.Of these,70%were used for training two machine learning(ML)models:random forest and Xtreme Gradient Boosting(XGBoost),and the remaining 30%were used for validation.Among the two ML models,Random Forest model demonstrated slightly superior performance,achieving higher predictive accuracy.After the machine learning susceptibility analysis,the study transitions into a regional-scale landslide runout analysis.First,a back analysis of the past landslide event was conducted to fine-tune the model parameters(internal angle of friction and basal friction angle)and validate performance of the runout model.Following the back analysis,the regional-scale numerical modeling of landslide runout was conducted by designating areas classified as the highest susceptibility class in the Random Forest susceptibility map as potential release zones.This approach allows for a detailed examination of landslide propagation and potential impacts along the downslope settlements and infrastructures.The analysis clearly demonstrates that integrating both machine learning and numerical runout methods significantly increases the estimated exposure of population,buildings,and roads within the very high hazard class compared to relying solely on susceptibility methods.Specifically,population exposure rises from 360 to 7743,buildings increase from 97 to 2771,and road exposure expands from 41 to 251 km.This result highlights the significant risk of underestimating exposure in the analyses that solely rely on landslide susceptibility models.Integration of susceptibility and runout analysis improves landslide risk assessment,aiding in land-use planning and disaster mitigation strategies.展开更多
This study investigates the uncertain dynamic characterization of hybrid composite plates by employing advanced machine-assisted finite element methodologies.Hybrid composites,widely used in aerospace,automotive,and s...This study investigates the uncertain dynamic characterization of hybrid composite plates by employing advanced machine-assisted finite element methodologies.Hybrid composites,widely used in aerospace,automotive,and structural applications,often face variability in material properties,geometric configurations,and manufacturing processes,leading to uncertainty in their dynamic response.To address this,three surrogate-based machine learning approaches like radial basis function(RBF),multivariate adaptive regression splines(MARS),and polynomial neural networks(PNN)are integrated with a finite element framework to efficiently capture the stochastic behavior of these plates.The research focuses on predicting the first three natural frequencies under material uncertainties,which are critical to ensuring structural reliability.Monte Carlo simulation(MCS)is used as a benchmark for generating probabilistic datasets,including mean values,standard deviations,and probability density functions.The surrogate models are then trained and validated against these datasets,enabling accurate representation of uncertainty with substantially fewer samples compared to conventionalMCS.Among the methods studied,the RBFmodel demonstrates superior performance,closely approximating MCS results with a reduced sample size,thereby achieving significant computational savings.The proposed framework not only reduces computational time and costs but also maintains high predictive accuracy,making it well-suited for complex engineering systems.Beyond free vibration analysis,the methodology can be extended to more sophisticated scenarios,such as forced vibration,damping effects,and nonlinear structural responses.Overall,this work presents a computationally efficient and robust approach for surrogate-based uncertainty quantification,advancing the analysis and design of hybrid composite structures under uncertainty.展开更多
Accurate prediction of concrete compressive strength is fundamental for optimizing mix designs,improving material utilization,and ensuring structural safety in modern construction.Traditional empirical methods often f...Accurate prediction of concrete compressive strength is fundamental for optimizing mix designs,improving material utilization,and ensuring structural safety in modern construction.Traditional empirical methods often fail to capture the non-linear relationships among concrete constituents,especially with the growing use of supple-mentary cementitious materials and recycled aggregates.This study presents an integrated machine learning framework for concrete strength prediction,combining advanced regression models—namely CatBoost—with metaheuristic optimization algorithms,with a particular focus on the Somersaulting Spider Optimizer(SSO).A comprehensive dataset encompassing diverse mix proportions and material types was used to evaluate baseline machine learning models,including CatBoost,XGBoost,ExtraTrees,and RandomForest.Among these,CatBoost demonstrated superior accuracy across multiple performance metrics.To further enhance predictive capability,several bio-inspired optimizers were employed for hyperparameter tuning.The SSO-CatBoost hybrid achieved the lowest mean squared error and highest correlation coefficients,outperforming other metaheuristic approaches such as Genetic Algorithm,Particle Swarm Optimization,and Grey Wolf Optimizer.Statistical significance was established through Analysis of Variance and Wilcoxon signed-rank testing,confirming the robustness of the optimized models.The proposed methodology not only delivers improved predictive performance but also offers a transparent framework for mix design optimization,supporting data-driven decision making in sustainable and resilient infrastructure development.展开更多
Accurate prediction of flood events is important for flood control and risk management.Machine learning techniques contributed greatly to advances in flood predictions,and existing studies mainly focused on predicting...Accurate prediction of flood events is important for flood control and risk management.Machine learning techniques contributed greatly to advances in flood predictions,and existing studies mainly focused on predicting flood resource variables using single or hybrid machine learning techniques.However,class-based flood predictions have rarely been investigated,which can aid in quickly diagnosing comprehensive flood characteristics and proposing targeted management strategies.This study proposed a prediction approach of flood regime metrics and event classes coupling machine learning algorithms with clustering-deduced membership degrees.Five algorithms were adopted for this exploration.Results showed that the class membership degrees accurately determined event classes with class hit rates up to 100%,compared with the four classes clustered from nine regime metrics.The nonlinear algorithms(Multiple Linear Regression,Random Forest,and least squares-Support Vector Machine)outperformed the linear techniques(Multiple Linear Regression and Stepwise Regression)in predicting flood regime metrics.The proposed approach well predicted flood event classes with average class hit rates of 66.0%-85.4%and 47.2%-76.0%in calibration and validation periods,respectively,particularly for the slow and late flood events.The predictive capability of the proposed prediction approach for flood regime metrics and classes was considerably stronger than that of hydrological modeling approach.展开更多
Rainfall input errors are a major source of uncertainty in flood forecasting,and merging multi-source precipitation data is essential for improving accuracy.Traditional merging methods often prioritize precipitation m...Rainfall input errors are a major source of uncertainty in flood forecasting,and merging multi-source precipitation data is essential for improving accuracy.Traditional merging methods often prioritize precipitation magnitude enhancements while overlooking event detection and false alarms.To address these limitations,this study developed a precipitation integration framework that combines machine learning classification-plus-regression models with Bayesian model averaging(BMA).Three machine learning algorithms-categorical boosting(CatBoost),light gradient boosting machine(LightGBM),and random forest(RF)-were used to improve precipitation event detection.The framework includes spatial unification of raw satellite products using bilinear interpolation,bias correction through classification-plus-regression models,and final merging via a seasonal-scale BMA model.The method integrated GSMaP,IMERG,and PERSIANN satellite precipitation products,with ground observations used for model training(2001-2014)and independent validation(2015-2020)in the Upper Ganjiang River Basin,China.Results showed that the framework significantly enhanced precipitation estimation accuracy and detection capability.LightGBM-based integration exhibited superior detection performance(FAR=0.08,CSI=0.86),while RF-based integration achieved the highest overall accuracy(RMSE=4.67,CC=0.92).Seasonal variations in BMA weights underscored the need to account for seasonal characteristics of precipitation products.Additionally,accuracy improvements were observed across all rainfall categories,especially for heavy rainstorms.The seasonal-scale BMA fusion has combined the strengths of individual corrections and further enhanced precipitation estimation.This research offers a robust method for generating accurate rainfall inputs,providing valuable support for hydrological modeling and flood forecasting applications.展开更多
Post-kidney transplant rejection is a critical factor influencing transplant success rates and the survival of transplanted organs.With the rapid advancement of artificial intelligence technologies,machine learning(ML...Post-kidney transplant rejection is a critical factor influencing transplant success rates and the survival of transplanted organs.With the rapid advancement of artificial intelligence technologies,machine learning(ML)has emerged as a powerful data analysis tool,widely applied in the prediction,diagnosis,and mechanistic study of kidney transplant rejection.This mini-review systematically summarizes the recent applications of ML techniques in post-kidney transplant rejection,covering areas such as the construction of predictive models,identification of biomarkers,analysis of pathological images,assessment of immune cell infiltration,and formulation of personalized treatment strategies.By integrating multi-omics data and clinical information,ML has significantly enhanced the accuracy of early rejection diagnosis and the capability for prognostic evaluation,driving the development of precision medicine in the field of kidney transplantation.Furthermore,this article discusses the challenges faced in existing research and potential future directions,providing a theoretical basis and technical references for related studies.展开更多
Alterations in RNA methylation may affect the initiation and development of Alzheimer’s disease.However,the exact nature of the relationship between RNA methylation and Alzheimer’s disease remains unclear.In this st...Alterations in RNA methylation may affect the initiation and development of Alzheimer’s disease.However,the exact nature of the relationship between RNA methylation and Alzheimer’s disease remains unclear.In this study,RNA methylation levels were analyzed by bulk transcriptomic and single-cell RNA sequencing.The expression levels of RNA methylation regulators were confirmed using molecular biology techniques.Co-expression network analysis was used to identify relevant long non-coding RNAs.Molecular subtypes related to RNA methylation were classified,and variations in clinical characteristics,biological behavior,and immune signatures between subtypes were assessed.Machine learning approaches were applied to identify methylation-associated long non-coding RNAs,which were used to construct a risk model and nomogram for Alzheimer’s disease.Potential therapeutic agents for different risk groups were predicted,and in vitro experiments were conducted to identify key RNA methylation events.Single-cell analysis demonstrated enhanced RNA methylation in patients with Alzheimer’s disease,particularly within T cells,B cells,and NK cells.Quantitative reverse transcription-polymerase chain reaction and western blot confirmed alterations in RNA methylation regulators in neurons treated with amyloid-βoligomers in vitro.This evidence supported the classification of patients with Alzheimer’s disease into heterogeneous subtypes.Specifically,subtype 1 was identified as the immune-active subtype,while subtype 2 was characterized by a metabolic phenotype.Machine learning algorithms identified five significant methylation-associated long non-coding RNAs-LINC01007,MAP4K3-DT,MIR302CHG,VAC14-AS1,and TGFB2-OT1-that accurately predict clinical outcomes for patients with Alzheimer’s disease.These patients were classified into low-and high-risk categories;the latter group displayed higher immune infiltration,upregulated immune regulatory gene expression,and elevated immune scores and responded better to treatment with arachidonic-trifluoroethane.These findings suggest that dysregulated RNA methylation alters the immune microenvironment in Alzheimer’s disease and is closely associated with its progression.This phenomenon provides novel insights into potential therapeutic strategies for Alzheimer’s disease that target RNA methylation.展开更多
To curb the worsening tropospheric ozone(O_(3))pollution problem in China,a rapid and accurate identification of O_(3)-precursor sensitivity(OPS)is a crucial prerequisite for formulating effective contingency O_(3) po...To curb the worsening tropospheric ozone(O_(3))pollution problem in China,a rapid and accurate identification of O_(3)-precursor sensitivity(OPS)is a crucial prerequisite for formulating effective contingency O_(3) pollution control strategies.However,currently widely-used methods,such as statistical models and numerical models,exhibit inherent limitations in identifying OPS in a timely and accurate manner.In this study,we developed a novel approach to identify OPS based on eXtreme Gradient Boosting model,Shapley additive explanation(SHAP)al-gorithm,and volatile organic compound(VOC)photochemical decay adjustment,using the meteorology and speciated pollutant monitoring data as the input.By comparing the difference in SHAP values between base sce-nario and precursor reduction scenario for nitrogen oxides(NO_(x))and VOCs,OPS was divided into NO_(x)-limited,VOCs-limited and transition regime.Using the long-lasting O_(3) pollution episode in the autumn of 2022 at the Guangdong-Hong Kong-Macao Greater Bay Area(GBA)as an example,we demonstrated large spatiotemporal heterogeneities of OPS over the GBA,which were generally shifted from NO_(x)-limited to VOCs-limited from September to October and more inclined to be VOCs-limited at the central and NO_(x)-limited in the peripheral areas.This study developed an innovative OPS identification method by comparing the difference in SHAP value before and after precursor emission reduction.Our method enables the accurate identification of OPS in the time scale of seconds,thereby providing a state-of-the-art tool for the rapid guidance of spatial-specific O_(3) control strategies.展开更多
BACKGROUND Colorectal polyps are precancerous diseases of colorectal cancer.Early detection and resection of colorectal polyps can effectively reduce the mortality of colorectal cancer.Endoscopic mucosal resection(EMR...BACKGROUND Colorectal polyps are precancerous diseases of colorectal cancer.Early detection and resection of colorectal polyps can effectively reduce the mortality of colorectal cancer.Endoscopic mucosal resection(EMR)is a common polypectomy proce-dure in clinical practice,but it has a high postoperative recurrence rate.Currently,there is no predictive model for the recurrence of colorectal polyps after EMR.AIM To construct and validate a machine learning(ML)model for predicting the risk of colorectal polyp recurrence one year after EMR.METHODS This study retrospectively collected data from 1694 patients at three medical centers in Xuzhou.Additionally,a total of 166 patients were collected to form a prospective validation set.Feature variable screening was conducted using uni-variate and multivariate logistic regression analyses,and five ML algorithms were used to construct the predictive models.The optimal models were evaluated based on different performance metrics.Decision curve analysis(DCA)and SHapley Additive exPlanation(SHAP)analysis were performed to assess clinical applicability and predictor importance.RESULTS Multivariate logistic regression analysis identified 8 independent risk factors for colorectal polyp recurrence one year after EMR(P<0.05).Among the models,eXtreme Gradient Boosting(XGBoost)demonstrated the highest area under the curve(AUC)in the training set,internal validation set,and prospective validation set,with AUCs of 0.909(95%CI:0.89-0.92),0.921(95%CI:0.90-0.94),and 0.963(95%CI:0.94-0.99),respectively.DCA indicated favorable clinical utility for the XGBoost model.SHAP analysis identified smoking history,family history,and age as the top three most important predictors in the model.CONCLUSION The XGBoost model has the best predictive performance and can assist clinicians in providing individualized colonoscopy follow-up recommendations.展开更多
Background Cotton is one of the most important commercial crops after food crops,especially in countries like India,where it’s grown extensively under rainfed conditions.Because of its usage in multiple industries,su...Background Cotton is one of the most important commercial crops after food crops,especially in countries like India,where it’s grown extensively under rainfed conditions.Because of its usage in multiple industries,such as textile,medicine,and automobile industries,it has greater commercial importance.The crop’s performance is greatly influenced by prevailing weather dynamics.As climate changes,assessing how weather changes affect crop performance is essential.Among various techniques that are available,crop models are the most effective and widely used tools for predicting yields.Results This study compares statistical and machine learning models to assess their ability to predict cotton yield across major producing districts of Karnataka,India,utilizing a long-term dataset spanning from 1990 to 2023 that includes yield and weather factors.The artificial neural networks(ANNs)performed superiorly with acceptable yield deviations ranging within±10%during both vegetative stage(F1)and mid stage(F2)for cotton.The model evaluation metrics such as root mean square error(RMSE),normalized root mean square error(nRMSE),and modelling efficiency(EF)were also within the acceptance limits in most districts.Furthermore,the tested ANN model was used to assess the importance of the dominant weather factors influencing crop yield in each district.Specifically,the use of morning relative humidity as an individual parameter and its interaction with maximum and minimum tempera-ture had a major influence on cotton yield in most of the yield predicted districts.These differences highlighted the differential interactions of weather factors in each district for cotton yield formation,highlighting individual response of each weather factor under different soils and management conditions over the major cotton growing districts of Karnataka.Conclusions Compared with statistical models,machine learning models such as ANNs proved higher efficiency in forecasting the cotton yield due to their ability to consider the interactive effects of weather factors on yield forma-tion at different growth stages.This highlights the best suitability of ANNs for yield forecasting in rainfed conditions and for the study on relative impacts of weather factors on yield.Thus,the study aims to provide valuable insights to support stakeholders in planning effective crop management strategies and formulating relevant policies.展开更多
BACKGROUND Ischemic heart disease(IHD)impacts the quality of life and has the highest mortality rate of cardiovascular diseases globally.AIM To compare variations in the parameters of the single-lead electrocardiogram...BACKGROUND Ischemic heart disease(IHD)impacts the quality of life and has the highest mortality rate of cardiovascular diseases globally.AIM To compare variations in the parameters of the single-lead electrocardiogram(ECG)during resting conditions and physical exertion in individuals diagnosed with IHD and those without the condition using vasodilator-induced stress computed tomography(CT)myocardial perfusion imaging as the diagnostic reference standard.METHODS This single center observational study included 80 participants.The participants were aged≥40 years and given an informed written consent to participate in the study.Both groups,G1(n=31)with and G2(n=49)without post stress induced myocardial perfusion defect,passed cardiologist consultation,anthropometric measurements,blood pressure and pulse rate measurement,echocardiography,cardio-ankle vascular index,bicycle ergometry,recording 3-min single-lead ECG(Cardio-Qvark)before and just after bicycle ergometry followed by performing CT myocardial perfusion.The LASSO regression with nested cross-validation was used to find the association between Cardio-Qvark parameters and the existence of the perfusion defect.Statistical processing was performed with the R programming language v4.2,Python v.3.10[^R],and Statistica 12 program.RESULTS Bicycle ergometry yielded an area under the receiver operating characteristic curve of 50.7%[95%confidence interval(CI):0.388-0.625],specificity of 53.1%(95%CI:0.392-0.673),and sensitivity of 48.4%(95%CI:0.306-0.657).In contrast,the Cardio-Qvark test performed notably better with an area under the receiver operating characteristic curve of 67%(95%CI:0.530-0.801),specificity of 75.5%(95%CI:0.628-0.88),and sensitivity of 51.6%(95%CI:0.333-0.695).CONCLUSION The single-lead ECG has a relatively higher diagnostic accuracy compared with bicycle ergometry by using machine learning models,but the difference was not statistically significant.However,further investigations are required to uncover the hidden capabilities of single-lead ECG in IHD diagnosis.展开更多
The high porosity and tunable chemical functionality of metal-organic frameworks(MOFs)make it a promising catalyst design platform.High-throughput screening of catalytic performance is feasible since the large MOF str...The high porosity and tunable chemical functionality of metal-organic frameworks(MOFs)make it a promising catalyst design platform.High-throughput screening of catalytic performance is feasible since the large MOF structure database is available.In this study,we report a machine learning model for high-throughput screening of MOF catalysts for the CO_(2) cycloaddition reaction.The descriptors for model training were judiciously chosen according to the reaction mechanism,which leads to high accuracy up to 97%for the 75%quantile of the training set as the classification criterion.The feature contribution was further evaluated with SHAP and PDP analysis to provide a certain physical understanding.12,415 hypothetical MOF structures and 100 reported MOFs were evaluated under 100℃ and 1 bar within one day using the model,and 239 potentially efficient catalysts were discovered.Among them,MOF-76(Y)achieved the top performance experimentally among reported MOFs,in good agreement with the prediction.展开更多
Geared-rotor systems are critical components in mechanical applications,and their performance can be severely affected by faults,such as profile errors,wear,pitting,spalling,flaking,and cracks.Profile errors in gear t...Geared-rotor systems are critical components in mechanical applications,and their performance can be severely affected by faults,such as profile errors,wear,pitting,spalling,flaking,and cracks.Profile errors in gear teeth are inevitable in manufacturing and subsequently accumulate during operations.This work aims to predict the status of gear profile deviations based on gear dynamics response using the digital model of an experimental rig setup.The digital model comprises detailed CAD models and has been validated against the expected physical behavior using commercial finite element analysis software.The different profile deviations are then modeled using gear charts,and the dynamic response is captured through simulations.The various features are then obtained by signal processing,and various ML models are then evaluated to predict the fault/no-fault condition for the gear.The best performance is achieved by an artificial neural network with a prediction accuracy of 97.5%,which concludes a strong influence on the dynamics of the gear rotor system due to profile deviations.展开更多
基金funded by National Natural Science Foundation of China Key Program(12431014)Key Project of Hunan Education Department(22A0126)+1 种基金Natural Science Foundation of Hunan Province(2022JJ30555)Postgraduate Scientific Research Innovation Project of Xiangtan University(XDCX2024Y172)。
文摘Tropospheric zenith wet delay(ZWD)plays a vital role in the analysis of space geodetic observations.In recent years,machine learning methods have been increasingly applied to improve the accuracy of ZWD calculations.However,a single machine learning model has limited generalization capabilities.To address these limitations,this study introduces a novel machine learning fusion(MLF)algorithm with stronger generalization capabilities to enhance ZWD modeling and prediction accuracy.The MLF algorithm utilizes a two-layer structure integrating extra trees(ET),backpropagation neural network(BPNN),and linear regression models.By comparing the root mean square error(RMSE)of these models,we found that both ET-based and MLF-based models outperform RF-based and BPNN-based models in terms of internal and external accuracy,across both surface meteorological data-based and blind models.The improvement in exte rnal accuracy is particularly significant in the blind models.Our re sults show that the MLF(with an RMSE of 3.93 cm)and ET(3.99 cm)models outperform the traditional GPT3model(4.07 cm),while the RF(4.21 cm)and BPNN(4.14 cm)have worse external accuracies than the GPT3 model.It is worth noting that the BPNN suffered from overfitting during external accuracy tests,which was avoided by the MLF.In summary,regardless of the availability of surface meteorological data,the MLF-based empirical models demonstrate superior internal and external accuracy compared to the other tested models in this study.
基金supported by the special fund of the National Clinical Key Specialty Construction Program[(2022)301-2305].
文摘BACKGROUND:This study aims to develop and validate a machine learning-based in-hospital mortality predictive model for acute aortic syndrome(AAS)in the emergency department(ED)and to derive a simplifi ed version suitable for rapid clinical application.METHODS:In this multi-center retrospective cohort study,AAS patient data from three hospitals were analyzed.The modeling cohort included data from the First Affiliated Hospital of Zhengzhou University and the People’s Hospital of Xinjiang Uygur Autonomous Region,with Peking University Third Hospital data serving as the external test set.Four machine learning algorithms—logistic regression(LR),multilayer perceptron(MLP),Gaussian naive Bayes(GNB),and random forest(RF)—were used to develop predictive models based on 34 early-accessible clinical variables.A simplifi ed model was then derived based on fi ve key variables(Stanford type,pericardial eff usion,asymmetric peripheral arterial pulsation,decreased bowel sounds,and dyspnea)via Least Absolute Shrinkage and Selection Operator(LASSO)regression to improve ED applicability.RESULTS:A total of 929 patients were included in the modeling cohort,and 210 were included in the external test set.Four machine learning models based on 34 clinical variables were developed,achieving internal and external validation AUCs of 0.85-0.90 and 0.73-0.85,respectively.The simplifi ed model incorporating fi ve key variables demonstrated internal and external validation AUCs of 0.71-0.86 and 0.75-0.78,respectively.Both models showed robust calibration and predictive stability across datasets.CONCLUSION:Both kinds of models were built based on machine learning tools,and proved to have certain prediction performance and extrapolation.
基金supported by the Advanced Materials-National Science and Technology Major Project(Grant No.2025ZD0618401)the National Natural Science Foundation of China(Grant No.12504285)+1 种基金the Natural Science Foundation of Jiangsu Province(Grant No.BK20250472)NFSG grant from BITS-Pilani,Dubai campus。
文摘The rapid advancement of machine learning based tight-binding Hamiltonian(MLTB)methods has opened new avenues for efficient and accurate electronic structure simulations,particularly in large-scale systems and long-time scenarios.This review begins with a concise overview of traditional tight-binding(TB)models,including both(semi-)empirical and first-principles approaches,establishing the foundation for understanding MLTB developments.We then present a systematic classification of existing MLTB methodologies,grouped into two major categories:direct prediction of TB Hamiltonian elements and inference of empirical parameters.A comparative analysis with other ML-based electronic structure models is also provided,highlighting the advancement of MLTB approaches.Finally,we explore the emerging MLTB application ecosystem,highlighting how the integration of MLTB models with a diverse suite of post-processing tools from linear-scaling solvers to quantum transport frameworks and molecular dynamics interfaces is essential for tackling complex scientific problems across different domains.The continued advancement of this integrated paradigm promises to accelerate materials discovery and open new frontiers in the predictive simulation of complex quantum phenomena.
文摘This study aims to develop an accurate and robust machine learning model to predict the carbonation depth of fly ash concrete,overcoming the limitations of traditional predictive methods.Five ensemble-based models,such as adaptive boosting(AdaBoost),categorical boosting(CatBoost),gradient boosting regressor(GBR),hist gradient boosting regressor(HistGBR),and extreme gradient boosting(XGBoost),were developed and optimized using 729 high-quality dataset points incorporating seven input parameters,including cement,CO_(2),exposure time,water-binder ratio,fly ash,curing time,and compressive strength.Several performance evaluation metrics were used to compare the models.The GBR model emerged as the best-performing model,based on high coefficient of determination(R^(2))values and balanced error metrics across both validation and testing datasets.While all models performed exceptionally well on the training data,GBR demonstrated superior generalization capability,with R^(2) values of 0.9438 on the validation set and 0.9310 on the testing set.Furthermore,its low mean squared error(MSE),root mean square error(RMSE),mean absolute error(MAE),and median absolute error(MdAE)confirmed its robustness and accuracy.Moreover,shapley additive explanations(SHAP)analysis enhanced the interpretability of predictions,highlighting the curing time and exposure time as the most critical drivers of carbonation depth.
基金financial support of the National Natural Science Foundation of China(No.52371103)the Fundamental Research Funds for the Central Universities,China(No.2242023K40028)+1 种基金the Open Research Fund of Jiangsu Key Laboratory for Advanced Metallic Materials,China(No.AMM2023B01).financial support of the Research Fund of Shihezi Key Laboratory of AluminumBased Advanced Materials,China(No.2023PT02)financial support of Guangdong Province Science and Technology Major Project,China(No.2021B0301030005)。
文摘Oxide dispersion strengthened(ODS)alloys are extensively used owing to high thermostability and creep strength contributed from uniformly dispersed fine oxides particles.However,the existence of these strengthening particles also deteriorates the processability and it is of great importance to establish accurate processing maps to guide the thermomechanical processes to enhance the formability.In this study,we performed particle swarm optimization-based back propagation artificial neural network model to predict the high temperature flow behavior of 0.25wt%Al2O3 particle-reinforced Cu alloys,and compared the accuracy with that of derived by Arrhenius-type constitutive model and back propagation artificial neural network model.To train these models,we obtained the raw data by fabricating ODS Cu alloys using the internal oxidation and reduction method,and conducting systematic hot compression tests between 400 and800℃with strain rates of 10^(-2)-10 S^(-1).At last,processing maps for ODS Cu alloys were proposed by combining processing parameters,mechanical behavior,microstructure characterization,and the modeling results achieved a coefficient of determination higher than>99%.
基金supported by the National Key Research and Development Program of China(2024YFE0213000)the Postdoctoral Innovative Talents Support Program(BX20240232)+1 种基金the Natural Science Foundation of China for Young Scholars(72304031)the Fundamental Research Funds for the Central Universities(FRF-TP-22-024A1).
文摘Lithium-ion batteries(LIBs)are widely deployed,from grid-scale storage to electric vehicles.LIBs remain stationary most of their service life,where calendar aging degrades capacity.Understanding the mechanisms of LIB calendar aging is crucial for extending battery lifespan.However,LIB calendar aging is influenced by multiple factors,including battery material,its state,and storage environment.Calendar aging experiments are also time-consuming,costly,and lack standardized testing conditions.This study employs a data-driven approach to establish a cross-scale database linking materials,side-reaction mechanisms,and calendar aging of LIBs.MELODI(Mechanism-informed,Explainable,Learning-based Optimization for Degradation Identification)is proposed to identify calendar aging mechanisms and quantify the effects of multi-scale factors.Results reveal that cathode material loss drives up to 91.42%of calendar aging degradation in high-nickel(Ni)batteries,while solid electrolyte interphase growth dominates in lithium iron phosphate(LFP)and low-Ni batteries,contributing up to 82.43%of degradation in LFP batteries and 99.10%of decay in low-Ni batteries,respectively.This study systematically quantifies calendar aging in commercial LIBs under varying materials,states of charge,and temperatures.These findings offer quantitative guidance for experimental design or battery use,and implications for emerging applications like aerial robotics,vehicle-to-grid,and embodied intelligence systems.
基金supported by the National Key Research and Development Program of China(No.2024YFF0508600)the National Natural Science Foundation of China(Nos.22305081,32571562 and 52572303)+2 种基金Leading Talents in Shanghai in 2018,Shanghai Sailing Program(No.23YF1408600)111 project(No.B14018)Fundamental Research Funds for the Central Universities of China。
文摘The efficient and safe delivery of messenger RNA(m RNA)therapeutics remains a critical challenge for clinical translation,driving the need for advanced carrier design.Ionizable amphiphilic Janus dendrimers(IAJDs)represent a promising class of carriers;however,their structural complexity and limited available datasets hinder systematic exploration and optimization.In this study,we established a tailored machinelearning framework to investigate the structure-function relationships of IAJDs under a constrained data regime(n=231).Conventional molecular fingerprints were found to be suboptimal for representing these macromolecules,motivating the adoption of count-based descriptors and systematic ablation analyses to disentangle the contributions of the substructural features.These experiments identified key functional motifs underlying transfection performance and provided interpretable insights into the IAJD design principles.Complementing these handcrafted descriptors,we further applied deep learning-based molecular embeddings,which captured higher-order chemical semantics and significantly improved predictive accuracy.Collectively,these advances demonstrate that both refined fingerprinting and representation learning approaches can overcome data limitations,enabling the reliable prediction of IAJD activity while offering mechanistic interpretability.This study illustrates the potential of data-driven strategies as hypothesis-generation and prioritization tools for the design of next-generation m RNA delivery systems.
基金China Scholarship Council(CSC)for providing a fully funded post-graduate study in institute of mountain hazards and environment UCASsupported by the National Natural Science Foundation of China(Grant Nos.42361144880)+3 种基金the Science and Technology Program of Xizang(Grant No.XZ202402ZD0001)the Basic Research Program of Qinghai Province(2024-ZJ-904)the Postdoctoral Fellowship Programs of CPSF(Grant Nos.GZC20232571,2024M753153)the International Cooperation Overseas Platform Project,CAS(Grant No.131C11KYSB20200033).
文摘Landslides pose a significant threat in the mountainous regions of Nepal.Landslide susceptibility maps are commonly used to identify potential landslide zones by statistically analyzing geological,topographical,and hydrological factors,assuming that similar conditions may trigger future failures.While such maps provide valuable insights into landslide-triggering conditions,they are limited in assessing risk to settlements and infrastructure located downslope or in valley bottoms.This study integrates machine learning based landslide susceptibility with numerical runout modeling to provide a comprehensive landslide hazard assessment in the Bhotekoshi watershed,overcoming the limitations of traditional models that focus solely on statistical susceptibility.To conduct the susceptibility analysis,a total of 439 landslides were mapped from 2012 to 2021 using satellite images.Of these,70%were used for training two machine learning(ML)models:random forest and Xtreme Gradient Boosting(XGBoost),and the remaining 30%were used for validation.Among the two ML models,Random Forest model demonstrated slightly superior performance,achieving higher predictive accuracy.After the machine learning susceptibility analysis,the study transitions into a regional-scale landslide runout analysis.First,a back analysis of the past landslide event was conducted to fine-tune the model parameters(internal angle of friction and basal friction angle)and validate performance of the runout model.Following the back analysis,the regional-scale numerical modeling of landslide runout was conducted by designating areas classified as the highest susceptibility class in the Random Forest susceptibility map as potential release zones.This approach allows for a detailed examination of landslide propagation and potential impacts along the downslope settlements and infrastructures.The analysis clearly demonstrates that integrating both machine learning and numerical runout methods significantly increases the estimated exposure of population,buildings,and roads within the very high hazard class compared to relying solely on susceptibility methods.Specifically,population exposure rises from 360 to 7743,buildings increase from 97 to 2771,and road exposure expands from 41 to 251 km.This result highlights the significant risk of underestimating exposure in the analyses that solely rely on landslide susceptibility models.Integration of susceptibility and runout analysis improves landslide risk assessment,aiding in land-use planning and disaster mitigation strategies.
文摘This study investigates the uncertain dynamic characterization of hybrid composite plates by employing advanced machine-assisted finite element methodologies.Hybrid composites,widely used in aerospace,automotive,and structural applications,often face variability in material properties,geometric configurations,and manufacturing processes,leading to uncertainty in their dynamic response.To address this,three surrogate-based machine learning approaches like radial basis function(RBF),multivariate adaptive regression splines(MARS),and polynomial neural networks(PNN)are integrated with a finite element framework to efficiently capture the stochastic behavior of these plates.The research focuses on predicting the first three natural frequencies under material uncertainties,which are critical to ensuring structural reliability.Monte Carlo simulation(MCS)is used as a benchmark for generating probabilistic datasets,including mean values,standard deviations,and probability density functions.The surrogate models are then trained and validated against these datasets,enabling accurate representation of uncertainty with substantially fewer samples compared to conventionalMCS.Among the methods studied,the RBFmodel demonstrates superior performance,closely approximating MCS results with a reduced sample size,thereby achieving significant computational savings.The proposed framework not only reduces computational time and costs but also maintains high predictive accuracy,making it well-suited for complex engineering systems.Beyond free vibration analysis,the methodology can be extended to more sophisticated scenarios,such as forced vibration,damping effects,and nonlinear structural responses.Overall,this work presents a computationally efficient and robust approach for surrogate-based uncertainty quantification,advancing the analysis and design of hybrid composite structures under uncertainty.
文摘Accurate prediction of concrete compressive strength is fundamental for optimizing mix designs,improving material utilization,and ensuring structural safety in modern construction.Traditional empirical methods often fail to capture the non-linear relationships among concrete constituents,especially with the growing use of supple-mentary cementitious materials and recycled aggregates.This study presents an integrated machine learning framework for concrete strength prediction,combining advanced regression models—namely CatBoost—with metaheuristic optimization algorithms,with a particular focus on the Somersaulting Spider Optimizer(SSO).A comprehensive dataset encompassing diverse mix proportions and material types was used to evaluate baseline machine learning models,including CatBoost,XGBoost,ExtraTrees,and RandomForest.Among these,CatBoost demonstrated superior accuracy across multiple performance metrics.To further enhance predictive capability,several bio-inspired optimizers were employed for hyperparameter tuning.The SSO-CatBoost hybrid achieved the lowest mean squared error and highest correlation coefficients,outperforming other metaheuristic approaches such as Genetic Algorithm,Particle Swarm Optimization,and Grey Wolf Optimizer.Statistical significance was established through Analysis of Variance and Wilcoxon signed-rank testing,confirming the robustness of the optimized models.The proposed methodology not only delivers improved predictive performance but also offers a transparent framework for mix design optimization,supporting data-driven decision making in sustainable and resilient infrastructure development.
基金National Key Research and Development Program of China,No.2023YFC3006704National Natural Science Foundation of China,No.42171047CAS-CSIRO Partnership Joint Project of 2024,No.177GJHZ2023097MI。
文摘Accurate prediction of flood events is important for flood control and risk management.Machine learning techniques contributed greatly to advances in flood predictions,and existing studies mainly focused on predicting flood resource variables using single or hybrid machine learning techniques.However,class-based flood predictions have rarely been investigated,which can aid in quickly diagnosing comprehensive flood characteristics and proposing targeted management strategies.This study proposed a prediction approach of flood regime metrics and event classes coupling machine learning algorithms with clustering-deduced membership degrees.Five algorithms were adopted for this exploration.Results showed that the class membership degrees accurately determined event classes with class hit rates up to 100%,compared with the four classes clustered from nine regime metrics.The nonlinear algorithms(Multiple Linear Regression,Random Forest,and least squares-Support Vector Machine)outperformed the linear techniques(Multiple Linear Regression and Stepwise Regression)in predicting flood regime metrics.The proposed approach well predicted flood event classes with average class hit rates of 66.0%-85.4%and 47.2%-76.0%in calibration and validation periods,respectively,particularly for the slow and late flood events.The predictive capability of the proposed prediction approach for flood regime metrics and classes was considerably stronger than that of hydrological modeling approach.
基金supported by the National Natural Science Foundation of China(42471049).
文摘Rainfall input errors are a major source of uncertainty in flood forecasting,and merging multi-source precipitation data is essential for improving accuracy.Traditional merging methods often prioritize precipitation magnitude enhancements while overlooking event detection and false alarms.To address these limitations,this study developed a precipitation integration framework that combines machine learning classification-plus-regression models with Bayesian model averaging(BMA).Three machine learning algorithms-categorical boosting(CatBoost),light gradient boosting machine(LightGBM),and random forest(RF)-were used to improve precipitation event detection.The framework includes spatial unification of raw satellite products using bilinear interpolation,bias correction through classification-plus-regression models,and final merging via a seasonal-scale BMA model.The method integrated GSMaP,IMERG,and PERSIANN satellite precipitation products,with ground observations used for model training(2001-2014)and independent validation(2015-2020)in the Upper Ganjiang River Basin,China.Results showed that the framework significantly enhanced precipitation estimation accuracy and detection capability.LightGBM-based integration exhibited superior detection performance(FAR=0.08,CSI=0.86),while RF-based integration achieved the highest overall accuracy(RMSE=4.67,CC=0.92).Seasonal variations in BMA weights underscored the need to account for seasonal characteristics of precipitation products.Additionally,accuracy improvements were observed across all rainfall categories,especially for heavy rainstorms.The seasonal-scale BMA fusion has combined the strengths of individual corrections and further enhanced precipitation estimation.This research offers a robust method for generating accurate rainfall inputs,providing valuable support for hydrological modeling and flood forecasting applications.
文摘Post-kidney transplant rejection is a critical factor influencing transplant success rates and the survival of transplanted organs.With the rapid advancement of artificial intelligence technologies,machine learning(ML)has emerged as a powerful data analysis tool,widely applied in the prediction,diagnosis,and mechanistic study of kidney transplant rejection.This mini-review systematically summarizes the recent applications of ML techniques in post-kidney transplant rejection,covering areas such as the construction of predictive models,identification of biomarkers,analysis of pathological images,assessment of immune cell infiltration,and formulation of personalized treatment strategies.By integrating multi-omics data and clinical information,ML has significantly enhanced the accuracy of early rejection diagnosis and the capability for prognostic evaluation,driving the development of precision medicine in the field of kidney transplantation.Furthermore,this article discusses the challenges faced in existing research and potential future directions,providing a theoretical basis and technical references for related studies.
基金supported by the Elderly Health Research Project of Jiangsu Province,No.LKM2023043(to XZ).
文摘Alterations in RNA methylation may affect the initiation and development of Alzheimer’s disease.However,the exact nature of the relationship between RNA methylation and Alzheimer’s disease remains unclear.In this study,RNA methylation levels were analyzed by bulk transcriptomic and single-cell RNA sequencing.The expression levels of RNA methylation regulators were confirmed using molecular biology techniques.Co-expression network analysis was used to identify relevant long non-coding RNAs.Molecular subtypes related to RNA methylation were classified,and variations in clinical characteristics,biological behavior,and immune signatures between subtypes were assessed.Machine learning approaches were applied to identify methylation-associated long non-coding RNAs,which were used to construct a risk model and nomogram for Alzheimer’s disease.Potential therapeutic agents for different risk groups were predicted,and in vitro experiments were conducted to identify key RNA methylation events.Single-cell analysis demonstrated enhanced RNA methylation in patients with Alzheimer’s disease,particularly within T cells,B cells,and NK cells.Quantitative reverse transcription-polymerase chain reaction and western blot confirmed alterations in RNA methylation regulators in neurons treated with amyloid-βoligomers in vitro.This evidence supported the classification of patients with Alzheimer’s disease into heterogeneous subtypes.Specifically,subtype 1 was identified as the immune-active subtype,while subtype 2 was characterized by a metabolic phenotype.Machine learning algorithms identified five significant methylation-associated long non-coding RNAs-LINC01007,MAP4K3-DT,MIR302CHG,VAC14-AS1,and TGFB2-OT1-that accurately predict clinical outcomes for patients with Alzheimer’s disease.These patients were classified into low-and high-risk categories;the latter group displayed higher immune infiltration,upregulated immune regulatory gene expression,and elevated immune scores and responded better to treatment with arachidonic-trifluoroethane.These findings suggest that dysregulated RNA methylation alters the immune microenvironment in Alzheimer’s disease and is closely associated with its progression.This phenomenon provides novel insights into potential therapeutic strategies for Alzheimer’s disease that target RNA methylation.
基金supported by the Key-Area Research and Development Program of Guangdong Province(No.2020B1111360003)the National Natural Science Foundation of China(Nos.42465008 and 42105164)+2 种基金Yunnan Science and Technology Department Project(No.202501AT070239)Yunnan Science and Technology Department Youth Project(No.202401AU070202)Xianyang Rapid Response Decision Support Project for Ozone(No.YZ2024-ZB019).
文摘To curb the worsening tropospheric ozone(O_(3))pollution problem in China,a rapid and accurate identification of O_(3)-precursor sensitivity(OPS)is a crucial prerequisite for formulating effective contingency O_(3) pollution control strategies.However,currently widely-used methods,such as statistical models and numerical models,exhibit inherent limitations in identifying OPS in a timely and accurate manner.In this study,we developed a novel approach to identify OPS based on eXtreme Gradient Boosting model,Shapley additive explanation(SHAP)al-gorithm,and volatile organic compound(VOC)photochemical decay adjustment,using the meteorology and speciated pollutant monitoring data as the input.By comparing the difference in SHAP values between base sce-nario and precursor reduction scenario for nitrogen oxides(NO_(x))and VOCs,OPS was divided into NO_(x)-limited,VOCs-limited and transition regime.Using the long-lasting O_(3) pollution episode in the autumn of 2022 at the Guangdong-Hong Kong-Macao Greater Bay Area(GBA)as an example,we demonstrated large spatiotemporal heterogeneities of OPS over the GBA,which were generally shifted from NO_(x)-limited to VOCs-limited from September to October and more inclined to be VOCs-limited at the central and NO_(x)-limited in the peripheral areas.This study developed an innovative OPS identification method by comparing the difference in SHAP value before and after precursor emission reduction.Our method enables the accurate identification of OPS in the time scale of seconds,thereby providing a state-of-the-art tool for the rapid guidance of spatial-specific O_(3) control strategies.
文摘BACKGROUND Colorectal polyps are precancerous diseases of colorectal cancer.Early detection and resection of colorectal polyps can effectively reduce the mortality of colorectal cancer.Endoscopic mucosal resection(EMR)is a common polypectomy proce-dure in clinical practice,but it has a high postoperative recurrence rate.Currently,there is no predictive model for the recurrence of colorectal polyps after EMR.AIM To construct and validate a machine learning(ML)model for predicting the risk of colorectal polyp recurrence one year after EMR.METHODS This study retrospectively collected data from 1694 patients at three medical centers in Xuzhou.Additionally,a total of 166 patients were collected to form a prospective validation set.Feature variable screening was conducted using uni-variate and multivariate logistic regression analyses,and five ML algorithms were used to construct the predictive models.The optimal models were evaluated based on different performance metrics.Decision curve analysis(DCA)and SHapley Additive exPlanation(SHAP)analysis were performed to assess clinical applicability and predictor importance.RESULTS Multivariate logistic regression analysis identified 8 independent risk factors for colorectal polyp recurrence one year after EMR(P<0.05).Among the models,eXtreme Gradient Boosting(XGBoost)demonstrated the highest area under the curve(AUC)in the training set,internal validation set,and prospective validation set,with AUCs of 0.909(95%CI:0.89-0.92),0.921(95%CI:0.90-0.94),and 0.963(95%CI:0.94-0.99),respectively.DCA indicated favorable clinical utility for the XGBoost model.SHAP analysis identified smoking history,family history,and age as the top three most important predictors in the model.CONCLUSION The XGBoost model has the best predictive performance and can assist clinicians in providing individualized colonoscopy follow-up recommendations.
基金funded through India Meteorological Department,New Delhi,India under the Forecasting Agricultural output using Space,Agrometeorol ogy and Land based observations(FASAL)project and fund number:No.ASC/FASAL/KT-11/01/HQ-2010.
文摘Background Cotton is one of the most important commercial crops after food crops,especially in countries like India,where it’s grown extensively under rainfed conditions.Because of its usage in multiple industries,such as textile,medicine,and automobile industries,it has greater commercial importance.The crop’s performance is greatly influenced by prevailing weather dynamics.As climate changes,assessing how weather changes affect crop performance is essential.Among various techniques that are available,crop models are the most effective and widely used tools for predicting yields.Results This study compares statistical and machine learning models to assess their ability to predict cotton yield across major producing districts of Karnataka,India,utilizing a long-term dataset spanning from 1990 to 2023 that includes yield and weather factors.The artificial neural networks(ANNs)performed superiorly with acceptable yield deviations ranging within±10%during both vegetative stage(F1)and mid stage(F2)for cotton.The model evaluation metrics such as root mean square error(RMSE),normalized root mean square error(nRMSE),and modelling efficiency(EF)were also within the acceptance limits in most districts.Furthermore,the tested ANN model was used to assess the importance of the dominant weather factors influencing crop yield in each district.Specifically,the use of morning relative humidity as an individual parameter and its interaction with maximum and minimum tempera-ture had a major influence on cotton yield in most of the yield predicted districts.These differences highlighted the differential interactions of weather factors in each district for cotton yield formation,highlighting individual response of each weather factor under different soils and management conditions over the major cotton growing districts of Karnataka.Conclusions Compared with statistical models,machine learning models such as ANNs proved higher efficiency in forecasting the cotton yield due to their ability to consider the interactive effects of weather factors on yield forma-tion at different growth stages.This highlights the best suitability of ANNs for yield forecasting in rainfed conditions and for the study on relative impacts of weather factors on yield.Thus,the study aims to provide valuable insights to support stakeholders in planning effective crop management strategies and formulating relevant policies.
基金Supported by Government Assignment,No.1023022600020-6RSF Grant,No.24-15-00549Ministry of Science and Higher Education of the Russian Federation within the Framework of State Support for the Creation and Development of World-Class Research Center,No.075-15-2022-304.
文摘BACKGROUND Ischemic heart disease(IHD)impacts the quality of life and has the highest mortality rate of cardiovascular diseases globally.AIM To compare variations in the parameters of the single-lead electrocardiogram(ECG)during resting conditions and physical exertion in individuals diagnosed with IHD and those without the condition using vasodilator-induced stress computed tomography(CT)myocardial perfusion imaging as the diagnostic reference standard.METHODS This single center observational study included 80 participants.The participants were aged≥40 years and given an informed written consent to participate in the study.Both groups,G1(n=31)with and G2(n=49)without post stress induced myocardial perfusion defect,passed cardiologist consultation,anthropometric measurements,blood pressure and pulse rate measurement,echocardiography,cardio-ankle vascular index,bicycle ergometry,recording 3-min single-lead ECG(Cardio-Qvark)before and just after bicycle ergometry followed by performing CT myocardial perfusion.The LASSO regression with nested cross-validation was used to find the association between Cardio-Qvark parameters and the existence of the perfusion defect.Statistical processing was performed with the R programming language v4.2,Python v.3.10[^R],and Statistica 12 program.RESULTS Bicycle ergometry yielded an area under the receiver operating characteristic curve of 50.7%[95%confidence interval(CI):0.388-0.625],specificity of 53.1%(95%CI:0.392-0.673),and sensitivity of 48.4%(95%CI:0.306-0.657).In contrast,the Cardio-Qvark test performed notably better with an area under the receiver operating characteristic curve of 67%(95%CI:0.530-0.801),specificity of 75.5%(95%CI:0.628-0.88),and sensitivity of 51.6%(95%CI:0.333-0.695).CONCLUSION The single-lead ECG has a relatively higher diagnostic accuracy compared with bicycle ergometry by using machine learning models,but the difference was not statistically significant.However,further investigations are required to uncover the hidden capabilities of single-lead ECG in IHD diagnosis.
基金financial support from the National Key Research and Development Program of China(2021YFB 3501501)the National Natural Science Foundation of China(No.22225803,22038001,22108007 and 22278011)+1 种基金Beijing Natural Science Foundation(No.Z230023)Beijing Science and Technology Commission(No.Z211100004321001).
文摘The high porosity and tunable chemical functionality of metal-organic frameworks(MOFs)make it a promising catalyst design platform.High-throughput screening of catalytic performance is feasible since the large MOF structure database is available.In this study,we report a machine learning model for high-throughput screening of MOF catalysts for the CO_(2) cycloaddition reaction.The descriptors for model training were judiciously chosen according to the reaction mechanism,which leads to high accuracy up to 97%for the 75%quantile of the training set as the classification criterion.The feature contribution was further evaluated with SHAP and PDP analysis to provide a certain physical understanding.12,415 hypothetical MOF structures and 100 reported MOFs were evaluated under 100℃ and 1 bar within one day using the model,and 239 potentially efficient catalysts were discovered.Among them,MOF-76(Y)achieved the top performance experimentally among reported MOFs,in good agreement with the prediction.
文摘Geared-rotor systems are critical components in mechanical applications,and their performance can be severely affected by faults,such as profile errors,wear,pitting,spalling,flaking,and cracks.Profile errors in gear teeth are inevitable in manufacturing and subsequently accumulate during operations.This work aims to predict the status of gear profile deviations based on gear dynamics response using the digital model of an experimental rig setup.The digital model comprises detailed CAD models and has been validated against the expected physical behavior using commercial finite element analysis software.The different profile deviations are then modeled using gear charts,and the dynamic response is captured through simulations.The various features are then obtained by signal processing,and various ML models are then evaluated to predict the fault/no-fault condition for the gear.The best performance is achieved by an artificial neural network with a prediction accuracy of 97.5%,which concludes a strong influence on the dynamics of the gear rotor system due to profile deviations.