BACKGROUND Laparoscopic distal pancreatectomy(LDP)has emerged as the preferred approach for both benign and malignant lesions located in the pancreatic body and tail.Nevertheless,a notable deficiency persists in the a...BACKGROUND Laparoscopic distal pancreatectomy(LDP)has emerged as the preferred approach for both benign and malignant lesions located in the pancreatic body and tail.Nevertheless,a notable deficiency persists in the absence of a standardized,procedure-specific metric for evaluating and comparing surgical quality.A composite measure termed“textbook outcome(TO)”,which encompasses key short-term endpoints,has been validated in laparoscopic pancreatoduodenectomy but has not yet been established in dedicated LDP cohorts.The definition and prediction of TO in this context could aid in facilitating cross-institutional benchmarking and fostering advancements in quality improvement.AIM To establish procedure-specific criteria for TO and identify independent predictors of TO failure in patients undergoing LDP.METHODS Consecutive patients who underwent LDP at a single high-volume pancreatic center between January 2015 and August 2022 were retrospectively analyzed.TO was defined as the absence of clinically relevant postoperative pancreatic fistula(grade B/C),post-pancreatectomy hemorrhage(grade B/C),severe complications(Clavien-Dindo≥III),readmission within 30 days,and in-hospital or 30-day mortality.Multivariable logistic regression was employed to identify independent predictors of TO failure,and a nomogram was constructed and internally validated.RESULTS Among 405 eligible patients,286(70.6%)attained TO.Multivariable analysis revealed that female sex[odds ratio(OR)=0.62,95%confidence interval(CI):0.39-0.99]conferred a protective effect,while preoperative endoscopic ultrasound-guided fine-needle aspiration(OR=2.66,95%CI:1.05-6.73),pancreatic portal hypertension(OR=2.81,95%CI:1.06-7.45),and cystic-solid(OR=2.51,95%CI:1.34-4.69)or solid lesions(OR=1.91,95%CI:1.06-3.44)were independently associated with TO failure(all P<0.05).The derived nomogram exhibited modest discrimination and calibration when assessed in both the training and validation datasets.CONCLUSION The proposed LDP-specific definition of TO is feasible and discriminative,and the developed nomogram provides an objective tool for individualized risk assessment.展开更多
Accurately predicting environmental parameters in solar greenhouses is crucial for achieving precise environmental control.In solar greenhouses,temperature,humidity,and light intensity are crucial environmental parame...Accurately predicting environmental parameters in solar greenhouses is crucial for achieving precise environmental control.In solar greenhouses,temperature,humidity,and light intensity are crucial environmental parameters.The monitoring platform collected data on the internal environment of the solar greenhouse for one year,including temperature,humidity,and light intensity.Additionally,meteorological data,comprising outdoor temperature,outdoor humidity,and outdoor light intensity,was gathered during the same time frame.The characteristics and interrelationships among these parameters were investigated by a thorough analysis.The analysis revealed that environmental parameters in solar greenhouses displayed characteristics such as temporal variability,non-linearity,and periodicity.These parameters exhibited complex coupling relationships.Notably,these characteristics and coupling relationships exhibited pronounced seasonal variations.The multi-parameter multi-step prediction model for solar greenhouse(MPMS-SGH)was introduced,aiming to accurately predict three key greenhouse environmental parameters,and the model had certain seasonal adaptability.MPMS-SGH was structured with multiple layers,including an input layer,a preprocessing layer,a feature extraction layer,and a prediction layer.The input layer was used to generate the original sequence matrix,which included indoor temperature,indoor humidity,indoor light intensity,as well as outdoor temperature and outdoor light intensity.Then the preprocessing layer normalized,decomposed,and positionally encoded the original sequence matrix.In the feature extraction layer,the time attention mechanism and frequency attention mechanism were used to extract features from the trend component and the seasonal component,respectively.Finally,the prediction layer used a multi-layer perceptron to perform multi-step prediction of indoor environmental parameters(i.e.temperature,humidity,and light intensity).The parameter selection experiment evaluated the predictive performance of MPMS-SGH on input and output sequences of different lengths.The results indicated that with a constant output sequence length,the prediction accuracy of MPMS-SGH was firstly increased and then decreased with the increase of input sequence length.Specifically,when the input sequence length was 100,MPMS-SGH had the highest prediction accuracy,with RMSE of 0.22℃,0.28%,and 250lx for temperature,humidity,and light intensity,respectively.When the length of the input sequence remained constant,as the length of the output sequence increased,the accuracy of the model in predicting the three environmental parameters was continuously decreased.When the length of the output sequence exceeded 45,the prediction accuracy of MPMS-SGH was significantly decreased.In order to achieve the best balance between model size and performance,the input sequence length of MPMS-SGH was set to be 100,while the output sequence length was set to be 35.To assess MPMS-SGH’s performance,comparative experiments with four prediction models were conducted:SVR,STL-SVR,LSTM,and STL-LSTM.The results demonstrated that MPMS-SGH surpassed all other models,achieving RMSE of 0.15℃for temperature,0.38%for humidity,and 260lx for light intensity.Additionally,sequence decomposition can contribute to enhancing MPMS-SGH’s prediction performance.To further evaluate MPMS-SGH’s capabilities,its prediction accuracy was tested across different seasons for greenhouse environmental parameters.MPMS-SGH had the highest accuracy in predicting indoor temperature and the lowest accuracy in predicting humidity.And the accuracy of MPMS-SGH in predicting environmental parameters of the solar greenhouse fluctuated with seasons.MPMS-SGH had the highest accuracy in predicting the temperature inside the greenhouse on sunny days in spring(R^(2)=0.91),the highest accuracy in predicting the humidity inside the greenhouse on sunny days in winter(R^(2)=0.83),and the highest accuracy in predicting the light intensity inside the greenhouse on cloudy days in autumm(R^(2)=0.89).MPMS-SGH had the lowest accuracy in predicting three environmental parameters in a sunny summer greenhouse.展开更多
Due to its synergistic effects and reduced side effects,combination therapy has become an important strategy for treating complex diseases.In traditional Chinese medicine(TCM),the“monarch,minister,assistant,envoy”co...Due to its synergistic effects and reduced side effects,combination therapy has become an important strategy for treating complex diseases.In traditional Chinese medicine(TCM),the“monarch,minister,assistant,envoy”compatibilities theory provides a systematic framework for drug compatibility and has guided the formation of a large number of classic formulas.However,due to the complex compositions and diverse mechanisms of action of TCM,it is difficult to comprehensively reveal its potential synergistic patterns using traditional methods.Synergistic prediction based on molecular compatibility theory provides new ideas for identifying combinations of active compounds in TCM.Compared to resource-intensive traditional experimental methods,artificial intelligence possesses the ability to mine synergistic patterns from multi-omics and structural data,providing an efficient means for modeling and optimizing TCM combinations.This paper systematically reviews the application progress of AI in the synergistic prediction of TCM active compounds and explores the challenges and prospects of its application in modeling combination relationships,thereby contributing to the modernization of TCM theory and methodological innovation.展开更多
Predicting the productivity of multistage fractured horizontal wells plays an important role in exploiting unconventional resources.In recent years,machine learning(ML)models have emerged as a new approach for such st...Predicting the productivity of multistage fractured horizontal wells plays an important role in exploiting unconventional resources.In recent years,machine learning(ML)models have emerged as a new approach for such studies.However,the scarcity of sufficient real data for model training often leads to imprecise predictions,even though the models trained with real data better characterize geological and engineering features.To tackle this issue,we propose an ML model that can obtain reliable results even with a small amount of data samples.Our model integrates the synthetic minority oversampling technique(SMOTE)to expand the data volume,the support vector machine(SVM)for model training,and the particle swarm optimization(PSO)algorithm for optimizing hyperparameters.To enhance the model performance,we conduct feature fusion and dimensionality reduction.Additionally,we examine the influences of different sample sizes and ML models for training.The proposed model demonstrates higher prediction accuracy and generalization ability,achieving a predicted R^(2)value of up to 0.9 for the test set,compared to the traditional ML techniques with an R^(2)of 0.13.This model accurately predicts the production of fractured horizontal wells even with limited samples,supplying an efficient tool for optimizing the production of unconventional resources.Importantly,the model holds the potential applicability to address similar challenges in other fields constrained by scarce data samples.展开更多
Higher education institutions are becoming increasingly concerned with the retention of their students.This work is motivated by the interest in predicting and reducing student dropout,and consequently in reducing the...Higher education institutions are becoming increasingly concerned with the retention of their students.This work is motivated by the interest in predicting and reducing student dropout,and consequently in reducing the financial losses of said institutions.Based on the characterization of the dropout problem and the application of a knowledge discovery process,an ensemble model is proposed to improve dropout prediction.The ensemble model combines the results of three models:logistic regression,neural networks,and decision tree.As a result,the model can correctly classify 89%of the students as enrolled or dropped and accurately identify 98.1%of dropouts.When compared with the Random Forest ensemble method,the proposed model demonstrates desirable characteristics to assist management in proposing actions to retain students.展开更多
Permeable roads generally exhibit inferior mechanical properties and shorter service life than traditional dense-graded/impermeable roads.Furthermore,the incorporation of recycled aggregates in their construction may ...Permeable roads generally exhibit inferior mechanical properties and shorter service life than traditional dense-graded/impermeable roads.Furthermore,the incorporation of recycled aggregates in their construction may exacerbate these limitations.To address these issues,this study introduced a novel cement-stabilized permeable recycled aggregate material.A total of 162 beam specimens prepared with nine different levels of cement-aggregate ratio were tested to evaluate their permeability,bending load,and bending fatigue life.The experimental results indicate that increasing the content of recycled aggregates led to a reduction in both permeability and bending load.Additionally,the inclusion of recycled aggregates diminished the energy dissipation capacity of the specimens.These findings were used to establish a robust relationship between the initial damage in cement-stabilized permeable recycled aggregate material specimens and their fatigue life,and to propose a predictive model for their fatigue performance.Further,a method for assessing fatigue damage based on the evolution of fatigue-induced strain and energy dissipation was developed.The findings of this study provide valuable insights into the mechanical behavior and fatigue performance of cement-stabilized permeable recycled aggregate materials,offering guidance for the design of low-carbon-emission,permeable,and durable roadways incorporating recycled aggregates.展开更多
In most agricultural areas in the semi-arid region of the southern United States, wheat (Triticum aestivum L.) production is a primary economic activity. This region is drought-prone and projected to have a drier clim...In most agricultural areas in the semi-arid region of the southern United States, wheat (Triticum aestivum L.) production is a primary economic activity. This region is drought-prone and projected to have a drier climate in the future. Predicting the yield loss due to an anticipated drought is crucial for wheat growers. A reliable way for predicting the drought-induced yield loss is to use a plant physiology-based drought index, such as Agricultural Reference Index for Drought (ARID). Since different wheat cultivars exhibit varying levels of sensitivity to water stress, the impact of drought could be different on the cultivars belonging to different drought sensitivity groups. The objective of this study was to develop the cultivar drought sensitivity (CDS) group-specific, ARID-based models for predicting the drought-induced yield loss of winter wheat in the Llano Estacado region in the southern United States by accounting for the phenological phase-specific sensitivity to drought. For the study, the historical (1947-2021) winter wheat grain yield and daily weather data of two locations in the region (Bushland, TX and Clovis, NM) were used. The logical values of the drought sensitivity parameters of the yield models, especially for the moderately-sensitive and highly-sensitive CDS groups, indicated that the yield models reflected the phenomenon of water stress decreasing the winter wheat yields in this region satisfactorily. The reasonable values of the Nash-Sutcliffe Index (0.65 and 0.72), the Willmott Index (0.88 and 0.92), and the percentage error (23 and 22) for the moderately-sensitive and highly-sensitive CDS groups, respectively, indicated that the yield models for these groups performed reasonably well. These models could be useful for predicting the drought-induced yield losses and scheduling irrigation allocation based on the phenological phase-specific drought sensitivity as influenced by cultivar genotype.展开更多
AlphaFold[1]has turned everyone into a structural biologist.No need for knowledge of Fourier transforms or spectral density,driven by artificial intelligence(AI),all one needs to do is enter the primary structure of a...AlphaFold[1]has turned everyone into a structural biologist.No need for knowledge of Fourier transforms or spectral density,driven by artificial intelligence(AI),all one needs to do is enter the primary structure of a folded protein,and out pops a tertiary structure nearly as good as one from an experiment-based structure.展开更多
Pharmaceutical pollution is becoming an increasing threat to aquatic environments since inactive compounds do not break down,and the drug products are accumulated in living organisms.The ability of a drug to dissolve ...Pharmaceutical pollution is becoming an increasing threat to aquatic environments since inactive compounds do not break down,and the drug products are accumulated in living organisms.The ability of a drug to dissolve in water(i.e.,LogS)is an important parameter for assessing a drug’s environmental fate,biovailability,and toxicity.LogS is typically measured in a laboratory setting,which can be costly and time-consuming,and does not provide the opportunity to conduct large-scale analyses.This research develops and evaluates machine learning models that can produce LogS estimates and may improve the environmental risk assessments of toxic pharmaceutical pollutants.We used a dataset from the ChEMBL database that contained 8832 molecular compounds.Various data preprocessing and cleaning techniques were applied(i.e.,removing the missing values),we then recorded chemical properties by normalizing and,even,using some feature selection techniques.We evaluated logS with a total of several machine learning and deep learning models,including;linear regression,random forests(RF),support vector machines(SVM),gradient boosting(GBM),and artificial neural networks(ANNs).We assessed model performance using a series of metrics,including root mean square error(RMSE)and mean absolute error(MAE),as well as the coefficient of determination(R^(2)).The findings show that the Least Angle Regression(LAR)model performed the best with an R^(2) value close to 1.0000,confirming high predictive accuracy.The OMP model performed well with good accuracy(R^(2)=0.8727)while remaining computationally cheap,while other models(e.g.,neural networks,random forests)performed well but were too computationally expensive.Finally,to assess the robustness of the results,an error analysis indicated that residuals were evenly distributed around zero,confirming the results from the LAR model.The current research illustrates the potential of AI in anticipating drug solubility,providing support for green pharmaceutical design and environmental risk assessment.Future work should extend predictions to include degradation and toxicity to enhance predictive power and applicability.展开更多
Soil mineralized nitrogen(N)is a vital component of soil N supply capacity and an important N source for rice growth.Unveiling N mineralization(Nm)process characteristics and developing a simple and effective approach...Soil mineralized nitrogen(N)is a vital component of soil N supply capacity and an important N source for rice growth.Unveiling N mineralization(Nm)process characteristics and developing a simple and effective approach to evaluate soil Nm are imperative to guide N fertilizer application and enhance its efficiency in various paddy soils with different physicochemical properties.Soil properties are important driving factors contributing to soil Nm differences and must be considered to achieve effective N management.Nevertheless,discrepancies in Nm capacity and other key influencing factors remain uncertain.To address this knowledge gap,this study collected 52 paddy soil samples from Taihu Lake Basin,China,which possess vastly different physicochemical properties.The samples were subjected to a 112-d submerged anaerobic incubation experiment at a constant temperature to obtain the soil Nm characteristics.Reaction kinetics models,including one-pool exponential model,two-pool exponential model,and effective cumulative temperature model,were employed to compare characteristic differences between Nm potential(Nmp)and short-term accumulated mineralized N(Amn)processes in relation to soil physicochemical properties.Based on these relationships,simplified Nmp prediction methods for paddy soils were established.The results revealed that the Nmp values were 145.18,88.64,and 21.03 mg kg-1 in paddy soils with pH<6.50,6.50≤pH≤7.50,and pH>7.50,respectively.Significantly,short-term Amn at day 14 showed a good correlation(P<0.01)with Nmp(R2=0.94),indicating that the prevailing short-term incubation experiment is an acceptable marker for Nmp.Moreover,Nmp correlated well with the ultraviolet absorbance value at 260 nm based on NaHCO3 extraction(Na260),further streamlining the Nmp estimation method.The incorporation of easily obtainable soil properties,including pH,total N(TN),and the ratio of total organic carbon to TN(C/N),alongside Na260 for Nmp evaluation allowed the multiple regression model,Nmp=58.62×TN-23.18×pH+13.08×C/N+86.96×Na260,to achieve a high prediction accuracy(R2=0.95).The reliability of this prediction was further validated with published data of paddy soils in the same region and other rice regions,demonstrating the regional applicability and prospects of this model.This study underscored the roles of soil properties in Nm characteristics and mechanisms and established a site-specific prediction model based on rapid extractions and edaphic properties of paddy soils,paving the way for developing rapid and precise Nm prediction models.展开更多
BACKGROUND The prevalence and mortality rates of gastric carcinoma are disproportionately elevated in China,with the disease's intricate and varied characteristics further amplifying its health impact.Precise fore...BACKGROUND The prevalence and mortality rates of gastric carcinoma are disproportionately elevated in China,with the disease's intricate and varied characteristics further amplifying its health impact.Precise forecasting of overall survival(OS)is of paramount importance for the clinical management of individuals afflicted with this malignancy.AIM To develop and validate a nomogram model that provides precise gastric cancer prevention and treatment guidance and more accurate survival outcome prediction for patients with gastric carcinoma.METHODS Data analysis was conducted on samples collected from hospitalized gastric cancer patients between 2018 and 2020.Least absolute shrinkage and selection operator,univariate,and multivariate Cox regression analyses were employed to identify independent prognostic factors.A nomogram model was developed to predict gastric cancer patient outcomes.The model's predictability and discriminative ability were evaluated via receiver operating characteristic curves.To evaluate the clinical utility of the model,Kaplan-Meier and decision curve analyses were performed.RESULTS A total of ten independent prognostic factors were identified,including body mass index,tumor-node-metastasis(TNM)stage,radiation,chemotherapy,surgery,albumin,globulin,neutrophil count,lactate dehydrogenase,and platelet-to-lymphocyte ratio.The area under the curve(AUC)values for the 1-,3-,and 5-year survival prediction in the training set were 0.843,0.850,and 0.821,respectively.The AUC values were 0.864,0.820,and 0.786 for the 1-,3-,and 5-year survival prediction in the validation set,respectively.The model exhibited strong discriminative ability,with both the time AUC and time C-index exceeding 0.75.Compared with TNM staging,the model demonstrated superior clinical utility.Ultimately,a nomogram was developed via a web-based interface.CONCLUSION This study established and validated a novel nomogram model for predicting the OS of gastric cancer patients,which demonstrated strong predictive ability.Based on these findings,this model can aid clinicians in implementing personalized interventions for patients with gastric cancer.展开更多
BACKGROUND Hepatocellular carcinoma(HCC)surveillance is crucial for patients with compensated cirrhosis(CC)and decompensated cirrhosis(DC).Increasing evidence has revealed a connection between thyroid hormone(TH)and H...BACKGROUND Hepatocellular carcinoma(HCC)surveillance is crucial for patients with compensated cirrhosis(CC)and decompensated cirrhosis(DC).Increasing evidence has revealed a connection between thyroid hormone(TH)and HCC,although this relationship remains contentious.Complements and immunoglobulin(Ig),which serve as surrogates of cirrhosis-associated immune dysfunc-tion,are associated with the severity and outcomes of liver cirrhosis(LC).To date,there is a lack of evidence supporting the recommendation of TH,Ig,and com-plement tests in patients at high risk of HCC.AIM To assess the predictive value of TH,Ig,and complements for HCC development.METHODS Data from 142 patients,comprising 72 patients with CC and 70 patients with DC,were analysed as a training set.Among them,100 patients who underwent complement and Ig tests were considered for internal validation.Logistic regression was employed to identify independent risk factors for HCC development.RESULTS The median follow-up duration was 32(24-37 months)months.The incidence of HCC was significantly higher in the DC group(16/70,22.9%)compared to the CC group(3/72,4.2%)(χ^(2)=10.698,P<0.01).Patients with DC exhibited lower total tetraiodothyronine(TT4),total triiodothyronine(TT3),free triiodothyronine,complement C3,and C4(all P<0.01),and higher IgA and IgG(both P<0.01).In both CC and DC patients,TT3 and TT4 positively correlated with alanine transaminase(ALT),aspartate transaminase(AST),and gamma-glutamyl transpeptidase(GGT).IgG positively correlated with IgM,IgA,ALT,and AST,while it negatively correlated with C3 and C4.Multivariable analysis indicated that age,DC status,and GGT were independent risk factors for HCC development.CONCLUSION The predictive value of TH,Ig,and complements for HCC development is suboptimal.Age,DC,and GGT emerge as more significant factors during HCC surveillance in hepatitis B virus-related LC.展开更多
High dropout rates in short-term job skills training programs hinder workforce development.This study applies machine learning to predict program completion while addressing class imbalance challenges.A dataset of6548...High dropout rates in short-term job skills training programs hinder workforce development.This study applies machine learning to predict program completion while addressing class imbalance challenges.A dataset of6548 records with 24 demographic,educational,program-specific,and employment-related features was analyzed.Data preprocessing involved cleaning,encoding categorical variables,and balancing the dataset using the Synthetic Minority Oversampling Technique(SMOTE),as only 15.9% of participants were dropouts.six machine learning models-Logistic Regression,Random Forest,SupportVector Machine,K-Nearest Neighbors,Naive Bayes,and XGBoost-were evaluated on both balanced and unbalanced datasets using an 80-20 train-test split.Performance was assessed using Accuracy,Precision,Recall,F1-score,and ROC-AUC.XGBoost achieved the highest performance on the balanced dataset,with an F1-score of 0.9200 and aROC-AUC of0.9684,followed by Random Forest.These findings highlight the potential of machine learning for early identification of dropout trainees,aiding in retention strategies for workforce training.The results support the integration of predictive analytics to optimize intervention efforts in short-term training programs.展开更多
The urgent necessity for enhanced risk stratification to improve the efficiency of colonoscopy screening is underscored by the fact that colorectal cancer(CRC)continues to be a primary cause of global cancer mortality...The urgent necessity for enhanced risk stratification to improve the efficiency of colonoscopy screening is underscored by the fact that colorectal cancer(CRC)continues to be a primary cause of global cancer mortality.Conventional models mostly rely on generalized obesity markers including body mass index(BMI),which does not effectively represent oncogenic risk linked with abdominal obesity.Liu et al undertook a large-scale case-control study comprising 6484 firsttime colonoscopy patients at a prominent Chinese hospital between 2020 and 2023 to overcome this restriction.Age,male sex,smoking status,and raised waist-hip ratio(WHR)were found by multivariate logistic regression as independent predictors of advanced colorectal neoplasia(ACN).In a validation cohort of 1891 individuals,a new 7-point risk scoring model was created and stratified into low-(5.0%)ACN prevalence,moderate-(10.3%)and high-risk(17.6%).With C-statistic=0.66 the model showed better discriminating ability than the Asia-Pacific Colorectal Screening(APCS)score(C-statistic=0.63)and the BMI-modified APCS model.These results fit newly published data showing central obesity as a major carcinogenic driver via pro-inflammatory visceral adipokine channels.With the use of WHR,patient risk classification is greatly improved,providing a practical tool to make the most of screening resources in the face of rising CRC incidence rates.Finally,multi-ethnic validation is necessary for the WHR-based scoring model to be considered for integration into global CRC preventive frameworks,since it improves the accuracy of ACN risk prediction.展开更多
Spectrum prediction is considered as a key technology to assist spectrum decision.Despite the great efforts that have been put on the construction of spectrum prediction,achieving accurate spectrum prediction emphasiz...Spectrum prediction is considered as a key technology to assist spectrum decision.Despite the great efforts that have been put on the construction of spectrum prediction,achieving accurate spectrum prediction emphasizes the need for more advanced solutions.In this paper,we propose a new multichannel multi-step spectrum prediction method using Transformer and stacked bidirectional LSTM(Bi-LSTM),named TSB.Specifically,we use multi-head attention and stacked Bi-LSTM to build a new Transformer based on encoder-decoder architecture.The self-attention mechanism composed of multiple layers of multi-head attention can continuously attend to all positions of the multichannel spectrum sequences.The stacked Bi-LSTM can learn these focused coding features by multi-head attention layer by layer.The advantage of this fusion mode is that it can deeply capture the long-term dependence of multichannel spectrum data.We have conducted extensive experiments on a dataset generated by a real simulation platform.The results show that the proposed algorithm performs better than the baselines.展开更多
Microvascular invasion(MVI)is a critical factor in hepatocellular carcinoma(HCC)prognosis,particularly in hepatitis B virus(HBV)-related cases.This editorial examines a recent study by Xu et al who developed models to...Microvascular invasion(MVI)is a critical factor in hepatocellular carcinoma(HCC)prognosis,particularly in hepatitis B virus(HBV)-related cases.This editorial examines a recent study by Xu et al who developed models to predict MVI and high-risk(M2)status in HBV-related HCC using contrast-enhanced computed tomography(CECT)radiomics and clinicoradiological factors.The study analyzed 270 patients,creating models that achieved an area under the curve values of 0.841 and 0.768 for MVI prediction,and 0.865 and 0.798 for M2 status prediction in training and validation datasets,respectively.These results are comparable to previous radiomics-based approaches,which reinforces the potential of this method in MVI prediction.The strengths of the study include its focus on HBV-related HCC and the use of widely accessible CECT imaging.However,limitations,such as retrospective design and manual segmentation,highlight areas for improvement.The editorial discusses the implications of the study including the need for standardized radiomics approaches and the potential impact on personalized treatment strategies.It also suggests future research directions,such as exploring mechanistic links between radiomics features and MVI,as well as integrating additional biomarkers or imaging modalities.Overall,this study contributes significantly to HCC management,paving the way for more accurate,personalized treatment approaches in the era of precision oncology.展开更多
BACKGROUND Few studies have specifically modeled the risk of venous thromboembolism(VTE)for postoperative hepatocellular carcinoma(HCC)patients,although HCC is the third leading cause of cancer death worldwide.This st...BACKGROUND Few studies have specifically modeled the risk of venous thromboembolism(VTE)for postoperative hepatocellular carcinoma(HCC)patients,although HCC is the third leading cause of cancer death worldwide.This study aimed to develop and validate a nomogram that accurately predicts the risk of VTE in patients after HCC surgery.AIM To develop and validate a nomogram to accurately predict the risk of VTE in postoperative HCC patients by integrating clinical and laboratory risk factors.The model seeks to provide a user-friendly tool for identifying high-risk individuals who may benefit from targeted anticoagulation therapy,thereby improving clinical decision-making and patient outcomes.METHODS Data from patients who underwent HCC surgery at Chongqing University Cancer Hospital in China were analyzed.Through univariate and multivariate logistic regression analyses,independent risk factors for VTE were identified and integrated into a nomogram.The predictive performance of the nomogram was assessed via receiver operating characteristic curves,calibration curves,decision curve analysis and other relevant metrics.RESULTS Of 905 postoperative HCC patients were included in the study.The nomogram incorporated eight independent risk factors for VTE:Karnofsky Performance Scale,base disease,cancer stage(tumor-node-metastasis),chemotherapy,D-dimer concentration,white blood cell count,hemoglobin,and fibrinogen.The C-index for the nomogram model was 0.825 in the training cohort and 0.820 in the validation cohort,indicating good discriminative ability.Calibration plots of the model revealed high concordance between the predicted probabilities and observed outcomes.CONCLUSION We developed and validated a novel nomogram that can accurately estimate the risk of VTE in individual postoperative HCC patients.This model can identify high-risk patients who may benefit from targeted anticoagulation therapy.展开更多
Prediction of weaning success from invasive mechanical ventilation remains a challenge in everyday clinical practice.Several prediction scores have been developed to guide success during spontaneous breathing trials t...Prediction of weaning success from invasive mechanical ventilation remains a challenge in everyday clinical practice.Several prediction scores have been developed to guide success during spontaneous breathing trials to help with weaning decisions.These scores aim to provide a structured framework to support clinical judgment.However,their effectiveness varies across patient populations,and their predictive accuracy remains inconsistent.In this review,we aim to identify the strengths and limitations of commonly used clinical prediction tools in assessing readiness for ventilator liberation.While scores such as the Rapid Shallow Breathing Index and the Integrative Weaning Index are widely adopted,their sensitivity and specificity often fall short in complex clinical settings.Factors such as underlying disease pathophysiology,patient characteristics,and clinician subjectivity impact score performance and reliability.Moreover,disparities in validation across diverse populations limit generalizability.With growing interest in artificial intelligence(AI)and machine learning,there is potential for enhanced prediction models that integrate multidimensional data and adapt to individual patient profiles.However,current AI approaches face challenges related to interpretability,bias,and ethical implementation.This paper underscores the need for more robust,individualized,and transparent prediction systems and advocates for careful integration of emerging technologies into clinical workflows to optimize weaning success and patient outcomes.展开更多
Objective:This study aimed to construct a model that predicts invasive lung cancer using longitudinal radiological features from multiple low-dose computed tomography(LDCT)scans,thereby addressing overdiagnosis in lun...Objective:This study aimed to construct a model that predicts invasive lung cancer using longitudinal radiological features from multiple low-dose computed tomography(LDCT)scans,thereby addressing overdiagnosis in lung cancer screening.Methods:In this retrospective study,628 patients with pulmonary nodules who underwent three LDCT scans followed by surgical resection were categorized into invasive carcinoma(n=155)and non-invasive nodule(n=473)groups on the basis of pathological diagnosis.This derivation aimed to identify risk factors and construct a multivariate logistic model.The predictive performance was externally validated in two independent cohorts(retrospectively designed,n=252;prospectively designed,n=269).The discrimination and calibration of the model were evaluated using area under the curve(AUC),and calibration plots.Decision curve analysis(DCA)was further performed to evaluate the net benefit in practical clinical scenarios.Results:The model,termed multiple CTs-invasive lung cancer(MCT-ILC),incorporated eleven factors encompassing nodule features at baseline and feature variability during follow-up.The standard deviation of diameter variability(SD_(diameter))was the most reliable predictor,with an odds ratio[95% confidence interval(95%CI)of 7.35(5.32-10.16)(P<0.001)].AUCs with 95% CIs for the MCT-ILC model were 0.912(0.864-0.960)and 0.906(0.833-0.979)in the two testing cohorts and were superior to those for the model containing only features at baseline(PD_(elong)=0.002 and 0.021,respectively).For calibration,the Brier scores of the MCT-ILC model were0.091(95% CI:0.064-0.118) and 0.078(95% CI:0.055-0.101)in the two test sets.The decision curve image showed that the MCT-ILC model was the only model that maintained positive net benefits across the entire threshold range.Furthermore,the MCT-ILC model score could classify more than 90% of patients with invasive nodules into the high-risk group.Conclusions:The MCT-ILC model could assess pulmonary nodule invasiveness,potentially mitigating overdiagnosis in lung cancer screening.展开更多
Objective To develop an onset risk prediction nomogram for patients with homocysteine-type(H-type)hypertension(HTH)based on pulse diagram parameters to assist early clinical prediction and diagnosis of HTH.Methods Pat...Objective To develop an onset risk prediction nomogram for patients with homocysteine-type(H-type)hypertension(HTH)based on pulse diagram parameters to assist early clinical prediction and diagnosis of HTH.Methods Patients diagnosed with essential hypertension and admitted to Shanghai Shuguang Hospital Affiliated to Shanghai University of Traditional Chinese Medicine,Shang-hai Hospital of Traditional Chinese Medicine,and Shanghai Hospital of Integrated Tradition-al Chinese and Western Medicine from July 6th 2020 to June 16th 2021,and from August 11th 2023 to January 22nd 2024,were enrolled in this retrospective research.The baselines and clinical biochemical indicators of patients were collected.The SMART-I TCM pulse instru-ment was applied to gather pulse diagram parameters.Multivariate logistic regression was adopted to analyze the risk factors for HTH.RStudio was employed to construct the nomo-gram model,receiver operating characteristic(ROC)curve,and calibration curve(bootstrap self-sampling 200 times),and clinical decision curve were drawn to evaluate the model’s dis-crimination and clinical effectiveness.Results A total of 168 hospitalized patients with essential hypertension were selected and di-vided into non-HTH group(n=29)and HTH group(n=139).Compared with non-HTH group,HTH group had a lower body mass index(BMI),and higher proportions of male pa-tients and drinkers(P<0.05).The ventricular wall thickening(VWT)could not be deter-mined.The proportions of left common carotid intima-media wall thickness(LCCIMWT)and serum creatinine(SCR)were higher in HTH group(P<0.05).The pulse diagram parameter As was significantly higher,and H4/H1 and T1/T were lower in HTH group(P<0.05).Gender,al-cohol consumption,serum creatinine,and the pulse diagram parameter H4/H1 were identi-fied as independent risk factors for HTH(P<0.05).The nomogram’s area under the ROC curve(AUC)was 0.795[95%confidence interval(CI):(0.7066,0.8828)],with a specificity of 0.724 and sensitivity of 0.799.After 200 times repeated bootstrap self-samplings,the calibra-tion curve showed that the simulated curve fits well with the actual curve(x^(2)=9.5002,P=0.3019).The clinical decision curve indicated that the nomogram’s applicability was optimal when the threshold for predicting HTH was between 0.38 and 1.00.Conclusion The nomogram model could be valuable for predicting the onset risk of HTH and pulse diagram parameters can facilitate early screening and prevention of HTH.展开更多
文摘BACKGROUND Laparoscopic distal pancreatectomy(LDP)has emerged as the preferred approach for both benign and malignant lesions located in the pancreatic body and tail.Nevertheless,a notable deficiency persists in the absence of a standardized,procedure-specific metric for evaluating and comparing surgical quality.A composite measure termed“textbook outcome(TO)”,which encompasses key short-term endpoints,has been validated in laparoscopic pancreatoduodenectomy but has not yet been established in dedicated LDP cohorts.The definition and prediction of TO in this context could aid in facilitating cross-institutional benchmarking and fostering advancements in quality improvement.AIM To establish procedure-specific criteria for TO and identify independent predictors of TO failure in patients undergoing LDP.METHODS Consecutive patients who underwent LDP at a single high-volume pancreatic center between January 2015 and August 2022 were retrospectively analyzed.TO was defined as the absence of clinically relevant postoperative pancreatic fistula(grade B/C),post-pancreatectomy hemorrhage(grade B/C),severe complications(Clavien-Dindo≥III),readmission within 30 days,and in-hospital or 30-day mortality.Multivariable logistic regression was employed to identify independent predictors of TO failure,and a nomogram was constructed and internally validated.RESULTS Among 405 eligible patients,286(70.6%)attained TO.Multivariable analysis revealed that female sex[odds ratio(OR)=0.62,95%confidence interval(CI):0.39-0.99]conferred a protective effect,while preoperative endoscopic ultrasound-guided fine-needle aspiration(OR=2.66,95%CI:1.05-6.73),pancreatic portal hypertension(OR=2.81,95%CI:1.06-7.45),and cystic-solid(OR=2.51,95%CI:1.34-4.69)or solid lesions(OR=1.91,95%CI:1.06-3.44)were independently associated with TO failure(all P<0.05).The derived nomogram exhibited modest discrimination and calibration when assessed in both the training and validation datasets.CONCLUSION The proposed LDP-specific definition of TO is feasible and discriminative,and the developed nomogram provides an objective tool for individualized risk assessment.
文摘Accurately predicting environmental parameters in solar greenhouses is crucial for achieving precise environmental control.In solar greenhouses,temperature,humidity,and light intensity are crucial environmental parameters.The monitoring platform collected data on the internal environment of the solar greenhouse for one year,including temperature,humidity,and light intensity.Additionally,meteorological data,comprising outdoor temperature,outdoor humidity,and outdoor light intensity,was gathered during the same time frame.The characteristics and interrelationships among these parameters were investigated by a thorough analysis.The analysis revealed that environmental parameters in solar greenhouses displayed characteristics such as temporal variability,non-linearity,and periodicity.These parameters exhibited complex coupling relationships.Notably,these characteristics and coupling relationships exhibited pronounced seasonal variations.The multi-parameter multi-step prediction model for solar greenhouse(MPMS-SGH)was introduced,aiming to accurately predict three key greenhouse environmental parameters,and the model had certain seasonal adaptability.MPMS-SGH was structured with multiple layers,including an input layer,a preprocessing layer,a feature extraction layer,and a prediction layer.The input layer was used to generate the original sequence matrix,which included indoor temperature,indoor humidity,indoor light intensity,as well as outdoor temperature and outdoor light intensity.Then the preprocessing layer normalized,decomposed,and positionally encoded the original sequence matrix.In the feature extraction layer,the time attention mechanism and frequency attention mechanism were used to extract features from the trend component and the seasonal component,respectively.Finally,the prediction layer used a multi-layer perceptron to perform multi-step prediction of indoor environmental parameters(i.e.temperature,humidity,and light intensity).The parameter selection experiment evaluated the predictive performance of MPMS-SGH on input and output sequences of different lengths.The results indicated that with a constant output sequence length,the prediction accuracy of MPMS-SGH was firstly increased and then decreased with the increase of input sequence length.Specifically,when the input sequence length was 100,MPMS-SGH had the highest prediction accuracy,with RMSE of 0.22℃,0.28%,and 250lx for temperature,humidity,and light intensity,respectively.When the length of the input sequence remained constant,as the length of the output sequence increased,the accuracy of the model in predicting the three environmental parameters was continuously decreased.When the length of the output sequence exceeded 45,the prediction accuracy of MPMS-SGH was significantly decreased.In order to achieve the best balance between model size and performance,the input sequence length of MPMS-SGH was set to be 100,while the output sequence length was set to be 35.To assess MPMS-SGH’s performance,comparative experiments with four prediction models were conducted:SVR,STL-SVR,LSTM,and STL-LSTM.The results demonstrated that MPMS-SGH surpassed all other models,achieving RMSE of 0.15℃for temperature,0.38%for humidity,and 260lx for light intensity.Additionally,sequence decomposition can contribute to enhancing MPMS-SGH’s prediction performance.To further evaluate MPMS-SGH’s capabilities,its prediction accuracy was tested across different seasons for greenhouse environmental parameters.MPMS-SGH had the highest accuracy in predicting indoor temperature and the lowest accuracy in predicting humidity.And the accuracy of MPMS-SGH in predicting environmental parameters of the solar greenhouse fluctuated with seasons.MPMS-SGH had the highest accuracy in predicting the temperature inside the greenhouse on sunny days in spring(R^(2)=0.91),the highest accuracy in predicting the humidity inside the greenhouse on sunny days in winter(R^(2)=0.83),and the highest accuracy in predicting the light intensity inside the greenhouse on cloudy days in autumm(R^(2)=0.89).MPMS-SGH had the lowest accuracy in predicting three environmental parameters in a sunny summer greenhouse.
基金supported by the National Key Research and Development Program of China(No.2024YFC3506900)Science and Technology Program of Tianjin(No.24ZXZSSS00460)Special Project for Technological Innovation in New Productive Forces of Modern Chinese Medicines(No.24ZXZKSY00010)。
文摘Due to its synergistic effects and reduced side effects,combination therapy has become an important strategy for treating complex diseases.In traditional Chinese medicine(TCM),the“monarch,minister,assistant,envoy”compatibilities theory provides a systematic framework for drug compatibility and has guided the formation of a large number of classic formulas.However,due to the complex compositions and diverse mechanisms of action of TCM,it is difficult to comprehensively reveal its potential synergistic patterns using traditional methods.Synergistic prediction based on molecular compatibility theory provides new ideas for identifying combinations of active compounds in TCM.Compared to resource-intensive traditional experimental methods,artificial intelligence possesses the ability to mine synergistic patterns from multi-omics and structural data,providing an efficient means for modeling and optimizing TCM combinations.This paper systematically reviews the application progress of AI in the synergistic prediction of TCM active compounds and explores the challenges and prospects of its application in modeling combination relationships,thereby contributing to the modernization of TCM theory and methodological innovation.
基金supported by the National Natural Science Foundation of China(52274055)the Shandong Provincial Natural Science Foundation(ZR2022YQ50)the Taishan Scholar Program of Shandong Province(tsqn202408088)。
文摘Predicting the productivity of multistage fractured horizontal wells plays an important role in exploiting unconventional resources.In recent years,machine learning(ML)models have emerged as a new approach for such studies.However,the scarcity of sufficient real data for model training often leads to imprecise predictions,even though the models trained with real data better characterize geological and engineering features.To tackle this issue,we propose an ML model that can obtain reliable results even with a small amount of data samples.Our model integrates the synthetic minority oversampling technique(SMOTE)to expand the data volume,the support vector machine(SVM)for model training,and the particle swarm optimization(PSO)algorithm for optimizing hyperparameters.To enhance the model performance,we conduct feature fusion and dimensionality reduction.Additionally,we examine the influences of different sample sizes and ML models for training.The proposed model demonstrates higher prediction accuracy and generalization ability,achieving a predicted R^(2)value of up to 0.9 for the test set,compared to the traditional ML techniques with an R^(2)of 0.13.This model accurately predicts the production of fractured horizontal wells even with limited samples,supplying an efficient tool for optimizing the production of unconventional resources.Importantly,the model holds the potential applicability to address similar challenges in other fields constrained by scarce data samples.
基金the National Council for Scientific and Technological Development of Brazil(CNPQ)the Coordination for the Improvement of Higher Education Personnel-Brazil(CAPES)(Grant PROAP 88887.842889/2023-00-PUC/MG,Grant PDPG 88887.708960/2022-00-PUC/MG-INFORMATICA and Finance Code 001)Minas Gerais State Research Support Foundation(FAPEMIG)under Grant No.:APQ-01929-22,and the Pontifical Catholic University of Minas Gerais,Brazil.
文摘Higher education institutions are becoming increasingly concerned with the retention of their students.This work is motivated by the interest in predicting and reducing student dropout,and consequently in reducing the financial losses of said institutions.Based on the characterization of the dropout problem and the application of a knowledge discovery process,an ensemble model is proposed to improve dropout prediction.The ensemble model combines the results of three models:logistic regression,neural networks,and decision tree.As a result,the model can correctly classify 89%of the students as enrolled or dropped and accurately identify 98.1%of dropouts.When compared with the Random Forest ensemble method,the proposed model demonstrates desirable characteristics to assist management in proposing actions to retain students.
基金Project(2024JJ2073)supported by the Science Fund for Distinguished Young Scholars of Hunan Province,ChinaProjects(2023YFC3807205,2019YFC1904704)+4 种基金supported by the National Key R&D Program of ChinaProject(52178443)supported by the National Natural Science Foundation of ChinaProject(2024ZZTS0109)supported by Fundamental Research Funds for the Central Universities of Central South University,China。
文摘Permeable roads generally exhibit inferior mechanical properties and shorter service life than traditional dense-graded/impermeable roads.Furthermore,the incorporation of recycled aggregates in their construction may exacerbate these limitations.To address these issues,this study introduced a novel cement-stabilized permeable recycled aggregate material.A total of 162 beam specimens prepared with nine different levels of cement-aggregate ratio were tested to evaluate their permeability,bending load,and bending fatigue life.The experimental results indicate that increasing the content of recycled aggregates led to a reduction in both permeability and bending load.Additionally,the inclusion of recycled aggregates diminished the energy dissipation capacity of the specimens.These findings were used to establish a robust relationship between the initial damage in cement-stabilized permeable recycled aggregate material specimens and their fatigue life,and to propose a predictive model for their fatigue performance.Further,a method for assessing fatigue damage based on the evolution of fatigue-induced strain and energy dissipation was developed.The findings of this study provide valuable insights into the mechanical behavior and fatigue performance of cement-stabilized permeable recycled aggregate materials,offering guidance for the design of low-carbon-emission,permeable,and durable roadways incorporating recycled aggregates.
文摘In most agricultural areas in the semi-arid region of the southern United States, wheat (Triticum aestivum L.) production is a primary economic activity. This region is drought-prone and projected to have a drier climate in the future. Predicting the yield loss due to an anticipated drought is crucial for wheat growers. A reliable way for predicting the drought-induced yield loss is to use a plant physiology-based drought index, such as Agricultural Reference Index for Drought (ARID). Since different wheat cultivars exhibit varying levels of sensitivity to water stress, the impact of drought could be different on the cultivars belonging to different drought sensitivity groups. The objective of this study was to develop the cultivar drought sensitivity (CDS) group-specific, ARID-based models for predicting the drought-induced yield loss of winter wheat in the Llano Estacado region in the southern United States by accounting for the phenological phase-specific sensitivity to drought. For the study, the historical (1947-2021) winter wheat grain yield and daily weather data of two locations in the region (Bushland, TX and Clovis, NM) were used. The logical values of the drought sensitivity parameters of the yield models, especially for the moderately-sensitive and highly-sensitive CDS groups, indicated that the yield models reflected the phenomenon of water stress decreasing the winter wheat yields in this region satisfactorily. The reasonable values of the Nash-Sutcliffe Index (0.65 and 0.72), the Willmott Index (0.88 and 0.92), and the percentage error (23 and 22) for the moderately-sensitive and highly-sensitive CDS groups, respectively, indicated that the yield models for these groups performed reasonably well. These models could be useful for predicting the drought-induced yield losses and scheduling irrigation allocation based on the phenological phase-specific drought sensitivity as influenced by cultivar genotype.
基金supported by the U.S.National Natural Science Foundation(CHE-2203505 and MCB-2335137).
文摘AlphaFold[1]has turned everyone into a structural biologist.No need for knowledge of Fourier transforms or spectral density,driven by artificial intelligence(AI),all one needs to do is enter the primary structure of a folded protein,and out pops a tertiary structure nearly as good as one from an experiment-based structure.
文摘Pharmaceutical pollution is becoming an increasing threat to aquatic environments since inactive compounds do not break down,and the drug products are accumulated in living organisms.The ability of a drug to dissolve in water(i.e.,LogS)is an important parameter for assessing a drug’s environmental fate,biovailability,and toxicity.LogS is typically measured in a laboratory setting,which can be costly and time-consuming,and does not provide the opportunity to conduct large-scale analyses.This research develops and evaluates machine learning models that can produce LogS estimates and may improve the environmental risk assessments of toxic pharmaceutical pollutants.We used a dataset from the ChEMBL database that contained 8832 molecular compounds.Various data preprocessing and cleaning techniques were applied(i.e.,removing the missing values),we then recorded chemical properties by normalizing and,even,using some feature selection techniques.We evaluated logS with a total of several machine learning and deep learning models,including;linear regression,random forests(RF),support vector machines(SVM),gradient boosting(GBM),and artificial neural networks(ANNs).We assessed model performance using a series of metrics,including root mean square error(RMSE)and mean absolute error(MAE),as well as the coefficient of determination(R^(2)).The findings show that the Least Angle Regression(LAR)model performed the best with an R^(2) value close to 1.0000,confirming high predictive accuracy.The OMP model performed well with good accuracy(R^(2)=0.8727)while remaining computationally cheap,while other models(e.g.,neural networks,random forests)performed well but were too computationally expensive.Finally,to assess the robustness of the results,an error analysis indicated that residuals were evenly distributed around zero,confirming the results from the LAR model.The current research illustrates the potential of AI in anticipating drug solubility,providing support for green pharmaceutical design and environmental risk assessment.Future work should extend predictions to include degradation and toxicity to enhance predictive power and applicability.
基金supported by the Youth Innovation Promotion Association of Chinese Academy of Sciences(No.Y201956)the Young Elite Scientists Sponsorship Program by China Association for Science and Technology(No.2023QNRC001)the National Key Research and Development Program of China(No.2017YFD200104).
文摘Soil mineralized nitrogen(N)is a vital component of soil N supply capacity and an important N source for rice growth.Unveiling N mineralization(Nm)process characteristics and developing a simple and effective approach to evaluate soil Nm are imperative to guide N fertilizer application and enhance its efficiency in various paddy soils with different physicochemical properties.Soil properties are important driving factors contributing to soil Nm differences and must be considered to achieve effective N management.Nevertheless,discrepancies in Nm capacity and other key influencing factors remain uncertain.To address this knowledge gap,this study collected 52 paddy soil samples from Taihu Lake Basin,China,which possess vastly different physicochemical properties.The samples were subjected to a 112-d submerged anaerobic incubation experiment at a constant temperature to obtain the soil Nm characteristics.Reaction kinetics models,including one-pool exponential model,two-pool exponential model,and effective cumulative temperature model,were employed to compare characteristic differences between Nm potential(Nmp)and short-term accumulated mineralized N(Amn)processes in relation to soil physicochemical properties.Based on these relationships,simplified Nmp prediction methods for paddy soils were established.The results revealed that the Nmp values were 145.18,88.64,and 21.03 mg kg-1 in paddy soils with pH<6.50,6.50≤pH≤7.50,and pH>7.50,respectively.Significantly,short-term Amn at day 14 showed a good correlation(P<0.01)with Nmp(R2=0.94),indicating that the prevailing short-term incubation experiment is an acceptable marker for Nmp.Moreover,Nmp correlated well with the ultraviolet absorbance value at 260 nm based on NaHCO3 extraction(Na260),further streamlining the Nmp estimation method.The incorporation of easily obtainable soil properties,including pH,total N(TN),and the ratio of total organic carbon to TN(C/N),alongside Na260 for Nmp evaluation allowed the multiple regression model,Nmp=58.62×TN-23.18×pH+13.08×C/N+86.96×Na260,to achieve a high prediction accuracy(R2=0.95).The reliability of this prediction was further validated with published data of paddy soils in the same region and other rice regions,demonstrating the regional applicability and prospects of this model.This study underscored the roles of soil properties in Nm characteristics and mechanisms and established a site-specific prediction model based on rapid extractions and edaphic properties of paddy soils,paving the way for developing rapid and precise Nm prediction models.
文摘BACKGROUND The prevalence and mortality rates of gastric carcinoma are disproportionately elevated in China,with the disease's intricate and varied characteristics further amplifying its health impact.Precise forecasting of overall survival(OS)is of paramount importance for the clinical management of individuals afflicted with this malignancy.AIM To develop and validate a nomogram model that provides precise gastric cancer prevention and treatment guidance and more accurate survival outcome prediction for patients with gastric carcinoma.METHODS Data analysis was conducted on samples collected from hospitalized gastric cancer patients between 2018 and 2020.Least absolute shrinkage and selection operator,univariate,and multivariate Cox regression analyses were employed to identify independent prognostic factors.A nomogram model was developed to predict gastric cancer patient outcomes.The model's predictability and discriminative ability were evaluated via receiver operating characteristic curves.To evaluate the clinical utility of the model,Kaplan-Meier and decision curve analyses were performed.RESULTS A total of ten independent prognostic factors were identified,including body mass index,tumor-node-metastasis(TNM)stage,radiation,chemotherapy,surgery,albumin,globulin,neutrophil count,lactate dehydrogenase,and platelet-to-lymphocyte ratio.The area under the curve(AUC)values for the 1-,3-,and 5-year survival prediction in the training set were 0.843,0.850,and 0.821,respectively.The AUC values were 0.864,0.820,and 0.786 for the 1-,3-,and 5-year survival prediction in the validation set,respectively.The model exhibited strong discriminative ability,with both the time AUC and time C-index exceeding 0.75.Compared with TNM staging,the model demonstrated superior clinical utility.Ultimately,a nomogram was developed via a web-based interface.CONCLUSION This study established and validated a novel nomogram model for predicting the OS of gastric cancer patients,which demonstrated strong predictive ability.Based on these findings,this model can aid clinicians in implementing personalized interventions for patients with gastric cancer.
基金Supported by The Research Foundation of Jiangsu Province Administration of Traditional Chinese Medicine,No.MS2023088The Science and Technology Project of Changzhou,No.CE20225040+1 种基金The Research Foundation of Nanjing Medical University Changzhou Medical Center,No.CMCC202311Leading Talent of Changzhou“The 14th Five-Year Plan”High-Level Health Talents Training Project,No.2022CZLJ021.
文摘BACKGROUND Hepatocellular carcinoma(HCC)surveillance is crucial for patients with compensated cirrhosis(CC)and decompensated cirrhosis(DC).Increasing evidence has revealed a connection between thyroid hormone(TH)and HCC,although this relationship remains contentious.Complements and immunoglobulin(Ig),which serve as surrogates of cirrhosis-associated immune dysfunc-tion,are associated with the severity and outcomes of liver cirrhosis(LC).To date,there is a lack of evidence supporting the recommendation of TH,Ig,and com-plement tests in patients at high risk of HCC.AIM To assess the predictive value of TH,Ig,and complements for HCC development.METHODS Data from 142 patients,comprising 72 patients with CC and 70 patients with DC,were analysed as a training set.Among them,100 patients who underwent complement and Ig tests were considered for internal validation.Logistic regression was employed to identify independent risk factors for HCC development.RESULTS The median follow-up duration was 32(24-37 months)months.The incidence of HCC was significantly higher in the DC group(16/70,22.9%)compared to the CC group(3/72,4.2%)(χ^(2)=10.698,P<0.01).Patients with DC exhibited lower total tetraiodothyronine(TT4),total triiodothyronine(TT3),free triiodothyronine,complement C3,and C4(all P<0.01),and higher IgA and IgG(both P<0.01).In both CC and DC patients,TT3 and TT4 positively correlated with alanine transaminase(ALT),aspartate transaminase(AST),and gamma-glutamyl transpeptidase(GGT).IgG positively correlated with IgM,IgA,ALT,and AST,while it negatively correlated with C3 and C4.Multivariable analysis indicated that age,DC status,and GGT were independent risk factors for HCC development.CONCLUSION The predictive value of TH,Ig,and complements for HCC development is suboptimal.Age,DC,and GGT emerge as more significant factors during HCC surveillance in hepatitis B virus-related LC.
文摘High dropout rates in short-term job skills training programs hinder workforce development.This study applies machine learning to predict program completion while addressing class imbalance challenges.A dataset of6548 records with 24 demographic,educational,program-specific,and employment-related features was analyzed.Data preprocessing involved cleaning,encoding categorical variables,and balancing the dataset using the Synthetic Minority Oversampling Technique(SMOTE),as only 15.9% of participants were dropouts.six machine learning models-Logistic Regression,Random Forest,SupportVector Machine,K-Nearest Neighbors,Naive Bayes,and XGBoost-were evaluated on both balanced and unbalanced datasets using an 80-20 train-test split.Performance was assessed using Accuracy,Precision,Recall,F1-score,and ROC-AUC.XGBoost achieved the highest performance on the balanced dataset,with an F1-score of 0.9200 and aROC-AUC of0.9684,followed by Random Forest.These findings highlight the potential of machine learning for early identification of dropout trainees,aiding in retention strategies for workforce training.The results support the integration of predictive analytics to optimize intervention efforts in short-term training programs.
文摘The urgent necessity for enhanced risk stratification to improve the efficiency of colonoscopy screening is underscored by the fact that colorectal cancer(CRC)continues to be a primary cause of global cancer mortality.Conventional models mostly rely on generalized obesity markers including body mass index(BMI),which does not effectively represent oncogenic risk linked with abdominal obesity.Liu et al undertook a large-scale case-control study comprising 6484 firsttime colonoscopy patients at a prominent Chinese hospital between 2020 and 2023 to overcome this restriction.Age,male sex,smoking status,and raised waist-hip ratio(WHR)were found by multivariate logistic regression as independent predictors of advanced colorectal neoplasia(ACN).In a validation cohort of 1891 individuals,a new 7-point risk scoring model was created and stratified into low-(5.0%)ACN prevalence,moderate-(10.3%)and high-risk(17.6%).With C-statistic=0.66 the model showed better discriminating ability than the Asia-Pacific Colorectal Screening(APCS)score(C-statistic=0.63)and the BMI-modified APCS model.These results fit newly published data showing central obesity as a major carcinogenic driver via pro-inflammatory visceral adipokine channels.With the use of WHR,patient risk classification is greatly improved,providing a practical tool to make the most of screening resources in the face of rising CRC incidence rates.Finally,multi-ethnic validation is necessary for the WHR-based scoring model to be considered for integration into global CRC preventive frameworks,since it improves the accuracy of ACN risk prediction.
基金supported in part by the National Natural Science Foundation of China under Grants 62231015,62427801in part by Jiangsu Province Frontier Leading Technology Basic Research Project BK20232030.
文摘Spectrum prediction is considered as a key technology to assist spectrum decision.Despite the great efforts that have been put on the construction of spectrum prediction,achieving accurate spectrum prediction emphasizes the need for more advanced solutions.In this paper,we propose a new multichannel multi-step spectrum prediction method using Transformer and stacked bidirectional LSTM(Bi-LSTM),named TSB.Specifically,we use multi-head attention and stacked Bi-LSTM to build a new Transformer based on encoder-decoder architecture.The self-attention mechanism composed of multiple layers of multi-head attention can continuously attend to all positions of the multichannel spectrum sequences.The stacked Bi-LSTM can learn these focused coding features by multi-head attention layer by layer.The advantage of this fusion mode is that it can deeply capture the long-term dependence of multichannel spectrum data.We have conducted extensive experiments on a dataset generated by a real simulation platform.The results show that the proposed algorithm performs better than the baselines.
文摘Microvascular invasion(MVI)is a critical factor in hepatocellular carcinoma(HCC)prognosis,particularly in hepatitis B virus(HBV)-related cases.This editorial examines a recent study by Xu et al who developed models to predict MVI and high-risk(M2)status in HBV-related HCC using contrast-enhanced computed tomography(CECT)radiomics and clinicoradiological factors.The study analyzed 270 patients,creating models that achieved an area under the curve values of 0.841 and 0.768 for MVI prediction,and 0.865 and 0.798 for M2 status prediction in training and validation datasets,respectively.These results are comparable to previous radiomics-based approaches,which reinforces the potential of this method in MVI prediction.The strengths of the study include its focus on HBV-related HCC and the use of widely accessible CECT imaging.However,limitations,such as retrospective design and manual segmentation,highlight areas for improvement.The editorial discusses the implications of the study including the need for standardized radiomics approaches and the potential impact on personalized treatment strategies.It also suggests future research directions,such as exploring mechanistic links between radiomics features and MVI,as well as integrating additional biomarkers or imaging modalities.Overall,this study contributes significantly to HCC management,paving the way for more accurate,personalized treatment approaches in the era of precision oncology.
文摘BACKGROUND Few studies have specifically modeled the risk of venous thromboembolism(VTE)for postoperative hepatocellular carcinoma(HCC)patients,although HCC is the third leading cause of cancer death worldwide.This study aimed to develop and validate a nomogram that accurately predicts the risk of VTE in patients after HCC surgery.AIM To develop and validate a nomogram to accurately predict the risk of VTE in postoperative HCC patients by integrating clinical and laboratory risk factors.The model seeks to provide a user-friendly tool for identifying high-risk individuals who may benefit from targeted anticoagulation therapy,thereby improving clinical decision-making and patient outcomes.METHODS Data from patients who underwent HCC surgery at Chongqing University Cancer Hospital in China were analyzed.Through univariate and multivariate logistic regression analyses,independent risk factors for VTE were identified and integrated into a nomogram.The predictive performance of the nomogram was assessed via receiver operating characteristic curves,calibration curves,decision curve analysis and other relevant metrics.RESULTS Of 905 postoperative HCC patients were included in the study.The nomogram incorporated eight independent risk factors for VTE:Karnofsky Performance Scale,base disease,cancer stage(tumor-node-metastasis),chemotherapy,D-dimer concentration,white blood cell count,hemoglobin,and fibrinogen.The C-index for the nomogram model was 0.825 in the training cohort and 0.820 in the validation cohort,indicating good discriminative ability.Calibration plots of the model revealed high concordance between the predicted probabilities and observed outcomes.CONCLUSION We developed and validated a novel nomogram that can accurately estimate the risk of VTE in individual postoperative HCC patients.This model can identify high-risk patients who may benefit from targeted anticoagulation therapy.
文摘Prediction of weaning success from invasive mechanical ventilation remains a challenge in everyday clinical practice.Several prediction scores have been developed to guide success during spontaneous breathing trials to help with weaning decisions.These scores aim to provide a structured framework to support clinical judgment.However,their effectiveness varies across patient populations,and their predictive accuracy remains inconsistent.In this review,we aim to identify the strengths and limitations of commonly used clinical prediction tools in assessing readiness for ventilator liberation.While scores such as the Rapid Shallow Breathing Index and the Integrative Weaning Index are widely adopted,their sensitivity and specificity often fall short in complex clinical settings.Factors such as underlying disease pathophysiology,patient characteristics,and clinician subjectivity impact score performance and reliability.Moreover,disparities in validation across diverse populations limit generalizability.With growing interest in artificial intelligence(AI)and machine learning,there is potential for enhanced prediction models that integrate multidimensional data and adapt to individual patient profiles.However,current AI approaches face challenges related to interpretability,bias,and ethical implementation.This paper underscores the need for more robust,individualized,and transparent prediction systems and advocates for careful integration of emerging technologies into clinical workflows to optimize weaning success and patient outcomes.
基金funded by grants from Project supported by the Funds for Noncommunicable Chronic Diseases-National Science and Technology Major Project(No.2024ZD0520000,2024ZD0520003)Noncommunicable Chronic Diseases-National Science and Technology Major Project(No.2024ZD0524400,2024ZD0524403)+2 种基金National Natural Science Foundation of China(No.82388102)Jiangsu Medical Association Medical Research Project of Health Management,SYH-32099-0119(No.2024023)the Specialized Diseases Clinical Research Fund of Jiangsu Province Hospital(No.DL202411)。
文摘Objective:This study aimed to construct a model that predicts invasive lung cancer using longitudinal radiological features from multiple low-dose computed tomography(LDCT)scans,thereby addressing overdiagnosis in lung cancer screening.Methods:In this retrospective study,628 patients with pulmonary nodules who underwent three LDCT scans followed by surgical resection were categorized into invasive carcinoma(n=155)and non-invasive nodule(n=473)groups on the basis of pathological diagnosis.This derivation aimed to identify risk factors and construct a multivariate logistic model.The predictive performance was externally validated in two independent cohorts(retrospectively designed,n=252;prospectively designed,n=269).The discrimination and calibration of the model were evaluated using area under the curve(AUC),and calibration plots.Decision curve analysis(DCA)was further performed to evaluate the net benefit in practical clinical scenarios.Results:The model,termed multiple CTs-invasive lung cancer(MCT-ILC),incorporated eleven factors encompassing nodule features at baseline and feature variability during follow-up.The standard deviation of diameter variability(SD_(diameter))was the most reliable predictor,with an odds ratio[95% confidence interval(95%CI)of 7.35(5.32-10.16)(P<0.001)].AUCs with 95% CIs for the MCT-ILC model were 0.912(0.864-0.960)and 0.906(0.833-0.979)in the two testing cohorts and were superior to those for the model containing only features at baseline(PD_(elong)=0.002 and 0.021,respectively).For calibration,the Brier scores of the MCT-ILC model were0.091(95% CI:0.064-0.118) and 0.078(95% CI:0.055-0.101)in the two test sets.The decision curve image showed that the MCT-ILC model was the only model that maintained positive net benefits across the entire threshold range.Furthermore,the MCT-ILC model score could classify more than 90% of patients with invasive nodules into the high-risk group.Conclusions:The MCT-ILC model could assess pulmonary nodule invasiveness,potentially mitigating overdiagnosis in lung cancer screening.
基金National Natural Science Foundation of China (81973749 and 8143594)State Administration of Traditional Chinese Medicine High-level Chinese Medicine Key Discipline Construction Project (zyyzdxk-2023069)。
文摘Objective To develop an onset risk prediction nomogram for patients with homocysteine-type(H-type)hypertension(HTH)based on pulse diagram parameters to assist early clinical prediction and diagnosis of HTH.Methods Patients diagnosed with essential hypertension and admitted to Shanghai Shuguang Hospital Affiliated to Shanghai University of Traditional Chinese Medicine,Shang-hai Hospital of Traditional Chinese Medicine,and Shanghai Hospital of Integrated Tradition-al Chinese and Western Medicine from July 6th 2020 to June 16th 2021,and from August 11th 2023 to January 22nd 2024,were enrolled in this retrospective research.The baselines and clinical biochemical indicators of patients were collected.The SMART-I TCM pulse instru-ment was applied to gather pulse diagram parameters.Multivariate logistic regression was adopted to analyze the risk factors for HTH.RStudio was employed to construct the nomo-gram model,receiver operating characteristic(ROC)curve,and calibration curve(bootstrap self-sampling 200 times),and clinical decision curve were drawn to evaluate the model’s dis-crimination and clinical effectiveness.Results A total of 168 hospitalized patients with essential hypertension were selected and di-vided into non-HTH group(n=29)and HTH group(n=139).Compared with non-HTH group,HTH group had a lower body mass index(BMI),and higher proportions of male pa-tients and drinkers(P<0.05).The ventricular wall thickening(VWT)could not be deter-mined.The proportions of left common carotid intima-media wall thickness(LCCIMWT)and serum creatinine(SCR)were higher in HTH group(P<0.05).The pulse diagram parameter As was significantly higher,and H4/H1 and T1/T were lower in HTH group(P<0.05).Gender,al-cohol consumption,serum creatinine,and the pulse diagram parameter H4/H1 were identi-fied as independent risk factors for HTH(P<0.05).The nomogram’s area under the ROC curve(AUC)was 0.795[95%confidence interval(CI):(0.7066,0.8828)],with a specificity of 0.724 and sensitivity of 0.799.After 200 times repeated bootstrap self-samplings,the calibra-tion curve showed that the simulated curve fits well with the actual curve(x^(2)=9.5002,P=0.3019).The clinical decision curve indicated that the nomogram’s applicability was optimal when the threshold for predicting HTH was between 0.38 and 1.00.Conclusion The nomogram model could be valuable for predicting the onset risk of HTH and pulse diagram parameters can facilitate early screening and prevention of HTH.