基于监督机器学习算法构建急诊多发伤患者院内死亡的预测模型

Construction of an in-hospital mortality prediction model for emergency multiple trauma patients based on supervised machine learning algorithms

原文传递

导出

摘要目的:基于不同监督机器学习算法,构建适用于急诊多发伤患者院内死亡风险的最佳预测模型。方法:回顾性分析2019年1月—2023年12月首都医科大学大兴教学医院收治的817例急诊多发伤患者的临床资料,其中男性602例,女性215例;年龄18~89岁,平均(54.82±17.25)岁。以患者的一般资料、实验室检查指标等作为相关预测变量,研究终点为院内死亡。将患者按照7:3比例简单随机拆分为训练集(n=571)和测试集(n=246),在训练集中对院内生存组与死亡组的相关变量进行单因素分析,筛选出两组间差异具有统计学意义的变量后进行LASSO回归分析,筛选出非零系数变量作为最终入选特征。选择逻辑回归(LR)、随机森林(RF)、支持向量机(SVM)3种监督机器学习算法构建模型。在测试集中对各模型的性能进行评估,采用受试者操作特征(ROC)曲线验证模型的预测效能。正态分布的计量资料以均数±标准差(x^-±s)表示,组间比较采用t检验;非正态分布的计量资料以中位数和四分位间距[M(Q_(1),Q_(3))]表示,组间比较采用秩和检验。计数资料以例数和百分比[例(%)]表示,组间比较采用χ^(2)检验或Fisher确切概率法。结果:共纳入817例患者,死亡65例,死亡率为8.0%。基于训练集数据进行单因素分析,之后将差异具有统计学意义的变量进行LASSO回归分析,结果显示,患者年龄、白蛋白、红细胞计数(RBC)、肌酸激酶、葡萄糖、脑钠肽、C反应蛋白、乳酸、二氧化碳分压(PCO_(2))、低密度脂蛋白胆固醇、凝血酶原时间(PT)、纤维蛋白原、纤维蛋白降解产物(FDP)、肌钙蛋白I、降钙素原(PCT)、创伤程度评分(ISS)、格拉斯哥昏迷评分(GCS)共17个变量为急诊多发伤患者院内死亡的危险因素。根据上述17个变量建立3种监督机器学习模型,LR模型中重要性排名前5位的分别为PCO_(2)、PCT、FDP、PT和RBC,RF模型中重要性排名前5位的分别为PCO_(2)、ISS、葡萄糖、白蛋白和GCS,SVM模型中重要性排名前5位的分别为PCT、FDP、PCO_(2)、PT、葡萄糖。在测试集中进行模型效果评估,结果显示,LR模型的曲线下面积(AUC)为0.952,特异性为0.996,准确率为0.963,灵敏度和召回率均为0.600。RF模型的AUC值为0.970,优于LR和SVM模型,特异性为0.987,准确率为0.959,灵敏度和召回率均为0.650。SVM模型的AUC值为0.944,特异性为0.996,准确率为0.947,灵敏度和召回率均为0.400。3种模型各有优势,但RF模型在综合性能上表现最优。结论:以PCO_(2)、ISS、葡萄糖、白蛋白、GCS等17个最佳变量构建的RF模型对急诊多发伤患者院内死亡有较强的预测能力,值得临床进一步研究。 Objective:To construct the optimal prediction model for in-hospital mortality risk in emergency multiple trauma patients based on different supervised machine learning algorithms.Methods:A retrospective analysis was conducted on the clinical data of 817 patients with emergency multiple trauma who were admitted to the Daxing Teaching Hospital,Capital Medical University from January 2019 to December 2023.Among them,602 were males and 215 were females,the age ranged from 18 to 89 years,with an average of(54.82±17.25)years.The general information and laboratory test indicators of patients were collected as relevant predictor variables,with in-hospital mortality defined as the study endpoint.The patients were simply and randomly divided into the training set(n=571)and the testing set(n=246)in a 7∶3 ratio.Univariate analysis was performed on the training set to compare the relevant variables between the survival and death groups.Variables with statistical significance were then subjected to LASSO regression analysis to identify predictors with non-zero coefficients,which were selected as final features.Three supervised machine learning models,namely Logistic regression(LR),random forest(RF),and support vector machine(SVM)were selected to construct the model.The predictive performance of each model in testing set was evaluated,and the predictive efficacy of the models was verified using receiver operating characteristic(ROC)curve.The measurement data of normal distribution were expressed as mean±standard deviation(x^-±s),and comparisons between groups were conducted using the t-test.The measurement data with non-normal distribution were expressed as median and interquartile range[M(Q_(1),Q_(3))],and comparisons between groups were conducted using rank-sum tests.The count data were expressed as the number of cases and percentages,and comparisons between groups were conducted using the Chi-test or Fisher exact probability method.Results:A total of 817 patients were included,with 65 deaths,resulting in a mortality of 8.0%.Univariate analysis was conducted based on the training set data,and then LASSO regression analysis was performed on the variables with statistically significant differences.The results showed 17 variables were risk factors for in-hospital mortality in patients with emergency multiple trauma,including age,albumin,red blood cell(RBC),creatine kinase(CK),glucose(GLU),brain natriuretic peptide(BNP),C-reactive protein(CRP),lactic acid,PCO_(2),low-density lipoprotein cholesterol(LDL-C),prothrombin time(PT),fibrinogen(FIB),fibrin degradation products(FDP),troponin I(TNI),procalcitonin(PCT),injury severity score(ISS),and Glasgow coma scale(GCS).Based on the above 17 variables,three supervised machine learning models were established.Among the LR model,the top 5 in terms of importance were PCO_(2),PCT,FDP,PT,and RBC.Among the RF model,the top 5 in terms of importance were PCO_(2),ISS,GLU,ALB,and GCS.Among the SVM model,the top 5 in terms of importance were PCT,FDP,PCO_(2),PT,and GLU.Model performance evaluation in the testing set showed that the area under the curve(AUC)of the LR model was 0.952,the specificity was 0.996,the accuracy was 0.963,and both the sensitivity and recall rate were 0.600.The AUC of the RF model was 0.970,better than the LR and SVM models,the specificity was 0.987,the accuracy was 0.959,and both the sensitivity and recall rate were 0.650.The AUC of the SVM model was 0.944,the specificity was 0.996,the accuracy was 0.947,and both the sensitivity and recall rate were 0.400.Each model had its strengths,but the RF model demonstrated the best overall performance.Conclusion:The RF model constructed using 17 optimal variables such as PCO_(2),ISS,GLU,ALB,and GCS shows strong predictive capability for in-hospital mortality in emergency multiple trauma patients and warrants further clinical investigation.

作者黄东明王卫粮 Huang Dongming;Wang Weiliang(Department of Emergency,Daxing Teaching Hospital,Capital Medical University,Beijing 102600,China;Department of Trauma,Daxing Teaching Hospital,Capital Medical University,Beijing 102600,China)

机构地区首都医科大学大兴教学医院急诊科首都医科大学大兴教学医院创伤科

出处《国际外科学杂志》 2025年第11期753-760,F0003,共9页 International Journal of Surgery

基金北京市大兴区人民医院院级课题(4202406497)。

关键词多处创伤急诊治疗医院死亡率监督机器学习预测模型 Multiple trauma Emergeney treatment Hospital mortality Supervised machine leaming Predictive model

分类号 R641 [医药卫生—外科学]

引文网络
相关文献

参考文献2

1白丽爽,王兴义,杨立山.多发伤患者预后列线图模型的构建和研究[J].中华急诊医学杂志,2023,32(4):540-545. 被引量：4
2Katherine C Bergus,Kelli N Patterson,Lindsey Asti,Josh Bricker,Tariku J Beyene,Lauren N Schulz,Dana M Schwartz,Rajan K Thakkar,Eric A Sribnick.Association of initial assessment variables and mortality in severe pediatric traumatic brain injury[J].World Journal of Pediatric Surgery,2024,7(2):125-133. 被引量：2

二级参考文献12

1都定元,王建柏.中国创伤外科发展现状与展望[J].创伤外科杂志,2018,20(3):161-165. 被引量：40
2黄彪,李建国,黄发贵.多发伤的诊疗进展[J].医学综述,2019,25(5):973-977. 被引量：25
3冯珂,陈中伟,杜武军,黄存.初始凝血指标在多发伤患者病情严重程度评价中的应用[J].创伤外科杂志,2019,21(12):948-951. 被引量：4
4谭明东.多发伤骨盆骨折患者早期急诊救治流程的建立方法及效果[J].山西医药杂志,2020,49(13):1690-1692. 被引量：13
5张鹏,杜哲,刘中砥,黄伟,张亚军,王天兵.我国医生对严重创伤评估能力的多中心调查[J].中华急诊医学杂志,2021,30(5):533-536. 被引量：18
6漆靖,孙传政,刘怀政,周柯夫,戴哲人,唐亦舒.肾灌注压估算值对严重多发伤患者发生急性肾损伤的预测价值[J].中华急诊医学杂志,2021,30(8):968-972. 被引量：11
7周鑫,徐炎松,孙远松,尹纯林,姜大同,李贺.HMGB1、suPAR、WBC、PCT在创伤脓毒症中的早期诊断及预后评估价值[J].中华急诊医学杂志,2021,30(8):1015-1018. 被引量：25
8任杰,朱飞奇.重型颅脑损伤合并多发伤患者早期死亡的决策树模型研究[J].中国急救医学,2022,42(4):343-346. 被引量：11
9王雄伟,郁毅刚,姚猛飞,叶军明.创伤评分在危重症创伤患者评估中的应用[J].赣南医学院学报,2022,42(2):204-208. 被引量：5
10白恒,梁祎鑫,刘思扬,牟雪琳,党星波.多发伤合并休克患者临床治疗标准化措施的研究进展分析[J].中国标准化,2022(18):241-244. 被引量：4

共引文献4

1苗振军,张登奎,梁亚鹏,周峰,刘志祯,蔡华忠.多发伤患者院内死亡独立危险因素分析及预测模型的构建与验证[J].中华创伤杂志,2023,39(7):643-651. 被引量：9
2徐子文,喻婷,邵爱丹,闫红丽,张晓霞,董新玲.严重创伤进展为持续炎症-免疫抑制-分解代谢综合征的影响因素分析[J].临床急诊杂志,2023,24(11):555-560. 被引量：3
3王京京,蒋文佳,李彦泽,薛婷,叶英,燕宪亮,许铁,花嵘.急性多发伤后早期血糖波动对创伤后应激障碍发病的影响[J].中华急诊医学杂志,2024,33(5):623-629. 被引量：4
4刘威,侯君,唐龙泉,周鹏,钟艳,罗沁妍,况小雨,刘华,熊紫清,熊伟,吴承高,乐爱平.基于多中心的儿童颅脑创伤患者临床输血影响因素分析及预测模型构建[J].实用医学杂志,2025,41(4):553-560. 被引量：2

1黄淘克,郝本川,刘宏斌.老年心力衰竭患者院内死亡风险列线图预测模型的构建与评价[J].中华老年心脑血管病杂志,2025,27(5):581-586.
2彭小玲,阮春花,张梦玲,彭烨,陈卓敏.急诊多发伤病人早期死亡的危险因素及列线图预测模型建立[J].安徽医药,2025,29(6):1138-1142.
3李辰华,张腾飞,沈银龙.基于信息数字化平台的多学科协作模式对多发性创伤患者的急救效果[J].医学临床研究,2025,42(11):1996-1998.
4邢云超,陈学明,冯海.院内获得性肺栓塞患者的临床特征及预后情况分析[J].国际外科学杂志,2025,52(3):163-169.
5韩萍,李伟,刘梦珠,黄佩佩,段玲,王磊.基于创伤评分的一体化急救护理模式在急诊多发性创伤患者急救中的应用价值[J].医学临床研究,2025,42(6):1056-1058.
6谭晶今,张雁林.急性呼吸窘迫综合征相关生物标志物研究进展[J].中国呼吸与危重监护杂志,2025,24(10):745-754.
7张登奎,苗振军,梁亚鹏,周峰,尹其翔,蔡华忠.多发伤患者合并急性肾损伤的危险因素及其预测模型构建与效能评估[J].中华创伤杂志,2025,41(2):177-187. 被引量：3
8张晶,王丽竹,黄晓霞,封秀琴.重度肥胖患者严重创伤后合并急性呼吸窘迫综合征的护理[J].中华急危重症护理杂志,2025,6(8):957-959. 被引量：1
9桂妮.心理护理在急诊内科:驱散患者心头的阴霾[J].家庭科学,2025(11):150-151.
10万家.离不掉逃不了过不好的婚姻[J].恋爱·婚姻·家庭(上半月),2024(7):32-33.

国际外科学杂志

2025年第11期

浏览历史

内容加载中请稍等...

基于监督机器学习算法构建急诊多发伤患者院内死亡的预测模型

参考文献2

二级参考文献12

共引文献4

相关作者

相关机构

相关主题

浏览历史