期刊文献+
共找到4篇文章
< 1 >
每页显示 20 50 100
Development and validation of a stroke risk prediction model using regional healthcare big data and machine learning
1
作者 Yunxia Duan Rui Wang +6 位作者 Yumei Sun Wendi Zhu Yi Li Na Yu Yu Zhu Peng Shen Hongyu Sun 《International Journal of Nursing Sciences》 2025年第6期558-565,I0002,共9页
Objectives:This study aimed to develop and validate a stroke risk prediction model based on machine learning(ML)and regional healthcare big data,and determine whether it may improve the prediction performance compared... Objectives:This study aimed to develop and validate a stroke risk prediction model based on machine learning(ML)and regional healthcare big data,and determine whether it may improve the prediction performance compared with the conventional Logistic Regression(LR)model.Methods:This retrospective cohort study analyzed data from the CHinese Electronic health Records Research in Yinzhou(CHERRY)(2015–2021).We included adults aged 18–75 from the platform who had established records before 2015.Individuals with pre-existing stroke,key data absence,or excessive missingness(>30%)were excluded.Data on demographic,clinical measures,lifestyle factors,comorbidities,and family history of stroke were collected.Variable selection was performed in two stages:an initial screening via univariate analysis,followed by a prioritization of variables based on clinical relevance and actionability,with a focus on those that are modifiable.Stroke prediction models were developed using LR and four ML algorithms:Decision Tree(DT),Random Forest(RF),eXtreme Gradient Boosting(XGBoost),and Back Propagation Neural Network(BPNN).The dataset was split 7:3 for training and validation sets.Performance was assessed using receiver operating characteristic(ROC)curves,calibration,and confusion matrices,and the cutoff value was determined by Youden's index to classify risk groups.Results:The study cohort comprised 92,172 participants with 436 incident stroke cases(incidence rate:474/100,000 person-years).Ultimately,13 predictor variables were included.RF achieved the highest accuracy(0.935),precision(0.923),sensitivity(recall:0.947),and F1 score(0.935).Model evaluation demonstrated superior predictive performance of ML algorithms over conventional LR,with training/validation areaunderthe curve(AUC)sof0.777/0.779(LR),0.921/0.918(BPNN),0.988/0.980(RF),0.980/0.955(DT),and 0.962/0.958(XGBoost).Calibration analysis revealed a better fit for DT,LR and BPNN compared to RF and XGBoost model.Based on the optimal performance of the RF model,the ranking of factors in descending order of importance was:hypertension,age,diabetes,systolic blood pressure,waist,high-density lipoprotein Cholesterol,fasting blood glucose,physical activity,BMI,low-density lipoprotein cholesterol,total cholesterol,dietary habits,and family history of stroke.Using Youden's index as the optimal cutoff,the RF model stratified individuals into high-risk(>0.789)and low-risk(≤0.789)groups with robust discrimination.Conclusions:The ML-based prediction models demonstrated superior performance metrics compared to conventional LR and the RF is the optimal prediction model,providing an effective tool for risk stratifi cation in primary stroke prevention in community settings. 展开更多
关键词 Big data Machine learning NURSING Prediction model STROKE
暂未订购
酒精依赖的基于基因和基于通路的全基因组关联研究(英文) 被引量:1
2
作者 Lingjun ZUO Clarence K.ZHANG +5 位作者 Frederick G.SAYWARD Kei-Hoi CHEUNG Kesheng WANG John H.KRYSTAL Hongyu ZHAO Xingguang LUO 《上海精神医学》 CSCD 2015年第2期111-118,共8页
背景:信号通路中风险基因的构成可能可以解释酒精依赖风险基因协同的神经生物学作用。目的:识别酒精依赖的风险基因和风险基因通路。方法:我们采用基因富集(gene-set-rich)分析方法对酒精依赖进行了基于通路的全基因组关联分析(GWAS)。... 背景:信号通路中风险基因的构成可能可以解释酒精依赖风险基因协同的神经生物学作用。目的:识别酒精依赖的风险基因和风险基因通路。方法:我们采用基因富集(gene-set-rich)分析方法对酒精依赖进行了基于通路的全基因组关联分析(GWAS)。在包括1409名欧裔美国人(European-American,EA)酒精依赖者和1518名EA健康对照者的探索性样本人群中检测了近一百万个基因标志物。此外,将681名非裔美国人(African-American,AA)病例和508名AA健康受试者作为重测样本。结果:我们发现了几个与酒精依赖显著相关的可重复的全基因组风险基因和风险通路。在多重比较Bonferroni校正后,"细胞-细胞外基质相互作用"通路(EA样本中p<2.0E-4)和该通路中PXN基因(编码桩蛋白paxillin)(EA样本中p=3.9E-7)是最有可能的酒精依赖的危险因素。在EA样本(0.015≤p≤0.035)和AA样本(0.025≤p≤0.050)中还有两条富含酒精依赖相关基因的可重复的通路:"Na+/Cl-依赖性神经递质转运体"通路和"其他聚糖降解"通路。结论:一些基因和生物信号传导过程可能与酒精依赖的风险相关,本研究的发现为此提供了新的证据。 展开更多
关键词 全基因组 酒精 基础 关联 神经生物学 细胞外基质 信号通路 相互作用
暂未订购
YPED: An Integrated Bioinformatics Suite and Database for Mass Spectrometry-based Proteomics Research 被引量:4
3
作者 Christopher M.Colangelo Mark Shifman +11 位作者 Kei-Hoi Cheung Kathryn L.Stone Nicholas J.Carriero Erol E.Gulcicek TuKiet T.Lam Terence Wu Robert D.Bjornson Can Bruce Angus C.Nairn Jesse Rinehart Perry L.Miller Kenneth R.Williams 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2015年第1期25-35,共11页
We report a significantly-enhanced bioinformatics suite and database for proteomics research called Yale Protein Expression Database(YPED) that is used by investigators at more than 300 institutions worldwide. YPED ... We report a significantly-enhanced bioinformatics suite and database for proteomics research called Yale Protein Expression Database(YPED) that is used by investigators at more than 300 institutions worldwide. YPED meets the data management, archival, and analysis needs of a high-throughput mass spectrometry-based proteomics research ranging from a singlelaboratory, group of laboratories within and beyond an institution, to the entire proteomics community. The current version is a significant improvement over the first version in that it contains new modules for liquid chromatography–tandem mass spectrometry(LC–MS/MS) database search results, label and label-free quantitative proteomic analysis, and several scoring outputs for phosphopeptide site localization. In addition, we have added both peptide and protein comparative analysis tools to enable pairwise analysis of distinct peptides/proteins in each sample and of overlapping peptides/proteins between all samples in multiple datasets. We have also implemented a targeted proteomics module for automated multiple reaction monitoring(MRM)/selective reaction monitoring(SRM) assay development. We have linked YPED's database search results and both label-based and label-free fold-change analysis to the Skyline Panorama repository for online spectra visualization. In addition, we have built enhanced functionality to curate peptide identifications into an MS/MS peptide spectral library for all of our protein database search identification results. 展开更多
关键词 Proteomics Database Bioinformatics Mass spectrometry Repository Spectral library
原文传递
生物信息学简介
4
作者 梅长江 《国际精神病学杂志》 2011年第3期142-146,共5页
生物信息学是利用计算机技术为工具,使用应用数学、信息学、统计学方法研究生物学问题的一个新兴学科。它是在二十世纪八十年代计算机技术大大普及之后发展起来的。生物信息学研究的目的就是从这些数据中总结归纳出有用的结果和结论来,... 生物信息学是利用计算机技术为工具,使用应用数学、信息学、统计学方法研究生物学问题的一个新兴学科。它是在二十世纪八十年代计算机技术大大普及之后发展起来的。生物信息学研究的目的就是从这些数据中总结归纳出有用的结果和结论来,并应用这些结果和结论指导我们的生活实践或进一步的研究。 展开更多
关键词 生物信息学 生命科学
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部