Regional healthcare platforms collect clinical data from hospitals in specific areas for the purpose of healthcare management.It is a common requirement to reuse the data for clinical research.However,we have to face ...Regional healthcare platforms collect clinical data from hospitals in specific areas for the purpose of healthcare management.It is a common requirement to reuse the data for clinical research.However,we have to face challenges like the inconsistence of terminology in electronic health records (EHR) and the complexities in data quality and data formats in regional healthcare platform.In this paper,we propose methodology and process on constructing large scale cohorts which forms the basis of causality and comparative effectiveness relationship in epidemiology.We firstly constructed a Chinese terminology knowledge graph to deal with the diversity of vocabularies on regional platform.Secondly,we built special disease case repositories (i.e.,heart failure repository) that utilize the graph to search the related patients and to normalize the data.Based on the requirements of the clinical research which aimed to explore the effectiveness of taking statin on 180-days readmission in patients with heart failure,we built a large-scale retrospective cohort with 29647 cases of heart failure patients from the heart failure repository.After the propensity score matching,the study group (n=6346) and the control group (n=6346) with parallel clinical characteristics were acquired.Logistic regression analysis showed that taking statins had a negative correlation with 180-days readmission in heart failure patients.This paper presents the workflow and application example of big data mining based on regional EHR data.展开更多
目的·采用开源性软件REDCap,以母婴干预预防肥胖队列为示范,阐述基于网络的在线电子数据采集系统构建和数据质控管理。方法·安装REDCap软件,建立母婴干预队列为示范项目。基于研究方案,将队列数据收集的时点和问卷模块列成随...目的·采用开源性软件REDCap,以母婴干预预防肥胖队列为示范,阐述基于网络的在线电子数据采集系统构建和数据质控管理。方法·安装REDCap软件,建立母婴干预队列为示范项目。基于研究方案,将队列数据收集的时点和问卷模块列成随访图,设置和创建问卷和电子病历报告表单(electronic case report forms,eCRF),设置数据质量控制程序,设置用户使用权限。结果·在REDCap系统中创建示范项目,建立孕前期、孕期及儿童随访的各期eCRF。经测试后上线,进行数据收集和数据质量控制。设置访问权限实现多中心数据采集。通过质控程序核查和发送数据疑问,进行质控管理。REDCap数据可导出生成SAS、SPSS、R等多种格式的数据文件供常用统计软件使用。结论·基于REDCap建立的母婴干预队列数据平台和数据管理为项目提供了支撑。该模式可推广运用于其他流行病学人群研究。展开更多
基金Supported by the National Major Scientific and Technological Special Project for"Significant New Drugs Development’’(No.2018ZX09201008)Special Fund Project for Information Development from Shanghai Municipal Commission of Economy and Information(No.201701013)
文摘Regional healthcare platforms collect clinical data from hospitals in specific areas for the purpose of healthcare management.It is a common requirement to reuse the data for clinical research.However,we have to face challenges like the inconsistence of terminology in electronic health records (EHR) and the complexities in data quality and data formats in regional healthcare platform.In this paper,we propose methodology and process on constructing large scale cohorts which forms the basis of causality and comparative effectiveness relationship in epidemiology.We firstly constructed a Chinese terminology knowledge graph to deal with the diversity of vocabularies on regional platform.Secondly,we built special disease case repositories (i.e.,heart failure repository) that utilize the graph to search the related patients and to normalize the data.Based on the requirements of the clinical research which aimed to explore the effectiveness of taking statin on 180-days readmission in patients with heart failure,we built a large-scale retrospective cohort with 29647 cases of heart failure patients from the heart failure repository.After the propensity score matching,the study group (n=6346) and the control group (n=6346) with parallel clinical characteristics were acquired.Logistic regression analysis showed that taking statins had a negative correlation with 180-days readmission in heart failure patients.This paper presents the workflow and application example of big data mining based on regional EHR data.
文摘目的·采用开源性软件REDCap,以母婴干预预防肥胖队列为示范,阐述基于网络的在线电子数据采集系统构建和数据质控管理。方法·安装REDCap软件,建立母婴干预队列为示范项目。基于研究方案,将队列数据收集的时点和问卷模块列成随访图,设置和创建问卷和电子病历报告表单(electronic case report forms,eCRF),设置数据质量控制程序,设置用户使用权限。结果·在REDCap系统中创建示范项目,建立孕前期、孕期及儿童随访的各期eCRF。经测试后上线,进行数据收集和数据质量控制。设置访问权限实现多中心数据采集。通过质控程序核查和发送数据疑问,进行质控管理。REDCap数据可导出生成SAS、SPSS、R等多种格式的数据文件供常用统计软件使用。结论·基于REDCap建立的母婴干预队列数据平台和数据管理为项目提供了支撑。该模式可推广运用于其他流行病学人群研究。