应用机器学习建立北京协和医院急性肾损伤预测模型: 单中心研究计划

Establishing AKI Warning System in Peking Union Medical College Hospital from a Machine Learning Approach: A Single-center Research Protocol

  • 摘要:
      研究背景及目的  院内急性肾损伤(acute kidney injury, AKI)是住院患者住院时间延长和预后不良的独立危险因素。利用住院电子病历系统(electronic medical record, EMR)早期预警模型对AKI进行识别并及时干预,对降低AKI发生率、减轻AKI严重程度并改善患者预后具有重要意义。目前基于EMR的AKI相关研究主要针对单学科病房住院患者,采用传统统计学方法进行回顾性分析,尚缺乏基于人工智能技术的大规模多学科病房含时效信息的AKI风险预警模型并以此进行前瞻性干预的研究。本研究计划基于全病程全病历系统收集住院患者的完整临床信息,通过大样本数据及机器学习算法,旨在建立多学科病房的AKI预测模型。
      方法  本研究计划分为回顾性研究和前瞻性研究两部分。回顾性研究中,纳入2016年1月1日至2020年12月31日北京协和医院所有成年住院患者。通过全病程全病历系统,收集其一般资料、临床诊断、生命体征、实验室检查结果和住院病历等相关信息,采用Logistic回归、朴素贝叶斯、随机森林、支持向量机、梯度提升决策树、循环神经网络的机器学习算法,构建可预测AKI发生风险的预警模型,并对模型的准确性进行验证。前瞻性研究纳入北京协和医院连续12个月的所有成年住院患者。其中AKI预警系统启动前6个月的所有成年住院患者为对照组,AKI预警系统启动后6个月的所有成年住院患者为干预组。干预组中,将AKI预警系统嵌入EMR,对所有住院24 h以上的患者每6小时进行1次实时未来48 h内AKI风险评估,并对高危患者进行早期干预。对照组无AKI风险评估及报警提示,无相应干预措施。比较两组患者AKI与AKI 3级发生率、AKI缓解率、终末期肾病进展率、住院期间死亡率及住院时间、住院费用等指标差异。
      预期结果  回顾性研究中,共纳入约127 000例住院患者,其中院内AKI患者14 605例。构建的多学科病房AKI预测模型可提前24~48 h预测住院患者发生AKI的风险,其中提前24 h预测AKI的受试者工作特征曲线下面积>0.80。前瞻性研究中,纳入34 748例住院患者,其中干预组和对照组均为17 374例。干预组肾脏替代治疗的时间、住院时间较对照组缩短(P<0.05),肾脏替代治疗的比例、AKI与AKI 3级发生率、终末期肾病进展率、住院期间死亡率、住院费用均低于对照组(P<0.05),AKI缓解率高于对照组(P<0.05)。
      预期结论  基于EMR构建的多学科病房AKI预测模型可提前24~48 h预测住院患者发生AKI的风险,降低AKI发生率及其严重程度,改善患者预后。

     

    Abstract:
      Background and Objective  In-hospital acute kidney injury (AKI) has a significant negative impact on patients' outcome and the length of hospital stay. It is significantly important to use the early warning of electronic medical records (EMR) to identify and intervene AKI in a timely manner so as to reduce the severity of AKI and to improve the prognosis of patients. At present, AKI-related research based on the EMR system mainly uses traditional statistical methods for retrospective analysis, mainly for inpatients in single-disciplinary wards, and there is still a lack of early warning models of AKI risk based on artificial intelligence technology in large-scale multi-disciplinary wards with time-sensitive information and further prospective research. This study aims to develop a multiple-ward AKI prediction model tailored for general hospitals in China based on machine-learning algorithms and big data acquired by the EMR system.
      Methods  This single-center study consists of both a retrospective observational study and a prospective study. All hospitalized adult patients admitted in Peking Union Medical College Hospital (PUMCH) between 2016 and 2020 were included in the retrospective study. Logistic regression, naive Bayes, random forest, support vector machine, gradient boosting and recurrent neural network will be used for modeling based on demographics, clinical feature, vital signs, imaging, lab results and hospitalized medical records, which aims to predict AKI 24-48 h in advance and will be internally validated. The prospective study intends to include all adult inpatients in PUMCH for 12 consecutive months. Among them, all adult hospitalized patients within 6 months before the AKI early warning system is launched will be of the control group, and all adult hospitalized patients within 6 months after the AKI early warning system is launched will be of the intervention group. In the intervention group, the AKI early warning system will be embedded in the EMR, and all patients hospitalized for more than 24 hours will be assessed for AKI risk in the next 48 hours in real time every 6 hours. Early intervention will be carried out for high-risk patients. The control group does not have above-mentioned high-risk and alarm prompts of AKI, and no corresponding intervention measures. The incidence of AKI and AKI grade 3, AKI remission rate, end-stage renal disease progression rate, mortality during hospitalization, length of stay, hospitalization expenses and other indicators will be compared between the two groups.
      Expected Results  An estimated number of 127 000 in-hospital patients will be included in the retrospective study, among which 14 605 patients suffer from AKI. The prediction model is expected to predict AKI 24-48 h in advance and the aim for area under receiver operating characteristics curve should be > 0.80. In the prospective study, 34 748 inpatients will be enrolled, including 17 374 in both the intervention group and the control group. The duration time of renal replacement therapy and length of hospital stay in the intervention group should be shorter than those in the control group (P < 0.05); the proportion of renal replacement therapy, the incidence of AKI and AKI 3, the rate of progression of end-stage renal disease, the mortality rate during hospitalization, and the hospitalization cost should be lower than those in the control group (P < 0.05), and the AKI remission rate should be higher than that in the control group (P < 0.05).
      Expected conclusion  EMR-based multi-ward AKI prediction model will predict AKI risk 24-48 h in advance, which will lower AKI incidence and severity, and improve clinical outcomes.

     

/

返回文章
返回