可解释机器学习模型预测心脏骤停患者院内死亡风险：基于MIMIC-Ⅳ 2.0数据库

龚欢欢; 柯晓伟; 王爱民; 李湘民

doi:10.12290/xhyxzz.2022-0733

可解释机器学习模型预测心脏骤停患者院内死亡风险：基于MIMIC-Ⅳ 2.0数据库

An Interpretable Machine Learning Model for Predicting In-hospital Death Risk in Patients with Cardiac Arrest: Based on US Medical Information Mart for Intensive Care Database Ⅳ 2.0

摘要

摘要:
目的构建可预测心脏骤停患者住院期间死亡风险的机器学习模型，并对其进行解释。
方法提取美国重症监护医学信息数据库Ⅳ(Medical Information Mart for Intensive Care database Ⅳ，MIMIC-Ⅳ)2.0中心脏骤停患者转入ICU 24 h内首次临床资料及住院期间转归，基于机器学习算法构建6种可预测心脏骤停患者院内死亡风险的模型，包括XGBoost模型、轻量级梯度提升机(light gradient boosting machine, LGBM)模型、决策树(decision tree, DT)模型、K近邻(K-nearest neighbor，KNN)模型、Logistic回归模型、随机森林(random forest, RF)模型。采用受试者操作特征(receiver operator characteristic, ROC)曲线、临床决策曲线及校准曲线对模型进行评价，并采用Shapley加性解释(Shapley additive explanation, SHAP)算法评估不同临床特征对最优模型的影响，以增加模型的可解释性。
结果共1465例符合纳入与排除标准的心脏骤停患者入选本研究。其中住院期间存活773例、死亡692例。经筛选，共纳入82个临床特征用于机器学习模型构建。模型评价结果显示，相较于其余5种模型，LGBM模型预测心脏骤停患者院内死亡的曲线下面积(area under the curve，AUC)更高0.834(95% CI: 0.688~0.894)，且相对于Logistic回归模型、XGBoost模型，其对死亡风险的预测准确性更高(校准度：0.166)，临床决策性能更优，整体性能最佳。SHAP算法分析显示，对LGBM模型输出结果影响最大的3个临床特征分别为格拉斯哥睁眼反应评分、碳酸氢盐水平、白细胞计数。
结论基于大型公共医疗卫生数据库建立的可预测心脏骤停患者住院期间死亡风险的机器学习模型中，LGBM模型性能最优，其可辅助临床进行更高效的疾病管理和更精准的医疗干预。

Abstract:
Objective To develop and validate an interpretable machine learning model based on clinical characteristics to predict the risk of in-hospital death in patients with cardiac arrest.
Methods First clinical data of cardiac arrest patients admitted to ICU within 24 h and outcomes during hospitalization were extracted from Medical Information Mart for Intensive Care database Ⅳ (MIMIC-Ⅳ) 2.0. Six models predicting in-hospital death risk of cardiac arrest patients were constructed based on machine learning algorithm: XGBoost model, light gradient boosting machine (LGBM) model, decision tree (DT) model, K-nearest neighbor (KNN) model, Logistic regression model, and random forest (RF) model. Receiver operator characteristic (ROC) curve, clinical decision curve and calibration curve were used to evaluate the 6 models. Shapley additive explanation (SHAP) algorithm was used to explain and evaluate the effects of different clinical features on the optimal model to increase its interpretability.
Results A total of 1465 patients with cardiac arrest who met inclusion and exclusion criteria were included in the study. Among them, 773 patients survived and 692 died during hospitalization. After screening, a total of 82 clinical features were included for machine learning model construction. Compared with the other five models, the LGBM model had a higher area under the curve for predicting in-hospital death in cardiac arrest patients 0.834(95% CI: 0.688-0.894), higher prediction accuracy for the risk of death than the Logistic regression model and XGBoost model (calibration degree: 0.166), better clinical decision performance, and displayed optimal overall performance. SHAP algorithm analysis showed that the three clinical features that had the greatest impact on the output of LGBM model were Glasgow eyes score, bicarbonate level and white blood cell count.
Conclusion Based on a large public medical and health database, a machine learning model named LGBM has the best performance to predict the risk of in-hospital death in patients with cardiac arrest, which will be helpful to assist more efficient clinical disease management and more precise medical intervention.

HTML全文

参考文献(28)

施引文献

资源附件(0)