重症监护病房脓毒症患者临床表型识别与验证

Clinical Phenotype Identification and Validation of Patients with Sepsis in the Intensive Care Unit

  • 摘要:
    目的 探讨重症监护病房(intensive care unit, ICU)脓毒症患者临床表型的识别与验证。
    方法 采用人工智能算法处理重症监护医学信息Ⅳ(Medical Information Mart for Intensive Care Ⅳ, MIMIC-Ⅳ)2.2数据库中脓毒症患者的临床数据, 包括人口学特征、ICU首日实验室指标及治疗措施等89个临床特征, 首先应用无监督机器学习算法(K均值聚类和层次聚类)进行表型识别, 然后应用有监督机器学习算法(轻量级梯度增强机)进行表型预测, 并结合机器学习可解释性算法SHAP识别重要特征; 最后通过传统统计学方法, 对各表型间的临床特征差异与临床结局差异进行验证。
    结果 在22 517例脓毒症患者中发现了3种临床特征及结局显著不同的临床表型。其中表型1患者的死亡风险最高(28 d死亡率为46.4%), 以肾功能异常和疾病严重程度评分升高为主; 表型3患者的死亡风险最低(28 d死亡率为11.2%), 神经功能评分最佳。通过可解释性机器学习, 识别出阴离子间隙、血尿素氮、肌酐、格拉斯哥昏迷量表评分、凝血酶原时间和序贯性器官功能衰竭评分6个特征(首日最差值)在表型识别(AUC≥0.89)和表型预后预测(AUC≥0.74)方面表现良好。在患者出ICU后28 d、60 d、90 d及1年内, 表型3的死亡风险均最低(HR < 1)。
    结论 利用机器学习算法成功识别了3种具有不同临床特征及预后的脓毒症临床表型, 并筛选出6个关键临床特征, 预期将在脓毒症亚型分类及预后评估中发挥重要作用, 有助于患者的个体化治疗。

     

    Abstract:
    Objective To identify and validate the clinical phenotypes of patients with sepsis in the intensive care unit(ICU).
    Methods We applied unsupervised machine learning algorithms (K-means clusteringand hierarchical clustering) to identify the phenotypes of sepsis patients in the Medical Information Mart for Intensive Care Ⅳ (MIMIC-Ⅳ) 2.2 database, based on 89 clinical features including demographic characteristics, laboratory indicators and treatment measures on the first day in ICU. Then, supervised machine learning algorithms (lightweight gradient boosting machine) were used for the prediction of the patient's phenotypes, and were further combined with SHAP (Shapely Additive eXplanations) for the identification of important features. Finally, traditional statistical methods were used to validate the differences in clinical characteristics and clinical outcomes among the phenotypes.
    Results We identified three phenotypes in 22 517 sepsis patients. The phenotype 1 patients had the highest risk of death (28-day mortality of 46.4%), dominated by abnormal renal function and elevated disease severity scores, while the phenotype 3 patients had the lowest risk of death (28-day mortality of 11.2%), and the best neurological function score. Using interpretable machine learning, we identified six features (all the worst value on the first day) that showed good performance in phenotypic identification(AUC≥0.89) and phenotypic prognostic prediction (AUC≥0.74): anion gap, blood urea nitrogen, creatinine, Glasgow Coma Scale score, prothrombin time, and Sequential Organ Failure Assessment score. The mortality risk of phenotype 3 patients was the lowest at 28 days, 60 days, 90 days, and 1 year after ICU discharge (HR < 1).
    Conclusion Using machine learning methods, we successfully identified three clinical phenotypes of sepsis patients with different clinical characteristics and prognosis and screened out six key clinical features, which are expected to play an important role in the phenotype classification and prognostic assessment of sepsis and are conducive to individualized treatment.

     

/

返回文章
返回