An Individualized Algorithm to Predict Mortality in COVID-19 Pneumonia: a Machine Learning Based Study
IntroductionIdentifying SARS-CoV-2 patients at higher risk of mortality is crucial in the management of a pandemic. Artificial intelligence techniques allow to analyze big amount of data to find hidden patterns. We aimed to develop and validate a mortality score at admission for COVID-19 based on high-level machine learning.Material and methodsWe conducted a retrospective cohort study on hospitalized adults COVID-19 patients between March and December 2020. The primary outcome was in-hospital mortality. A machine learning approach on vital parameters, laboratory values, and demographic features was applied to develop different models. Then, a feature importance analysis was performed to reduce the number of variables included in the model, to develop a risk score with good overall performance, that was finally evaluated in terms of discrimination and calibration capabilities. All results underwent cross-validation.Results1,135 consecutive patients (median age 70 years, 64% males) were enrolled, 48 patients were excluded, the cohort was randomly divided in training (760) and test (327). During hospitalization, 251 (22%) patients died. After feature selection, the best performing classifier was random forest (AUC 0.88±0.03). Based on the relative importance of each variable, a pragmatic score was developed, showing good performances (AUC 0.85, ±0.025), and three levels were defined that correlated well with in-hospital mortality.ConclusionsMachine learning techniques were applied in order to develop an accurate in-hospital mortality risk score for COVID-19 based on ten variables. The application of the proposed score has utility in clinical settings to guide the management and prognostication of COVID-19 patients.