TY - JOUR
T1 - Cost-Sensitive Ordinal Classification Methods to Predict SARS-CoV-2 Pneumonia Severity
AU - Garcia-Garcia, Fernando
AU - Lee, Dae Jin
AU - Espana Yandiola, Pedro Pablo
AU - Urrutia Landa, Isabel
AU - Martinez-Minaya, Joaquin
AU - Hayet-Otero, Miren
AU - Nieves Ermecheo, Monica
AU - Quintana, Jose Maria
AU - Menendez, Rosario
AU - Torres, Antoni
AU - Zalacain Jorge, Rafael
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2024/5/1
Y1 - 2024/5/1
N2 - Objective: To study the suitability of cost-sensitive ordinal artificial intelligence-machine learning (AI-ML) strategies in the prognosis of SARS-CoV-2 pneumonia severity. Materials & methods: Observational, retrospective, longitudinal, cohort study in 4 hospitals in Spain. Information regarding demographic and clinical status was supplemented by socioeconomic data and air pollution exposures. We proposed AI-ML algorithms for ordinal classification via ordinal decomposition and for cost-sensitive learning via resampling techniques. For performance-based model selection, we defined a custom score including per-class sensitivities and asymmetric misprognosis costs. 260 distinct AI-ML models were evaluated via 10 repetitions of 5 × 5 nested cross-validation with hyperparameter tuning. Model selection was followed by the calibration of predicted probabilities. Final overall performance was compared against five well-established clinical severity scores and against a 'standard' (non-cost sensitive, non-ordinal) AI-ML baseline. In our best model, we also evaluated its explainability with respect to each of the input variables. Results: The study enrolled n = 1548 patients: 712 experienced low, 238 medium, and 598 high clinical severity. d = 131 variables were collected, becoming dprime = 148 features after categorical encoding. Model selection resulted in our best-performing AI-ML pipeline having: 1)no imputation of missing data, 2)no feature selection (i.e. using the full set of dprime features), 3)'Ordered Partitions' ordinal decomposition, 4)cost-based reimbalance, and 5)a Histogram-based Gradient Boosting classifier. This best model (calibrated) obtained a median accuracy of 68.1% [67.3%, 68.8%] (95% confidence interval), a balanced accuracy of 57.0% [55.6%, 57.9%], and an overall area under the curve (AUC) 0.802 [0.795, 0.808]. In our dataset, it outperformed all five clinical severity scores and the 'standard' AI-ML baseline. Discussion & conclusion: We conducted an exhaustive exploration of AI-ML methods designed for both ordinal and cost-sensitive classification, motivated by a real-world application domain (clinical severity prognosis) in which these topics arise naturally. Our model with the best classification performance exploited successfully the ordering information of ground truth classes, coping with imbalance and asymmetric costs. However, these ordinal and cost-sensitive aspects are seldom explored in the literature.
AB - Objective: To study the suitability of cost-sensitive ordinal artificial intelligence-machine learning (AI-ML) strategies in the prognosis of SARS-CoV-2 pneumonia severity. Materials & methods: Observational, retrospective, longitudinal, cohort study in 4 hospitals in Spain. Information regarding demographic and clinical status was supplemented by socioeconomic data and air pollution exposures. We proposed AI-ML algorithms for ordinal classification via ordinal decomposition and for cost-sensitive learning via resampling techniques. For performance-based model selection, we defined a custom score including per-class sensitivities and asymmetric misprognosis costs. 260 distinct AI-ML models were evaluated via 10 repetitions of 5 × 5 nested cross-validation with hyperparameter tuning. Model selection was followed by the calibration of predicted probabilities. Final overall performance was compared against five well-established clinical severity scores and against a 'standard' (non-cost sensitive, non-ordinal) AI-ML baseline. In our best model, we also evaluated its explainability with respect to each of the input variables. Results: The study enrolled n = 1548 patients: 712 experienced low, 238 medium, and 598 high clinical severity. d = 131 variables were collected, becoming dprime = 148 features after categorical encoding. Model selection resulted in our best-performing AI-ML pipeline having: 1)no imputation of missing data, 2)no feature selection (i.e. using the full set of dprime features), 3)'Ordered Partitions' ordinal decomposition, 4)cost-based reimbalance, and 5)a Histogram-based Gradient Boosting classifier. This best model (calibrated) obtained a median accuracy of 68.1% [67.3%, 68.8%] (95% confidence interval), a balanced accuracy of 57.0% [55.6%, 57.9%], and an overall area under the curve (AUC) 0.802 [0.795, 0.808]. In our dataset, it outperformed all five clinical severity scores and the 'standard' AI-ML baseline. Discussion & conclusion: We conducted an exhaustive exploration of AI-ML methods designed for both ordinal and cost-sensitive classification, motivated by a real-world application domain (clinical severity prognosis) in which these topics arise naturally. Our model with the best classification performance exploited successfully the ordering information of ground truth classes, coping with imbalance and asymmetric costs. However, these ordinal and cost-sensitive aspects are seldom explored in the literature.
KW - Artificial intelligence
KW - COVID-19
KW - SARS-CoV-2 pneumonia
KW - cost-sensitive classification
KW - ordinal classification
KW - severity prediction
UR - https://www.scopus.com/pages/publications/85187291720
U2 - 10.1109/JBHI.2024.3363765
DO - 10.1109/JBHI.2024.3363765
M3 - Article
C2 - 38329848
AN - SCOPUS:85187291720
SN - 2168-2194
VL - 28
SP - 2613
EP - 2623
JO - IEEE Journal of Biomedical and Health Informatics
JF - IEEE Journal of Biomedical and Health Informatics
IS - 5
ER -