TY - GEN
T1 - Data-Centric Label Smoothing for Explainable Glaucoma Screening from Eye Fundus Images
AU - Galdran, Adrian
AU - Ballester, Miguel A.Gonzalez
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - As current computing capabilities increase, modern machine learning and computer vision system tend to increase in complexity, mostly by means of larger models and advanced optimization strategies. Although often neglected, in many problems there is also much to be gained by considering potential improvements in understanding and better leveraging already-available training data, including annotations. This so-called data-centric approach can lead to substantial performance increases, sometimes beyond what can be achieved by larger models. In this paper we adopt such an approach for the task of justifiable glaucoma screening from retinal images. In particular, we focus on how to combine information from multiple annotators of different skills into a tailored label smoothing scheme that allows us to better employ a large collection of fundus images, instead of discarding samples suffering from inter-rater variability. Internal validation results indicate that our bespoke label smoothing approach surpasses the performance of a standard resnet50 model and also the same model trained with conventional label smoothing techniques, in particular for the multi-label scenario of predicting clinical reasons of glaucoma likelihood in a highly imbalanced screening context. Our code is made available at github.com/agaldran/justraigs.
AB - As current computing capabilities increase, modern machine learning and computer vision system tend to increase in complexity, mostly by means of larger models and advanced optimization strategies. Although often neglected, in many problems there is also much to be gained by considering potential improvements in understanding and better leveraging already-available training data, including annotations. This so-called data-centric approach can lead to substantial performance increases, sometimes beyond what can be achieved by larger models. In this paper we adopt such an approach for the task of justifiable glaucoma screening from retinal images. In particular, we focus on how to combine information from multiple annotators of different skills into a tailored label smoothing scheme that allows us to better employ a large collection of fundus images, instead of discarding samples suffering from inter-rater variability. Internal validation results indicate that our bespoke label smoothing approach surpasses the performance of a standard resnet50 model and also the same model trained with conventional label smoothing techniques, in particular for the multi-label scenario of predicting clinical reasons of glaucoma likelihood in a highly imbalanced screening context. Our code is made available at github.com/agaldran/justraigs.
KW - Data-Centric Computer Vision
KW - Explainability
KW - Glaucoma Screening
KW - Label Smoothing
UR - https://www.scopus.com/pages/publications/85203350590
U2 - 10.1109/ISBI56570.2024.10635220
DO - 10.1109/ISBI56570.2024.10635220
M3 - Conference contribution
AN - SCOPUS:85203350590
T3 - Proceedings - International Symposium on Biomedical Imaging
BT - IEEE International Symposium on Biomedical Imaging, ISBI 2024 - Conference Proceedings
PB - IEEE Computer Society
T2 - 21st IEEE International Symposium on Biomedical Imaging, ISBI 2024
Y2 - 27 May 2024 through 30 May 2024
ER -