TY - GEN
T1 - Real-time informative laryngoscopic frame classification with pre-trained convolutional neural networks
AU - Galdran, Adrian
AU - Costa, P.
AU - Campilho, A.
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/4
Y1 - 2019/4
N2 - Visual exploration of the larynx represents a relevant technique for the early diagnosis of laryngeal disorders. However, visualizing an endoscopy for finding abnormalities is a time-consuming process, and for this reason much research has been dedicated to the automatic analysis of endoscopic video data. In this work we address the particular task of discriminating among informative laryngoscopic frames and those that carry insufficient diagnostic information. In the latter case, the goal is also to determine the reason for this lack of information. To this end, we analyze the possibility of training three different state-of-the-art Convolutional Neural Networks, but initializing their weights from configurations that have been previously optimized for solving natural image classification problems. Our findings show that the simplest of these three architectures not only is the most accurate (outperforming previously proposed techniques), but also the fastest and most efficient, with the lowest inference time and minimal memory requirements, enabling real-time application and deployment in portable devices.
AB - Visual exploration of the larynx represents a relevant technique for the early diagnosis of laryngeal disorders. However, visualizing an endoscopy for finding abnormalities is a time-consuming process, and for this reason much research has been dedicated to the automatic analysis of endoscopic video data. In this work we address the particular task of discriminating among informative laryngoscopic frames and those that carry insufficient diagnostic information. In the latter case, the goal is also to determine the reason for this lack of information. To this end, we analyze the possibility of training three different state-of-the-art Convolutional Neural Networks, but initializing their weights from configurations that have been previously optimized for solving natural image classification problems. Our findings show that the simplest of these three architectures not only is the most accurate (outperforming previously proposed techniques), but also the fastest and most efficient, with the lowest inference time and minimal memory requirements, enabling real-time application and deployment in portable devices.
KW - Convolutional neural networks
KW - Informative frame classification
KW - Laryngoscopy
KW - Real-time
UR - https://www.scopus.com/pages/publications/85073895580
U2 - 10.1109/ISBI.2019.8759511
DO - 10.1109/ISBI.2019.8759511
M3 - Conference contribution
AN - SCOPUS:85073895580
T3 - Proceedings - International Symposium on Biomedical Imaging
SP - 87
EP - 90
BT - ISBI 2019 - 2019 IEEE International Symposium on Biomedical Imaging
PB - IEEE Computer Society
T2 - 16th IEEE International Symposium on Biomedical Imaging, ISBI 2019
Y2 - 8 April 2019 through 11 April 2019
ER -