Assessing generalisability of deep learning-based polyp detection and segmentation methods through a computer vision challenge

  • Sharib Ali*
  • , Noha Ghatwary
  • , Debesh Jha
  • , Ece Isik-Polat
  • , Gorkem Polat
  • , Chen Yang
  • , Wuyang Li
  • , Adrian Galdran
  • , Miguel Ángel González Ballester
  • , Vajira Thambawita
  • , Steven Hicks
  • , Sahadev Poudel
  • , Sang Woong Lee
  • , Ziyi Jin
  • , Tianyuan Gan
  • , Cheng Hui Yu
  • , Jiang Peng Yan
  • , Doyeob Yeo
  • , Hyunseok Lee
  • , Nikhil Kumar Tomar
  • Mahmood Haithami, Amr Ahmed, Michael A. Riegler, Christian Daul, Pål Halvorsen, Jens Rittscher, Osama E. Salem, Dominique Lamarque, Renato Cannizzaro, Stefano Realdon, Thomas de Lange, James E. East
*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

42 Citations (Scopus)

Abstract

Polyps are well-known cancer precursors identified by colonoscopy. However, variability in their size, appearance, and location makes the detection of polyps challenging. Moreover, colonoscopy surveillance and removal of polyps are highly operator-dependent procedures and occur in a highly complex organ topology. There exists a high missed detection rate and incomplete removal of colonic polyps. To assist in clinical procedures and reduce missed rates, automated methods for detecting and segmenting polyps using machine learning have been achieved in past years. However, the major drawback in most of these methods is their ability to generalise to out-of-sample unseen datasets from different centres, populations, modalities, and acquisition systems. To test this hypothesis rigorously, we, together with expert gastroenterologists, curated a multi-centre and multi-population dataset acquired from six different colonoscopy systems and challenged the computational expert teams to develop robust automated detection and segmentation methods in a crowd-sourcing Endoscopic computer vision challenge. This work put forward rigorous generalisability tests and assesses the usability of devised deep learning methods in dynamic and actual clinical colonoscopy procedures. We analyse the results of four top performing teams for the detection task and five top performing teams for the segmentation task. Our analyses demonstrate that the top-ranking teams concentrated mainly on accuracy over the real-time performance required for clinical applicability. We further dissect the devised methods and provide an experiment-based hypothesis that reveals the need for improved generalisability to tackle diversity present in multi-centre datasets and routine clinical procedures.

Original languageEnglish
Article number2032
JournalScientific Reports
Volume14
Issue number1
DOIs
Publication statusPublished - Dec 2024
Externally publishedYes

Fingerprint

Dive into the research topics of 'Assessing generalisability of deep learning-based polyp detection and segmentation methods through a computer vision challenge'. Together they form a unique fingerprint.

Cite this