
Understanding metric-related pitfalls in image analysis validation

  • Annika Reinke*
  • Minu D. Tizabi*
  • Michael Baumgartner
  • Matthias Eisenmann
  • Doreen Heckmann-Nötzel
  • A. Emre Kavur
  • Tim Rädsch
  • Carole H. Sudre
  • Laura Acion
  • Michela Antonelli
  • Tal Arbel
  • Spyridon Bakas
  • Arriel Benis
  • Florian Buettner
  • M. Jorge Cardoso
  • Veronika Cheplygina
  • Jianxu Chen
  • Evangelia Christodoulou
  • Beth A. Cimini
  • Keyvan Farahani
  • Luciana Ferrer
  • Adrian Galdran
  • Bram van Ginneken
  • Ben Glocker
  • Patrick Godau
  • Daniel A. Hashimoto
  • Michael M. Hoffman
  • Merel Huisman
  • Fabian Isensee
  • Pierre Jannin
  • Charles E. Kahn
  • Dagmar Kainmueller
  • Bernhard Kainz
  • Alexandros Karargyris
  • Jens Kleesiek
  • Florian Kofler
  • Thijs Kooi
  • Annette Kopp-Schneider
  • Michal Kozubek
  • Anna Kreshuk
  • Tahsin Kurc
  • Bennett A. Landman
  • Geert Litjens
  • Amin Madani
  • Klaus Maier-Hein
  • Anne L. Martel
  • Erik Meijering
  • Bjoern Menze
  • Karel G.M. Moons
  • Henning Müller
  • Brennan Nichyporuk
  • Felix Nickel
  • Jens Petersen
  • Susanne M. Rafelski
  • Nasir Rajpoot
  • Mauricio Reyes
  • Michael A. Riegler
  • Nicola Rieke
  • Julio Saez-Rodriguez
  • Clara I. Sánchez
  • Shravya Shetty
  • Ronald M. Summers
  • Abdel A. Taha
  • Aleksei Tiulpin
  • Sotirios A. Tsaftaris
  • Ben Van Calster
  • Gaël Varoquaux
  • Ziv R. Yaniv
  • Paul F. Jäger*
  • Lena Maier-Hein*
*Corresponding author for this work
  • German Cancer Research Center
  • Heidelberg University 
  • University College London
  • King's College London
  • Universidad de Buenos Aires
  • McGill University
  • Indiana University Bloomington
  • University of Pennsylvania
  • Holon Institute of Technology
  • European Federation for Medical Informatics
  • Frankfurt Cancer Institute
  • IT University of Copenhagen
  • Leibniz-Institut für Analytische Wissenschaften
  • Broad Institute
  • National Institutes of Health
  • Ciudad Autónoma de Buenos Aires
  • University of Adelaide
  • Fraunhofer Institute for Digital Medicine
  • Radboud University Nijmegen
  • Imperial College London
  • Princess Margaret Cancer Centre
  • University of Toronto
  • Vector Institute
  • LTSI - UMR 1099
  • Institut national de la santé et de la recherche médicale
  • Max Delbrück Center for Molecular Medicine in the Helmholtz Association
  • University of Potsdam
  • Friedrich-Alexander University Erlangen-Nürnberg
  • IHU Strasbourg
  • University of Duisburg-Essen
  • Helmholtz AI
  • Lunit, Inc.
  • Masaryk University
  • European Molecular Biology Laboratory
  • Stony Brook University
  • Vanderbilt University
  • University Health Network
  • Sunnybrook Research Institute
  • University of New South Wales
  • University of Zurich
  • Utrecht University
  • University of Applied Sciences Western Switzerland
  • University of Geneva
  • MILA (Québec Artificial Intelligence Institute)
  • University of Hamburg
  • Allen Institute for Cell Science
  • University of Warwick
  • University of Bern
  • Simula Metropolitan Center for Digital Engineering
  • University of Tromsø – The Arctic University of Norway
  • NVIDIA
  • University of Amsterdam
  • Alphabet Inc.
  • Vienna University of Technology
  • University of Oulu
  • University of Edinburgh
  • KU Leuven
  • Leiden University
  • Institut national de recherche en informatique et en automatique

Research output: Contribution to journal, Article, peer-reviewed

113 Citations (Scopus)

Abstract

Validation metrics are key for tracking scientific progress and bridging the current chasm between artificial intelligence research and its translation into practice. However, increasing evidence shows that, particularly in image analysis, metrics are often chosen inadequately. Although taking into account the individual strengths, weaknesses and limitations of validation metrics is a critical prerequisite to making educated choices, the relevant knowledge is currently scattered and poorly accessible to individual researchers. Based on a multistage Delphi process conducted by a multidisciplinary expert consortium as well as extensive community feedback, the present work provides a reliable and comprehensive common point of access to information on pitfalls related to validation metrics in image analysis. Although focused on biomedical image analysis, the addressed pitfalls generalize across application domains and are categorized according to a newly created, domain-agnostic taxonomy. The work serves to enhance global comprehension of a key topic in image analysis validation.

Original language: English
Pages (from-to): 182-194
Number of pages: 13
Journal: Nature Methods
Volume: 21
Issue number: 2
DOI
Status: Published - Feb 2024
Externally published
