Skip to main navigation Skip to search Skip to main content

AutoPET Challenge on Fully Automated Lesion Segmentation in Oncologic PET/CT Imaging, Part 2: Domain Generalization

  • Jakob Dexl
  • , Sergios Gatidis
  • , Marcel Früh
  • , Katharina Jeblick
  • , Andreas Mittermeier
  • , Anna Theresa Stüber
  • , Balthasar Schachtner
  • , Johanna Topalis
  • , Matthias P. Fabritius
  • , Sijing Gu
  • , Gowtham Krishnan Murugesan
  • , Jeff VanOss
  • , Jin Ye
  • , Junjun He
  • , Anissa Alloula
  • , Bartłomiej W. Papież
  • , Zacharia Mesbah
  • , Romain Modzelewski
  • , Matthias Hadlich
  • , Zdravko Marinov
  • Rainer Stiefelhagen, Fabian Isensee, Klaus H. Maier-Hein, Adrian Galdran, Konstantin Nikolaou, Christian la Fougère, Moon Kim, Nico Kallenberg, Jens Kleesiek, Ken Herrmann, Rudolf Werner, Michael Ingrisch, Clemens C. Cyran, Thomas Küstner
  • Munich Center for Machine Learning
  • University of Tübingen
  • Stanford University
  • Ludwig Maximilian University of Munich
  • Comprehensive Pneumology Center
  • Konrad Zuse School of Excellence in Reliable AI
  • Grand Rapids
  • Shanghai Artificial Intelligence Laboratory
  • Université Rouen Normandie
  • Henri Becquerel Cancer Center
  • Siemens Healthcare SAS
  • Karlsruhe Institute of Technology
  • German Cancer Research Center
  • Heidelberg University 
  • University of Adelaide
  • Tübingen
  • Partner Site Tübingen
  • University of Duisburg-Essen

Research output: Contribution to journalArticlepeer-review

Abstract

This article reports the results of the second iteration of the autoPET challenge on automated lesion segmentation in whole-body PET/CT, held in conjunction with the 26th International Conference on Medical Image Computing and Computer Assisted Intervention in 2023. In contrast to the first autoPET challenge, which served as a proof of concept, this study investigates whether machine learning-based segmentation models trained on data from a single source can maintain performance across clinically relevant variations in PET/CT data, reflecting the demands of real-world deployment. Methods: A comprehensive biomedical segmentation challenge on PET/CT domain generalization was designed and conducted. Participants were tasked to train machine learning models on annotated whole-body 18F-FDG data (n = 1,014). These models were then evaluated on a test set of 200 samples from 5 clinically relevant domains, including variations in institutions, pathologies, and populations and a different tracer. Performance was measured in terms of average dice similarity coefficient, average false-positive volume, and average false-negative volume. The best-performing teams were awarded in 3 categories. Furthermore, a detailed analysis was conducted after the challenge, examining results across domains and unique instances, along with a ranking analysis. Results: Generalization from a single-source domain remains a significant challenge. Seventeen international teams successfully participated in the challenge. The best-performing team reached an average dice similarity coefficient of 0.5038, a mean false-positive volume of 87.8388 mL, and a mean false-negative volume of 8.4154 mL on the test set. nnU-Net was the most commonly used framework, with most participants using a 3-dimensional U-Net. Despite competitive in-domain results, out-of-domain performance deteriorated substantially, particularly on pediatric and prostate-specific membrane antigen data. Detailed error analysis revealed frequent false-positives due to physiologic uptake and decreased sensitivity in detecting small or low-uptake lesions. A majority-vote ensemble offered minimal performance gains, whereas an oracle ensemble indicates hypothetical gains. Ranking analysis showed no single team consistently outperformed all others across ranking schemes. Conclusion: The second autoPET challenge provides a comprehensive evaluation of the current state of automated PET/CT tumor segmentation, highlighting both progress and persistent challenges of single-source domain generalization and the need for diverse public datasets to enhance algorithm robustness.

Original languageEnglish
Pages (from-to)481-488
Number of pages8
JournalJournal of Nuclear Medicine
Volume67
Issue number3
DOIs
Publication statusPublished - 2 Mar 2026
Externally publishedYes

Keywords

  • biomedical image analysis challenge
  • deep learning
  • domain generalization
  • oncology
  • PET/CT
  • segmentation

Fingerprint

Dive into the research topics of 'AutoPET Challenge on Fully Automated Lesion Segmentation in Oncologic PET/CT Imaging, Part 2: Domain Generalization'. Together they form a unique fingerprint.

Cite this