Journal Article DZNE-2024-01425

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Replication study of PD-L1 status prediction in NSCLC using PET/CT radiomics.

 ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;

2025
Elsevier Science Amsterdam [u.a.]

European journal of radiology 183, 111825 () [10.1016/j.ejrad.2024.111825]

This record in other databases:    

Please use a persistent id in citations: doi:

Abstract: This study investigates the predictive capability of radiomics in determining programmed cell death ligand 1 (PD-L1) expression (>=1%) status in non-small cell lung cancer (NSCLC) patients using a newly collected [18F]FDG PET/CT dataset. We aimed to replicate and validate the radiomics-based machine learning (ML) model proposed by Zhao et al. [1] predicting PD-L1 status from PET/CT-imaging. An independent cohort of 254 NSCLC patients underwent [18F]FDG PET/CT imaging, with primary tumor segmentation conducted using lung tissue window (LTW) and more conservative soft tissue window (STW) methods. Radiomics models ('Rad-score' and 'complex model') and a clinical-stage model from Zhao et al. were evaluated via 10-fold cross-validation and AUC analysis, alongside a benchmark-study comparing different ML-model pipelines. Clinicopathological data were collected from medical records. On our data, the Rad-score model yielded mean AUCs of 0.593 (STW) and 0.573 (LTW), below Zhao et al.'s 0.761. The complex model achieved mean AUCs of 0.505 (STW) and 0.519 (LTW), lower than Zhao et al.'s 0.769. The clinical model showed a mean AUC of 0.555, below Zhao et al.'s 0.64. All models performed significantly lower than Zhao et al.'s findings. Our benchmark study on four ML pipelines revealed consistently low performance across all configurations. Our study failed to replicate original findings, suggesting poor model performance and questioning predictive value of radiomics features in classifying PD-L1 expression from PET/CT imaging. These results highlight challenges in replicating radiomics-based ML models and stress the need for rigorous validation.

Keyword(s): Machine learning benchmark ; NSCLC ; PD-L1 ; PET/CT imaging data ; Radiomics ; Replication study

Classification:

Contributing Institute(s):
  1. Molecular Neurodegeneration (AG Haass)
Research Program(s):
  1. 352 - Disease Mechanisms (POF4-352) (POF4-352)

Appears in the scientific report 2025
Database coverage:
Medline ; Clarivate Analytics Master Journal List ; Current Contents - Clinical Medicine ; Ebsco Academic Search ; Essential Science Indicators ; IF < 5 ; JCR ; NationallizenzNationallizenz ; SCOPUS ; Science Citation Index Expanded ; Web of Science Core Collection
Click to display QR Code for this record

The record appears in these collections:
Document types > Articles > Journal Article
Institute Collections > M DZNE > M DZNE-AG Haass
Public records
Publications Database

 Record created 2024-12-16, last modified 2025-01-20


Fulltext:
DZNE-2024-01425 SUP - Download fulltext PDF Download fulltext PDF (PDFA)
DZNE-2024-01425_Restricted - Download fulltext PDF Download fulltext PDF (PDFA)
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)