What question did this study set out to answer?

The aim is to enhance the detection accuracy of prostate cancer lesions in biparametric MRI through innovative machine learning strategies.

April 22, 2026Open Access

Enhancing Reliable Prostate Lesion Detection: Integrating Multi-Expert Annotations and Tailored nnU-Net Ensemble Learning Strategies

Key Points

The aim is to enhance the detection accuracy of prostate cancer lesions in biparametric MRI through innovative machine learning strategies.
Developed an adapted nnU-Net framework with preprocessing to tackle voxel-level class imbalance.
Created a composite loss function that incorporates Dice, binary cross-entropy, and focal loss for better lesion sensitivity.
Implemented a multi-expert learning strategy using separate reader annotations to guide model training.
The adapted nnU-Net improved Average Precision from 0.69 to 0.75 and AUROC from 0.92 to 0.96 on the test set.
Combining multi-expert annotations further enhanced Average Precision to 0.81 and AUROC to 0.99.
The PI-CAI score increased from 0.81 to 0.90, reflecting overall better lesion detection performance.

Abstract

Accurate detection of prostate cancer suspicious areas in biparametric MRI (bpMRI) remains challenging because of severe lesion-to-background imbalance, limited lesion contrast, and inter-reader variability in lesion delineation. Unlike prior approaches that collapse inter-reader disagreement into a single consensus label, this study makes three contributions: (1) an adapted nnU-Net framework with prostate-centered preprocessing to reduce voxel-level class imbalance; (2) a class-imbalance-aware composite loss combining Dice, binary cross-entropy, and tailored focal loss to improve sensitivity to small and low-contrast lesions; and (3) a multi-expert learning strategy that preserves reader-specific annotations as separate supervision targets and aggregates predictions at the ensemble level. The method was developed on a single-center dataset of 378 bpMRI studies independently annotated by three board-certified radiologists. Of these, 323 studies were used for model development with patient-level 5-fold cross-validation, and 55 studies were reserved as a fixed independent test set. Compared with our previously published U-Net baseline, the proposed consensus-based nnU-Net improved Average Precision (AP) from 0.69 to 0.75, AUROC from 0.92 to 0.96, and the PI-CAI score from 0.81 to 0.85 on the independent test set. In addition, the multi-expert approach further improved AP to 0.81 versus 0.76 (+6.6%, p < 0.01), AUROC to 0.99 versus 0.95 (+4.2%, p < 0.01), and the PI-CAI score to 0.90 versus 0.86 (+4.7%). These findings demonstrate that explicitly preserving expert disagreement as a training signal, combined with anatomically targeted preprocessing and tailored loss design, substantially improves prostate lesion detection in bpMRI, providing a strong basis for future multi center external validation.

Enhancing Reliable Prostate Lesion Detection: Integrating Multi-Expert Annotations and Tailored nnU-Net Ensemble Learning Strategies

Key Points

Abstract

Cite This Study