What question did this study set out to answer?

The study aims to enhance semi-supervised medical image segmentation by addressing semantic ambiguity using textual information.

March 23, 2026

Decoupling Target Semantics via Text-anchored Visual Contrast for Semi-supervised Medical Image Segmentation

Key points

Developed a Text-anchored Visual Decoupling (TeViD) framework using a teacher-student architecture.
Implemented dual-decoders to separate target and background representations.
Introduced a reversed cross-supervision mechanism for unlabeled data.
Proposed teacher-guided visual and text-anchored contrastive loss objectives.
TeViD outperformed both standard semi-supervised learning methods and text-enhanced methods on five public datasets.
Achieved an average improvement of 5.72% in Dice score over the second-best competitor.
Achieved an average improvement of 8.15% in mean Intersection over Union (mIoU) over the second-best competitor.
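
For readers unfamiliar with the two metrics quoted above, the following is a minimal sketch of how Dice and IoU are commonly computed for a pair of binary masks. The function name `dice_and_iou`, the flat 0/1 list representation, and the convention of scoring two empty masks as 1.0 are assumptions made for illustration; they are not taken from the TeViD code. mIoU is then usually the mean of the IoU values over classes or images, depending on the evaluation protocol.

```python
def dice_and_iou(pred, target):
    """Return (Dice, IoU) for two binary masks given as flat sequences of 0/1."""
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of pixels")

    # Pixels predicted as foreground that are also foreground in the reference.
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    pred_area = sum(1 for p in pred if p)
    target_area = sum(1 for t in target if t)
    union = pred_area + target_area - intersection

    if union == 0:
        # Assumed convention: two empty masks count as a perfect match.
        return 1.0, 1.0

    dice = 2 * intersection / (pred_area + target_area)
    iou = intersection / union
    return dice, iou


# Toy example: 6-pixel masks that agree on two foreground pixels.
pred = [1, 1, 0, 0, 1, 0]
target = [1, 0, 0, 0, 1, 1]
dice, iou = dice_and_iou(pred, target)
print(f"Dice={dice:.4f}, IoU={iou:.4f}")
```

Because Dice counts the intersection twice, it is never lower than IoU on the same pair of masks, which is why the two improvements reported above differ in size.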

Abstract

Semi-supervised learning (SSL) provides an effective means of reducing reliance on large-scale annotated datasets by leveraging unlabeled data. However, existing SSL methods often struggle with semantic ambiguity, especially under limited supervision. Recent studies have incorporated textual information to provide contextual guidance, yet most focus on feature fusion rather than emphasizing target semantics critical for segmentation. In this paper, we propose a novel Text-anchored Visual Decoupling (TeViD) framework for semi-supervised medical image segmentation. TeViD is built upon a teacher-student architecture with a dual-decoder design that explicitly disentangles target and background representations using both labeled and unlabeled data. For unlabeled data, a reversed cross-supervision mechanism is introduced to enhance decoder diversity and semantic separation. Furthermore, two contrastive learning objectives are proposed: a teacher-guided visual contrastive loss and a text-anchored contrastive loss, both designed to reinforce semantic disentanglement from visual and textual perspectives. Extensive experiments on five public datasets (covering X-ray, pathology, ultrasound, MRI, and CT) demonstrate that TeViD consistently outperforms both standard SSL and text-enhanced SSL methods, achieving average improvements of 5.72% in Dice and 8.15% in mIoU over the second-best competitor. The code is available at: https://github.com/jgfiuuuu/TeViD.
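
The abstract does not give the form of the text-anchored contrastive loss, so the sketch below only illustrates the general idea under stated assumptions: an InfoNCE-style objective in which a text embedding acts as the anchor, a target-region visual feature is the positive, and background-region features are the negatives. The function names, the use of cosine similarity, and the temperature value are all hypothetical and may differ from the released TeViD implementation.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v + 1e-12)


def text_anchored_info_nce(text_emb, target_feat, background_feats, temperature=0.1):
    """InfoNCE-style loss with the text embedding as anchor (illustrative assumption).

    The loss is small when the text embedding is close to the target feature
    and far from every background feature.
    """
    pos = math.exp(cosine(text_emb, target_feat) / temperature)
    neg = sum(math.exp(cosine(text_emb, b) / temperature) for b in background_feats)
    return -math.log(pos / (pos + neg))


# Toy 2-D features: the text embedding points along the first axis.
text_emb = [1.0, 0.0]
aligned_loss = text_anchored_info_nce(text_emb, [0.9, 0.1], [[0.0, 1.0]])
swapped_loss = text_anchored_info_nce(text_emb, [0.0, 1.0], [[0.9, 0.1]])
print(aligned_loss < swapped_loss)
```

Minimizing a loss of this shape pulls target features toward the text description and pushes background features away from it, which is one plausible way to realize the semantic disentanglement the abstract describes.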


Cite This Study

Zeng et al. studied this question.

synapsesocial.com/papers/69c0de74fddb9876e79c12ea https://doi.org/10.1109/tip.2026.3674365
