What question did this study set out to answer?

The research aims to enhance few-shot open-set object detection by accurately identifying known objects while rejecting unknowns.

April 4, 2026 (Open Access)

Few-Shot Open-Set Object Detection with a Synthesized Monument Guided by Contrastive Distilled Prompts

Key Points

The research aims to enhance few-shot open-set object detection by accurately identifying known objects while rejecting unknowns.
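Proposes a guided prompt–monument network (GPMN) that jointly enhances prompt learning and feature representation learning.
Introduces a contrastive distilled prompts (CDP) module, a teacher–student prompt framework that decouples optimization across base, novel, and unknown classes.
Adds a synthesized monument module (SMM) that maintains class-centered memory with momentum-updated prototypes and a non-parametric classifier, providing a stable rejection margin for unknowns.
GPMN consistently improves unknown recall over representative few-shot open-set object detection baselines.
GPMN also achieves higher few-shot mAP than those baselines on the VOC10-5-5 and VOC-COCO benchmarks.
Prompt-level decoupling mitigates base-class bias, while memory-anchored regularization enlarges the seen–unseen margin.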

Abstract

Few-shot open-set object detection (FS-OSOD) remains challenging in real-world scenarios, where detectors must accurately recognize known objects from few examples while reliably rejecting a vast number of unknown categories. Under this setting, decision boundaries between known and unknown classes are easily distorted by data scarcity and background clutter, leading to severe overfitting on base classes and overconfident misclassification of unknowns. Recent research attempts to alleviate these issues by regularizing detection heads to suppress base-class bias, or by leveraging vision–language priors through open-vocabulary alignment and prompt tuning to enhance semantic transferability. However, these solutions often overlook explicit modeling of truly out-of-set unknowns and the instability of prompt adaptation in low-data regimes, which can cause boundary drift and lead unknown proposals to be absorbed by similar seen classes or even suppressed as background. To address these limitations, a guided prompt–monument network (GPMN) is proposed, which jointly enhances prompt learning and feature representation learning for FS-OSOD. First, the contrastive distilled prompts (CDP) module employs a teacher–student prompt framework to decouple optimization across base, novel, and unknown classes. This strategy preserves transferability between zero-shot and few-shot settings while enhancing discrimination on base categories. Second, a synthesized monument module (SMM) maintains class-centered memory with momentum-updated prototypes and a non-parametric classifier, which compresses the overlap between seen and unseen distributions and provides a stable rejection margin for unknowns with strong co-occurrence and background noise. Compared with existing head-regularization and open-vocabulary prompt-tuning pipelines, GPMN explicitly targets both base-class bias and seen–unseen overlap at the region level.
Extensive experiments on VOC10-5-5 and VOC-COCO benchmarks demonstrate that GPMN consistently improves unknown recall and few-shot mAP over representative FS-OSOD baselines. These results suggest that prompt-level decoupling mitigates base-class bias, whereas memory-anchored regularization enlarges the seen–unseen margin, jointly supporting reliable unknown rejection in scarce-supervision regimes.
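
To make the idea of class-centered memory with momentum-updated prototypes and a non-parametric classifier more concrete, the following is a minimal sketch in plain Python. It is not the authors' SMM implementation: the class name `PrototypeMemory`, the cosine-similarity scoring, and the `momentum` and `reject_threshold` values are all illustrative assumptions, and region features are stood in for by short lists of floats.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def cosine(a, b):
    """Cosine similarity between two vectors."""
    return sum(x * y for x, y in zip(l2_normalize(a), l2_normalize(b)))


class PrototypeMemory:
    """Hypothetical class-centered memory; an illustration, not the paper's SMM."""

    def __init__(self, momentum=0.9, reject_threshold=0.5):
        self.momentum = momentum                  # assumed value, not from the paper
        self.reject_threshold = reject_threshold  # assumed value, not from the paper
        self.prototypes = {}                      # class label -> unit-length prototype

    def update(self, label, feature):
        """Momentum (exponential moving average) update of one class prototype."""
        feature = l2_normalize(feature)
        if label not in self.prototypes:
            self.prototypes[label] = feature
            return
        m = self.momentum
        old = self.prototypes[label]
        mixed = [m * o + (1 - m) * f for o, f in zip(old, feature)]
        self.prototypes[label] = l2_normalize(mixed)

    def classify(self, feature):
        """Nearest-prototype (non-parametric) decision with rejection.

        Returns (label, similarity); the label is "unknown" when no stored
        prototype is similar enough to the query feature.
        """
        best_label, best_sim = "unknown", -1.0
        for label, proto in self.prototypes.items():
            sim = cosine(feature, proto)
            if sim > best_sim:
                best_label, best_sim = label, sim
        if best_sim < self.reject_threshold:
            return "unknown", best_sim
        return best_label, best_sim


if __name__ == "__main__":
    memory = PrototypeMemory(momentum=0.9, reject_threshold=0.5)
    memory.update("cat", [1.0, 0.0])
    memory.update("dog", [0.0, 1.0])
    memory.update("cat", [0.9, 0.1])          # nudges the "cat" prototype slightly
    print(memory.classify([1.0, 0.05])[0])    # close to the "cat" prototype
    print(memory.classify([-1.0, -1.0])[0])   # far from every prototype
```

In this sketch the rejection margin is a single fixed similarity threshold; the abstract does not state how GPMN sets its margin, so that choice should be read as a placeholder.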

Cite this study

Chen et al. (2026) studied this question.

www.synapsesocial.com/papers/69d0aefd659487ece0fa4e15 — DOI: https://doi.org/10.3390/app16073474

Authors

H. Matthew Chen

Yu Chen

Journals

Applied Sciences

Institutions

Jiangnan University
