What question did this study set out to answer?

This study aims to develop a data-efficient, RoI-based classification method for Aids to Navigation (AtoNs) in Maritime Autonomous Surface Ship (MASS) applications.

April 10, 2026

A Study on RoI-based Data-efficient AtoN Classification for MASS using VLM

Key Points

Proposes a RoI-based AtoN classification method for MASS that remains data-efficient under limited supervision.
Compares a supervised YOLOv12 classifier baseline with a CLIP-based method.
Applies domain-specific prompt engineering grounded in IALA Region B attributes and LoRA-based few-shot tuning to improve data efficiency.
Runs experiments on Virtual RobotX (VRX) simulation images under clear and foggy conditions and on real-sea RoI images.
The VLM-based classifier achieves robust performance with limited training samples.
It also maintains higher robustness under degraded visibility.
The results point to RoI-based preprocessing combined with parameter-efficient VLM adaptation as a practical direction for data-efficient AtoN classification.
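
The motivation for RoI-based classification can be illustrated with a little arithmetic. The sketch below is not the authors' pipeline: the function names, the 20% padding ratio, and the pixel coordinates are all hypothetical. It only shows how cropping a padded region around a small detection changes the share of pixels that belong to the object.

```python
def padded_roi(box, image_size, pad_ratio=0.2):
    """Expand a box (x1, y1, x2, y2) by pad_ratio per side and clamp it to the image."""
    x1, y1, x2, y2 = box
    width, height = image_size
    pad_x = round((x2 - x1) * pad_ratio)
    pad_y = round((y2 - y1) * pad_ratio)
    return (
        max(0, x1 - pad_x),
        max(0, y1 - pad_y),
        min(width, x2 + pad_x),
        min(height, y2 + pad_y),
    )


def area_fraction(box, region):
    """Fraction of `region` covered by `box` (the box is assumed to lie inside the region)."""
    bx1, by1, bx2, by2 = box
    rx1, ry1, rx2, ry2 = region
    return ((bx2 - bx1) * (by2 - by1)) / ((rx2 - rx1) * (ry2 - ry1))


# Hypothetical numbers: a 40x60 px buoy in a 1920x1080 frame.
frame = (0, 0, 1920, 1080)
buoy = (900, 500, 940, 560)
roi = padded_roi(buoy, (1920, 1080))

full_frame_share = area_fraction(buoy, frame)
roi_share = area_fraction(buoy, roi)
```

With these made-up numbers the buoy covers roughly 0.1% of the full frame but about half of the padded crop, which is the background-dominance effect the RoI step is meant to reduce.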

Abstract

This study proposes a RoI-based, data-efficient, fine-grained Aids to Navigation (AtoN) classification method using vision–language models (VLMs) for Maritime Autonomous Surface Ships (MASS). The reliability of the Electronic Chart Display and Information System (ECDIS) can be limited by operating anomalies and by discrepancies between charted and actual environments, which motivates camera-based situational awareness to support human watch-keeping. AtoNs, which are crucial indicators for coastal navigation, are typically observed as distant, small-scale objects. This makes large-scale labeled data collection difficult and degrades full-frame classification because the background dominates the image. To address this, we focus on RoI-based classification under limited supervision and compare a supervised YOLOv12 classifier baseline with CLIP (Contrastive Language–Image Pre-training). The CLIP-based classifier maximizes data efficiency through domain-specific prompt engineering grounded in IALA Region B attributes and through LoRA-based few-shot tuning. Experiments on Virtual RobotX (VRX) simulation datasets under clear and foggy conditions and on real-sea RoI images demonstrate that the proposed VLM-based classifier achieves robust performance with limited training samples and maintains higher robustness under degraded visibility. These results suggest an effective direction for practical, data-efficient AtoN classification in maritime environments via RoI-based preprocessing and parameter-efficient VLM adaptation.
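
As a rough illustration of attribute-grounded prompting with CLIP-style scoring, the sketch below builds text prompts from IALA Region B lateral-mark attributes (green port-hand marks, red starboard-hand marks) and scores an image embedding against them by cosine similarity followed by a softmax. The class names, prompt template, temperature, and three-dimensional toy embeddings are assumptions made for this example; the paper's actual prompts and encoder are not reproduced here, and a real system would obtain the embeddings from CLIP's image and text encoders.

```python
import math

# IALA Region B lateral marks: port-hand marks are green, starboard-hand marks are red.
# Class names and the prompt template are hypothetical.
ATTRIBUTES = {
    "port_hand_mark": {"color": "green", "shape": "can-shaped"},
    "starboard_hand_mark": {"color": "red", "shape": "conical"},
}


def build_prompt(attrs):
    """Turn an attribute record into a natural-language class prompt."""
    return f"a photo of a {attrs['color']}, {attrs['shape']} lateral buoy at sea"


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def classify(image_emb, text_embs, temperature=0.01):
    """Softmax over temperature-scaled cosine similarities (CLIP-style scoring)."""
    logits = {name: cosine(image_emb, emb) / temperature for name, emb in text_embs.items()}
    peak = max(logits.values())  # subtract the max for numerical stability
    exps = {name: math.exp(value - peak) for name, value in logits.items()}
    total = sum(exps.values())
    return {name: value / total for name, value in exps.items()}


prompts = {name: build_prompt(attrs) for name, attrs in ATTRIBUTES.items()}

# Toy vectors standing in for encoder outputs (assumed values, not real CLIP embeddings).
toy_text_embs = {
    "port_hand_mark": [0.1, 0.9, 0.2],
    "starboard_hand_mark": [0.9, 0.1, 0.2],
}
toy_image_emb = [0.8, 0.2, 0.1]

probs = classify(toy_image_emb, toy_text_embs)
prediction = max(probs, key=probs.get)
```

Because the class set is defined by text, adding a new AtoN type in this scheme means adding an attribute record rather than collecting a new labeled image set, which is one way prompt design can support data efficiency.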

Authors

S.B. Im

Si-Won Kim

Seonghyeon Jung

Journals

Journal of the Society of Naval Architects of Korea
