What question did this study set out to answer?

The research aims to enhance traversability perception for autonomous mobile robots using a novel segmentation framework.

March 15, 2026 · Open Access

TARTS: Training-Free Adaptive Reference-Guided Traversability Segmentation with Automated Footprint Supervision and Experimental Verification

Key Points

Introduced TARTS, a training-free framework for traversable-terrain segmentation.
Utilized one-shot prototype initialization from a single reference image.
Applied automated footprint supervision and Exponential Moving Average updates for online adaptation.
Achieved 94.5% Intersection over Union (IoU) on the newly introduced Reference-guided Traversability Segmentation Dataset (RTSD).
Achieved 94.1% IoU on the Off-Road Freespace Detection (ORFD) benchmark, outperforming state-of-the-art supervised methods that require multi-modal inputs and dedicated training.
Maintained 17–24 FPS on embedded platforms.
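
The one-shot initialization and EMA refinement listed above can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names, the momentum value of 0.9, the mean pooling of reference features, and the use of cosine similarity are all assumptions, and plain Python lists stand in for the DINO ViT features.

```python
import math


def l2_normalize(vec):
    """Scale a feature vector to unit length (an all-zero vector is returned as-is)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return list(vec) if norm == 0.0 else [x / norm for x in vec]


def cosine_similarity(a, b):
    """Cosine similarity between two feature vectors."""
    a, b = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a, b))


def init_prototype(reference_features):
    """One-shot initialization (assumed): average the reference features, then normalize."""
    n = len(reference_features)
    dim = len(reference_features[0])
    mean = [sum(f[i] for f in reference_features) / n for i in range(dim)]
    return l2_normalize(mean)


def ema_update(prototype, footprint_feature, momentum=0.9):
    """Blend the current prototype with a feature sampled under the robot's footprint.

    The momentum value is a hypothetical default, not taken from the paper.
    """
    new = l2_normalize(footprint_feature)
    blended = [momentum * p + (1.0 - momentum) * f for p, f in zip(prototype, new)]
    return l2_normalize(blended)


# Usage: features from the reference image seed the prototype; features from
# terrain the robot has actually driven over then pull it toward that terrain.
proto = init_prototype([[1.0, 0.0], [1.0, 0.0]])
updated = ema_update(proto, [0.0, 1.0])
```

A high momentum keeps the prototype stable against a single noisy footprint sample, while still letting it drift toward terrain the robot has repeatedly traversed.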

Abstract

Autonomous mobile robots require robust traversability perception to navigate safely in diverse outdoor environments. However, traditional deep learning approaches are data-hungry, requiring large-scale manual annotations, and struggle to adapt quickly to unseen environments. This paper introduces TARTS (Training-free Adaptive Reference-guided Traversability Segmentation), a novel framework combining one-shot prototype initialization with trajectory-guided online adaptation for terrain segmentation. Using a single reference image of desired traversable terrain, TARTS establishes an initial prototype from pre-trained DINO Vision Transformer (ViT) features. The system performs segmentation through superpixel-based feature aggregation and valley-emphasis Otsu thresholding while continuously refining the prototype via Exponential Moving Average (EMA) updates driven by automated footprint supervision from the robot’s traversed trajectory. Extensive experiments on our introduced Reference-guided Traversability Segmentation Dataset (RTSD) and the challenging Off-Road Freespace Detection (ORFD) benchmark demonstrate strong performance, achieving 94.5% IoU on RTSD and 94.1% IoU on ORFD, outperforming state-of-the-art supervised methods that require multi-modal inputs and dedicated training. The framework maintains efficient performance (17–24 FPS) on embedded platforms, enabling practical deployment with only a reference image as initialization.
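
The thresholding step can be illustrated with a small sketch of valley-emphasis Otsu applied to per-superpixel similarity scores. The abstract does not give the exact formulation, so this assumes the commonly cited variant in which Otsu's objective is weighted by (1 - p_t), where p_t is the histogram probability at the candidate threshold. The function name, the 64-bin histogram, and the assumption that scores lie in [0, 1] are illustrative choices.

```python
def valley_emphasis_otsu(scores, bins=64):
    """Pick a threshold in (0, 1] for similarity scores assumed to lie in [0, 1].

    Maximizes (1 - p_t) * (w0 * mu0**2 + w1 * mu1**2) over histogram bins t,
    which favors thresholds that sit in a sparsely populated "valley".
    """
    if not scores:
        raise ValueError("scores must not be empty")

    # Build a histogram of the scores, clamping out-of-range values.
    hist = [0] * bins
    for s in scores:
        idx = min(max(int(s * bins), 0), bins - 1)
        hist[idx] += 1

    total = len(scores)
    total_sum = sum(i * h for i, h in enumerate(hist))

    best_t, best_score = 0, -1.0
    n0 = 0  # number of samples in bins 0..t (class 0)
    s0 = 0  # sum of bin indices over class 0
    for t in range(bins - 1):
        n0 += hist[t]
        s0 += t * hist[t]
        n1 = total - n0
        if n0 == 0 or n1 == 0:
            continue  # one class is empty, so this split is not meaningful
        w0, w1 = n0 / total, n1 / total
        mu0 = s0 / n0
        mu1 = (total_sum - s0) / n1
        p_t = hist[t] / total
        score = (1.0 - p_t) * (w0 * mu0 ** 2 + w1 * mu1 ** 2)
        if score > best_score:
            best_score, best_t = score, t

    # Return the upper edge of the chosen bin as the score threshold.
    return (best_t + 1) / bins


# Usage: superpixels whose similarity to the prototype reaches the threshold
# are labeled traversable.
scores = [0.1] * 50 + [0.9] * 50
threshold = valley_emphasis_otsu(scores)
traversable = [s >= threshold for s in scores]
```

Compared with plain Otsu, the (1 - p_t) weight penalizes thresholds that fall on a heavily populated bin, which can help when the traversable and non-traversable score distributions differ greatly in size.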
