What type of study is this?

This is a Experimental Study study.

October 13, 2025Open Access

DreamNav: A Trajectory-Based Imaginative Framework for Zero-Shot Vision-and-Language Navigation

Puntos clave

DreamNav outperforms existing methods by improving agent planning capabilities and aligning actions with language instructions.
The framework leverages innovations like the EgoView Corrector and Imagination Predictor to enhance perceptive stability and proactive thinking.
Innovative features enable DreamNav to achieve a new zero-shot state-of-the-art on VLN-CE, surpassing baseline metrics significantly.
By unifying trajectory-level planning and imagination with egocentric inputs, DreamNav sets a new standard for navigation technology.

Resumen

Vision-and-Language Navigation in Continuous Environments (VLN-CE), which links language instructions to perception and control in the real world, is a core capability of embodied robots. Recently, large-scale pretrained foundation models have been leveraged as shared priors for perception, reasoning, and action, enabling zero-shot VLN without task-specific training. However, existing zero-shot VLN methods depend on costly perception and passive scene understanding, collapsing control to point-level choices. As a result, they are expensive to deploy, misaligned in action semantics, and short-sighted in planning. To address these issues, we present DreamNav that focuses on the following three aspects: (1) for reducing sensory cost, our EgoView Corrector aligns viewpoints and stabilizes egocentric perception; (2) instead of point-level actions, our Trajectory Predictor favors global trajectory-level planning to better align with instruction semantics; and (3) to enable anticipatory and long-horizon planning, we propose an Imagination Predictor to endow the agent with proactive thinking capability. On VLN-CE and real-world tests, DreamNav sets a new zero-shot state-of-the-art (SOTA), outperforming the strongest egocentric baseline with extra information by up to 7.49\% and 18.15\% in terms of SR and SPL metrics. To our knowledge, this is the first zero-shot VLN method to unify trajectory-level planning and active imagination while using only egocentric inputs.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Yunheng Wang

Yuetong Fang

Taowen Wang

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

DreamNav: A Trajectory-Based Imaginative Framework for Zero-Shot Vision-and-Language Navigation

Puntos clave

Resumen

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider