What type of study is this?

This is a Literature Review study.

September 20, 2025

The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning

Key Points

Integrating large language models (LLMs) and vision-language models (VLMs) into reinforcement learning (RL) helps address key RL challenges, such as lack of prior knowledge, long-horizon planning, and reward design.
The survey presents a taxonomy that categorizes LLM/VLM-assisted RL approaches into three roles: agent, planner, and reward.
Open problems discussed include grounding, bias mitigation, improved representations, and action advice.
By consolidating existing research and identifying future directions, the survey aims to advance approaches that unify natural language and visual understanding with sequential decision-making.

Abstract

Reinforcement learning (RL) has shown impressive results in sequential decision-making tasks. Large Language Models (LLMs) and Vision-Language Models (VLMs) have recently emerged, exhibiting impressive capabilities in multimodal understanding and reasoning. These advances have led to a surge of research integrating LLMs and VLMs into RL. This survey reviews representative works in which LLMs and VLMs are used to overcome key challenges in RL, such as lack of prior knowledge, long-horizon planning, and reward design. We present a taxonomy that categorizes these LLM/VLM-assisted RL approaches into three roles: agent, planner, and reward. We conclude by exploring open problems, including grounding, bias mitigation, improved representations, and action advice. By consolidating existing research and identifying future directions, this survey establishes a framework for integrating LLMs and VLMs into RL, advancing approaches that unify natural language and visual understanding with sequential decision-making.

Authors

Sheila Schoepp

Masoud Jafaripour

Yingyue Cao

Institutions

University of Alberta

Nanjing University

Intel (United States)
