May 25, 2024 · Open Access

Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning

Abstract

In many reinforcement learning (RL) problems involving safety-critical systems, a key challenge is balancing multiple objectives while satisfying every safety constraint. To address this, we propose a primal-based framework that orchestrates policy optimization between multi-objective learning and constraint adherence. Our method uses a novel natural policy gradient manipulation technique to optimize multiple RL objectives and overcome conflicting gradients between tasks: when the gradients of different task objectives are misaligned, a simple weighted average of them may not improve, and can even hurt, performance on a specific task. When a hard constraint is violated, the algorithm intervenes and rectifies the policy to minimize the violation. We establish theoretical convergence and constraint violation guarantees in a tabular setting. Empirically, the proposed method also outperforms prior state-of-the-art methods on challenging safe multi-objective reinforcement learning tasks.
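To make the two ideas in the abstract concrete, here is a minimal sketch, not the authors' algorithm. It swaps in a generic projection-based conflict resolution (in the style of PCGrad) on plain gradient vectors rather than natural policy gradients, and a simple switching rule for the constraint step. The function names, the `limit` and `tolerance` parameters, and the example vectors are all assumptions made for illustration.

```python
from typing import List, Sequence

Vector = List[float]


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def mean_gradient(grads: Sequence[Vector]) -> Vector:
    """Uniformly weighted average of the task gradients."""
    n = len(grads)
    return [sum(col) / n for col in zip(*grads)]


def resolve_conflicts(grads: Sequence[Vector]) -> Vector:
    """Hypothetical stand-in for gradient manipulation (PCGrad-style).

    Each task gradient is projected off every other task gradient it
    conflicts with (negative inner product); the results are averaged.
    """
    adjusted = []
    for i, g in enumerate(grads):
        g_adj = list(g)
        for j, other in enumerate(grads):
            if i == j:
                continue
            inner = dot(g_adj, other)
            if inner < 0.0:  # conflict: remove the opposing component
                scale = inner / dot(other, other)
                g_adj = [x - scale * y for x, y in zip(g_adj, other)]
        adjusted.append(g_adj)
    return mean_gradient(adjusted)


def update_direction(
    objective_grads: Sequence[Vector],
    constraint_grad: Vector,
    constraint_value: float,
    limit: float,
    tolerance: float = 0.0,
) -> Vector:
    """Assumed primal switching rule: if the constraint is violated,
    step to reduce the constraint cost; otherwise improve the objectives."""
    if constraint_value > limit + tolerance:
        return [-x for x in constraint_grad]
    return resolve_conflicts(objective_grads)


# Two misaligned task gradients (illustrative values only).
g1, g2 = [1.0, 0.0], [-2.0, 1.0]
naive = mean_gradient([g1, g2])
balanced = resolve_conflicts([g1, g2])
```

In this toy case the plain average has a negative inner product with `g1`, so following it would degrade the first task, while the projected direction has a positive inner product with both gradients. When `constraint_value` exceeds `limit`, `update_direction` ignores the objectives and returns the negated constraint gradient instead.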

Cite this study

Gu et al. (2024) studied this question.

www.synapsesocial.com/papers/68e686d2b6db64358760fece — DOI: https://doi.org/10.48550/arxiv.2405.16390

Authors

Shangding Gu

Bilgehan Sel

Yuhao Ding
