Key points are not available for this paper at this time.
Safe reinforcement learning tasks with multiple constraints are a challenging domain despite being very common in the real world. To address this challenge, we propose Objective Suppression, a novel method that adaptively suppresses the task reward maximizing objectives according to a safety critic. We benchmark Objective Suppression in two multi-constraint safety domains, including an autonomous driving domain where any incorrect behavior can lead to disastrous consequences. Empirically, we demonstrate that our proposed method, when combined with existing safe RL algorithms, can match the task reward achieved by our baselines with significantly fewer constraint violations.
Building similarity graph...
Analyzing shared references across papers
Loading...
Zhou et al. (Fri,) studied this question.
www.synapsesocial.com/papers/68e77f50b6db6435876f2f1d — DOI: https://doi.org/10.48550/arxiv.2402.15650
Zihan Zhou
Jonathan Booher
Wei Liu
Building similarity graph...
Analyzing shared references across papers
Loading...