What type of study is this?

This is a quantitative study.

October 1, 2025 | Open Access

TraCeS: Trajectory Based Credit Assignment From Sparse Safety Feedback

Key Points

The proposed safety model estimates decision steps' impact on overall safety, enhancing safe decision making in reinforcement learning.
Utilizing a dataset of diverse trajectories with corresponding binary safety labels shows promising results in safety assessment.
Our reformulation of the safe reinforcement learning problem enables the derivation of an effective algorithm for optimizing safety and rewards.
Empirical results illustrate that our approach is scalable to various continuous control tasks, addressing the challenges of unknown safety definitions.

Abstract

In safe reinforcement learning (RL), auxiliary safety costs are used to align the agent to safe decision making. In practice, safety constraints, including cost functions and budgets, are unknown or hard to specify, because specifying them requires anticipating all possible unsafe behaviors. We therefore address a general setting where the true safety definition is unknown and has to be learned from sparsely labeled data. Our key contributions are as follows. First, we design a safety model that performs credit assignment to estimate each decision step's impact on the overall safety, using a dataset of diverse trajectories and their corresponding binary safety labels (i.e., whether each trajectory is safe or unsafe). Second, we illustrate the architecture of our safety model to demonstrate its ability to learn a separate safety score for each timestep. Third, we reformulate the safe RL problem using the proposed safety model and derive an effective algorithm to optimize a safe yet rewarding policy. Finally, our empirical results corroborate our findings and show that this approach is effective in satisfying unknown safety definitions and scales to various continuous control tasks.
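
To make the idea of trajectory-level credit assignment concrete, here is a minimal sketch, not the authors' model. It assumes a linear per-step scorer whose scores are summed and passed through a sigmoid to predict the binary trajectory label. The abstract does not specify the aggregation, architecture, or loss, so all of these are illustrative assumptions, as are the names `step_scores`, `unsafe_prob`, and `fit`, and the toy features.

```python
import numpy as np

def step_scores(w, traj):
    """Per-step safety scores for a trajectory of shape (T, d). Linear scorer (assumption)."""
    return traj @ w

def unsafe_prob(w, traj):
    """Trajectory-level probability of 'unsafe': sigmoid of the summed step scores (assumption)."""
    return 1.0 / (1.0 + np.exp(-step_scores(w, traj).sum()))

def fit(trajs, labels, dim, lr=0.1, epochs=500):
    """Gradient descent on binary cross-entropy using only trajectory-level labels."""
    w = np.zeros(dim)
    for _ in range(epochs):
        grad = np.zeros(dim)
        for traj, y in zip(trajs, labels):
            # d(BCE)/dw for a sigmoid over summed linear scores
            grad += (unsafe_prob(w, traj) - y) * traj.sum(axis=0)
        w -= lr * grad / len(trajs)
    return w

# Toy data (hypothetical): each step has features [hazard_indicator, bias].
safe_traj = np.array([[0.0, 1.0]] * 5)
unsafe_traj = safe_traj.copy()
unsafe_traj[2, 0] = 1.0  # a single hazardous step at index 2

w = fit([safe_traj, unsafe_traj], [0, 1], dim=2)
scores = step_scores(w, unsafe_traj)
print("per-step scores:", scores)
print("most blamed step:", int(np.argmax(scores)))
```

Although only one label per trajectory is available, the learned per-step scores single out the hazardous step, which is the kind of dense signal a downstream safe RL algorithm could use in place of an unknown cost function.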

Authors

Siow Meng Low

Akshat Kumar
