Safety Optimized Reinforcement Learning via Multi-Objective Policy Optimization | Synapse