Aligning Agent Policies with Preferences: Human-Centered Interpretable Reinforcement Learning | Synapse