High-Dimensional Continuous Control Using Generalized Advantage Estimation | Synapse