Towards better recommendations: Integrating counterfactual learning and trust regions in digital platforms | Synapse