Optimizing Controller Placement for SDN using Self-Play Reinforcement Learning | Synapse