Reinforcement Learning for Circular Manufacturing: A Proximal Policy Optimization Approach for Sustainable Production Planning | Synapse