Reinforcing Language Agents via Policy Optimization with Action Decomposition | Synapse