Assessing the Ineffectiveness of Synthetic Reinforcement Learning Feedback in Fine-Tuning Large Language Models | Synapse