Training Compute-Optimal Large Language Models | Synapse