Exploring Polyglot Harmony: On Multilingual Data Allocation for Large Language Models Pretraining | Synapse