Sub-Scaling Laws: On the Role of Data Density and Training Strategies in LLMs | Synapse