GATmath and GATLc: Comprehensive benchmarks for evaluating Arabic large language models | Synapse