CoMMIT: Coordinated Instruction Tuning for Multimodal Large Language Models | Synapse