A Comprehensive Literature Review on Multimodal Large Language Models for Integrated Text, Image, and Speech Understanding | Synapse