Bridging the Gap Between Multimodal Foundation Models and World Models | Synapse