A Benchmark for Multi-modal Foundation Models on Low-level Vision: from Single Images to Pairs | Synapse