Enabling Low-Latency, GPU-Efficient Serverless Inference with Model Swapping | Synapse