Warm-start or cold-start? A comparison of generalizability in gradient-based hyperparameter tuning