Benchmarking Large Language Model Performance in Perioperative Crisis Responses | Synapse