Evaluating large language models in biomedical data science challenges through a classroom experiment | Synapse