Evaluating large language models for automated TNM staging from PET-CT reports: a multi-cancer comparative study | Synapse