Our study evaluated a large language model (gpt-4o-mini) for surgical site infection (SSI) adjudication, achieving 100% sensitivity but 69.4% specificity. While reducing the manual screening workload by 66%, the agent generated many false positives, underscoring the need for refined models to improve specificity without compromising accuracy.
Building similarity graph...
Analyzing shared references across papers
Loading...
Eugenia Miranti
Timothy Keyes
Alvaro Ayala
Infection Control and Hospital Epidemiology
Stanford University
Stanford Health Care
Building similarity graph...
Analyzing shared references across papers
Loading...
Miranti et al. (Mon,) studied this question.
www.synapsesocial.com/papers/69df2b49e4eeef8a2a6b03d8 — DOI: https://doi.org/10.1017/ice.2026.10432
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: