Abstract Large language models (LLMs) have achieved remarkable success across many fields but face challenges in complex real-world scenarios like medical consultation, particularly regarding inquiry quality and safety concerns. In this paper, we introduce a healthcare agent designed to address these issues, which cannot be fully resolved through a vanilla one-time fine-tuning process. The healthcare agent includes three components: a dialogue component for planning safe and effective conversations, a memory component for storing patient conversations and medical history, and a processing component for report generation. To evaluate our healthcare agent’s performance in medical consultations, we employ both expert assessment from medical professionals and an automated evaluation system powered by ChatGPT for large-scale testing. Our results demonstrate that the healthcare agent significantly enhances the capabilities of general LLMs in medical consultations, particularly in inquiry quality, response quality, and safety. Through extensive ablation studies, we also analyze the impact of each component.
Building similarity graph...
Analyzing shared references across papers
Loading...
Zhiyao Ren
Yibing Zhan
YU Bao-sheng
Building similarity graph...
Analyzing shared references across papers
Loading...
Ren et al. (Mon,) studied this question.
www.synapsesocial.com/papers/68bb49bc6d6d5674bccff506 — DOI: https://doi.org/10.1038/s44387-025-00021-x
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: