Large Language Models (LLMs) have demonstrated significant potential in enhancing healthcare services, including clinical decision support, patient engagement, and medical research. However, their susceptibility to hallucinations generating factually incorrect, misleading, or fabricated information poses serious risks in high-stakes medical contexts. This study proposes a reinforcement learning (RL)-based framework to detect and mitigate hallucinations in LLM outputs tailored for healthcare applications. The approach integrates domain-specific knowledge bases with reward-driven fine-tuning to penalize inaccurate or unsupported responses and reinforce factual precision. The model leverages automated fact-checking, uncertainty estimation, and expert-in-the-loop feedback to refine its reasoning process. Experimental evaluation across multiple healthcare datasets, including medical question-answering and clinical note summarization, shows a substantial reduction in hallucination frequency while preserving response fluency and contextual relevance. This research offers a scalable, adaptive strategy for improving the trustworthiness, safety, and ethical deployment of LLMs in healthcare systems.
Building similarity graph...
Analyzing shared references across papers
Loading...
Srikanth Gorle
Subba Rao
Prabhu Muthusamy
Journal of AI-powered medical innovations.
Building similarity graph...
Analyzing shared references across papers
Loading...
Gorle et al. (Fri,) studied this question.
www.synapsesocial.com/papers/68c1e08354b1d3bfb60fd85d — DOI: https://doi.org/10.60087/japmi.vol.03.issue.01.id.011