Key points are not available for this paper at this time.
In this study, we investigated the effects of self-reflection in large language models (LLMs) on problem-solving performance. We instructed nine popular LLMs to answer a series of multiple-choice questions to provide a performance baseline. For each incorrectly answered question, we instructed eight types of self-reflecting LLM agents to reflect on their mistakes and provide themselves with guidance to improve problem-solving. Then, using this guidance, each self-reflecting agent attempted to re-answer the same questions. Our results indicate that LLM agents are able to significantly improve their problem-solving performance through self-reflection (p < 0.001). In addition, we compared the various types of self-reflection to determine their individual contribution to performance. All code and data are available on GitHub at https://github.com/matthewrenze/self-reflection
Building similarity graph...
Analyzing shared references across papers
Loading...
Matthew Renze
Erhan Guven
Johns Hopkins University
Building similarity graph...
Analyzing shared references across papers
Loading...
Renze et al. (Tue,) studied this question.
www.synapsesocial.com/papers/6a08aa7f7de338f10b10e515 — DOI: https://doi.org/10.1109/fllm63129.2024.10852426