Los puntos clave no están disponibles para este artículo en este momento.
Recent studies show that self-feedback improves large language models (LLMs) on certain tasks while worsens other tasks. We discovered that such a contrary is due to LLM's bias towards their own output. In this paper, we formally define LLM's self-bias -- the tendency to favor its own generation -- using two statistics. We analyze six LLMs on translation, constrained text generation, and mathematical reasoning tasks. We find that self-bias is prevalent in all examined LLMs across multiple languages and tasks. Our analysis reveals that while the self-refine pipeline improves the fluency and understandability of model outputs, it further amplifies self-bias. To mitigate such biases, we discover that larger model size and external feedback with accurate assessment can significantly reduce bias in the self-refine pipeline, leading to actual performance improvement in downstream tasks.
Building similarity graph...
Analyzing shared references across papers
Loading...
Xu et al. (Sat,) studied this question.
www.synapsesocial.com/papers/68e78cdeb6db6435876fead9 — DOI: https://doi.org/10.48550/arxiv.2402.11436
Wenda Xu
Guanglei Zhu
Xuandong Zhao
Building similarity graph...
Analyzing shared references across papers
Loading...
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: