February 17, 2024Open Access

Perils of Self-Feedback: Self-Bias Amplifies in Large Language Models

Puntos clave

Los puntos clave no están disponibles para este artículo en este momento.

Resumen

Recent studies show that self-feedback improves large language models (LLMs) on certain tasks while worsens other tasks. We discovered that such a contrary is due to LLM's bias towards their own output. In this paper, we formally define LLM's self-bias -- the tendency to favor its own generation -- using two statistics. We analyze six LLMs on translation, constrained text generation, and mathematical reasoning tasks. We find that self-bias is prevalent in all examined LLMs across multiple languages and tasks. Our analysis reveals that while the self-refine pipeline improves the fluency and understandability of model outputs, it further amplifies self-bias. To mitigate such biases, we discover that larger model size and external feedback with accurate assessment can significantly reduce bias in the self-refine pipeline, leading to actual performance improvement in downstream tasks.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Xu et al. (Sat,) studied this question.

www.synapsesocial.com/papers/68e78cdeb6db6435876fead9 — DOI: https://doi.org/10.48550/arxiv.2402.11436

Authors

Wenda Xu

Guanglei Zhu

Xuandong Zhao

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Perils of Self-Feedback: Self-Bias Amplifies in Large Language Models

Puntos clave

Resumen

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Also consider