June 6, 2024 | Open Access

Prototypical Reward Network for Data-Efficient RLHF

Abstract

Reward models for Reinforcement Learning from Human Feedback (RLHF) have proven effective in fine-tuning Large Language Models (LLMs). However, collecting human feedback for RLHF can be resource-intensive and lead to scalability issues for LLMs and complex tasks. Our proposed framework, Proto-RM, leverages prototypical networks to enhance reward models under limited human feedback. By enabling stable and reliable structural learning from fewer samples, Proto-RM significantly enhances LLMs' adaptability and accuracy in interpreting human preferences. Extensive experiments on various datasets demonstrate that Proto-RM significantly improves the performance of reward models and LLMs in human feedback tasks, achieving comparable and usually better results than traditional methods while requiring significantly less data. This research offers a promising direction for enhancing the efficiency of reward models and optimizing the fine-tuning of language models under restricted feedback conditions.
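The abstract does not spell out how Proto-RM is built, so the following is only a minimal sketch of the generic prototypical-network idea it refers to, not the authors' method. It assumes that each example has already been mapped to a fixed-length embedding, that a prototype is the mean embedding of a class, and that a query is scored by a softmax over negative squared distances to the prototypes. All function names, the toy embeddings, and the "preferred"/"rejected" labels are hypothetical.

```python
import math


def mean_vector(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def build_prototypes(embeddings, labels):
    """Return one prototype per label: the mean of that label's embeddings."""
    groups = {}
    for vector, label in zip(embeddings, labels):
        groups.setdefault(label, []).append(vector)
    return {label: mean_vector(vectors) for label, vectors in groups.items()}


def squared_distance(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def prototype_probabilities(query, prototypes):
    """Softmax over negative squared distances from the query to each prototype."""
    logits = {label: -squared_distance(query, proto)
              for label, proto in prototypes.items()}
    top = max(logits.values())  # subtract the max for numerical stability
    exps = {label: math.exp(value - top) for label, value in logits.items()}
    total = sum(exps.values())
    return {label: value / total for label, value in exps.items()}


# Toy support set (hypothetical 2-D embeddings, two examples per label).
support = [[1.0, 0.0], [0.8, 0.2], [0.0, 1.0], [0.2, 0.8]]
labels = ["preferred", "preferred", "rejected", "rejected"]

prototypes = build_prototypes(support, labels)
probs = prototype_probabilities([0.9, 0.1], prototypes)
```

In this toy setup the query lies next to the "preferred" prototype, so that label receives the higher probability. The appeal for limited-feedback settings is that a class mean can be estimated from only a few labeled examples.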

Cite this study

Zhang et al. (2024) studied this question.

www.synapsesocial.com/papers/68e65e3eb6db6435875ed11f — DOI: https://doi.org/10.48550/arxiv.2406.06606

Authors

Jinghan Zhang

Xiting Wang

Yiqiao Jin
