What type of study is this?

This is an experimental study.

October 13, 2025 | Open Access

CoPL: Collaborative Preference Learning for Personalizing LLMs

Key Points

CoPL outperforms existing personalized reward models for LLMs, effectively capturing diverse user preferences.
Experiments on UltraFeedback-P show improved alignment of outputs with user preferences, covering both common and controversial preferences.
Using a mixture of LoRA experts, CoPL fine-tunes LLMs efficiently while dynamically balancing shared and user-specific preferences.
The optimization-free adaptation strategy enables generalization to new users without requiring additional fine-tuning.
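
To make the mixture-of-LoRA-experts idea concrete, the following is a minimal sketch of one adapted linear layer, not the CoPL implementation. It assumes a frozen weight matrix, a set of low-rank expert pairs (A, B), and a softmax gate driven by a user embedding; the function names, the gating form, and the toy values are all hypothetical, and the routing used in the paper may differ.

```python
import math

def matvec(m, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]

def softmax(z):
    """Numerically stable softmax over a list of floats."""
    mx = max(z)
    exps = [math.exp(x - mx) for x in z]
    total = sum(exps)
    return [e / total for e in exps]

def mole_forward(x, w_frozen, experts, gate_w, user_emb):
    """Compute y = W x + sum_e g_e * B_e (A_e x).

    Assumption: gate weights g come from a softmax over a linear map
    of the user embedding. W stays frozen; only (A_e, B_e) and the
    gate would be trained.
    """
    gates = softmax(matvec(gate_w, user_emb))
    y = matvec(w_frozen, x)
    for g, (a, b) in zip(gates, experts):
        delta = matvec(b, matvec(a, x))  # low-rank update B_e A_e x
        y = [yi + g * di for yi, di in zip(y, delta)]
    return y, gates

# Toy example: identity base layer, two rank-1 experts.
x = [1.0, 2.0]
w_frozen = [[1.0, 0.0], [0.0, 1.0]]
experts = [
    ([[1.0, 0.0]], [[1.0], [0.0]]),  # expert 1 touches dimension 0
    ([[0.0, 1.0]], [[0.0], [1.0]]),  # expert 2 touches dimension 1
]
gate_w = [[1.0, 0.0], [0.0, 1.0]]
user_emb = [0.0, 0.0]  # neutral user: both experts weighted equally

y, gates = mole_forward(x, w_frozen, experts, gate_w, user_emb)
print(y, gates)
```

In this sketch a user whose embedding favors one expert shifts the output toward that expert's update, while the frozen base layer carries what all users share.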

Abstract

Personalizing large language models (LLMs) is important for aligning outputs with diverse user preferences, yet existing methods struggle with flexibility and generalization. We propose CoPL (Collaborative Preference Learning), a graph-based collaborative filtering framework that models user-response relationships to enhance preference estimation, particularly in sparse annotation settings. By integrating a mixture of LoRA experts, CoPL efficiently fine-tunes LLMs while dynamically balancing shared and user-specific preferences. Additionally, an optimization-free adaptation strategy enables generalization to unseen users without fine-tuning. Experiments on UltraFeedback-P demonstrate that CoPL outperforms existing personalized reward models, effectively capturing both common and controversial preferences, making it a scalable solution for personalized LLM alignment. The code is available at https://github.com/ml-postech/CoPL.
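
The abstract states that unseen users are handled without fine-tuning but does not spell out the mechanism here. One plausible reading, sketched below purely as an assumption, is to build a new user's embedding as a weighted average of existing users' embeddings, weighting each existing user by how often their preference labels agree with the new user's few annotations. The function name `adapt_unseen_user`, the agreement score, and the temperature parameter are hypothetical and are not taken from the CoPL code.

```python
import math

def adapt_unseen_user(new_labels, known_labels, known_embs, temperature=1.0):
    """Estimate an embedding for a new user without gradient updates.

    new_labels:   dict pair_id -> 0/1 preference from the new user
    known_labels: dict user_id -> dict pair_id -> 0/1
    known_embs:   dict user_id -> list[float] (already learned)
    Assumption: similarity is the agreement rate on shared pairs.
    """
    scores = {}
    for user, labels in known_labels.items():
        shared = [p for p in new_labels if p in labels]
        if not shared:
            continue  # no overlap, so no evidence about this user
        agree = sum(new_labels[p] == labels[p] for p in shared)
        scores[user] = agree / len(shared)
    if not scores:
        raise ValueError("no overlapping annotations with known users")

    # Softmax over agreement rates gives the mixing weights.
    mx = max(scores.values())
    weights = {u: math.exp((s - mx) / temperature) for u, s in scores.items()}
    total = sum(weights.values())

    dim = len(next(iter(known_embs.values())))
    emb = [0.0] * dim
    for user, w in weights.items():
        for i, v in enumerate(known_embs[user]):
            emb[i] += (w / total) * v
    return emb

# Toy example: the new user agrees with u1 on every shared pair.
known_labels = {"u1": {"p1": 1, "p2": 0}, "u2": {"p1": 0, "p2": 1}}
known_embs = {"u1": [1.0, 0.0], "u2": [0.0, 1.0]}
new_labels = {"p1": 1, "p2": 0}

emb = adapt_unseen_user(new_labels, known_labels, known_embs, temperature=0.5)
print(emb)
```

A lower temperature concentrates the weight on the most similar existing users; a higher one moves the estimate toward the population average.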

Authors

Youn Seon Choi

S.H. Cho

M.J. Lee
