February 29, 2024Open Access

Large Language Models are Learnable Planners for Long-Term Recommendation

Key Points

Key points are not available for this paper at this time.

Abstract

Planning for both immediate and long-term benefits becomes increasingly important in recommendation. Existing methods apply Reinforcement Learning (RL) to learn planning capacity by maximizing cumulative reward for long-term recommendation. However, the scarcity of recommendation data presents challenges such as instability and susceptibility to overfitting when training RL models from scratch, resulting in sub-optimal performance. In this light, we propose to leverage the remarkable planning capabilities over sparse data of Large Language Models (LLMs) for long-term recommendation. The key to achieving the target lies in formulating a guidance plan following principles of enhancing long-term engagement and grounding the plan to effective and executable actions in a personalized manner. To this end, we propose a Bi-level Learnable LLM Planner framework, which consists of a set of LLM instances and breaks down the learning process into macro-learning and micro-learning to learn macro-level guidance and micro-level personalized recommendation policies, respectively. Extensive experiments validate that the framework facilitates the planning ability of LLMs for long-term recommendation. Our code and data can be found at https://github.com/jizhi-zhang/BiLLP.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Shi et al. (Thu,) studied this question.

www.synapsesocial.com/papers/68e76fa2b6db6435876e567c — DOI: https://doi.org/10.1145/3626772.3657683

Also consider

Synapse has enriched 3 closely related papers on similar clinical questions. Consider them for comparative context:

DeepFM: A Factorization-Machine based Neural Network for CTR Prediction· 2017 · 2,225 citations
Deep reinforcement learning for page-wise recommendations· 2018 · 412 citations
Training language models to follow instructions with human feedback· 2022 · 4,276 citations

Authors

Wentao Shi

Xiangnan He

Yang Zhang

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Large Language Models are Learnable Planners for Long-Term Recommendation

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion