What question did this study set out to answer?

The aim is to enhance sequential recommendation systems by addressing behavioral alignment issues in large language models.

March 13, 2026Open Access

SBT-Rec: A Structured Behavioral Tokenization Framework for LLM-Based Sequential Recommendation

Key Points

The aim is to enhance sequential recommendation systems by addressing behavioral alignment issues in large language models.
Developed a hierarchical discrete structure discovery module using recursive residual quantization.
Created a multi-scale behavioral semantic reconstruction module to reduce data noise.
Introduced a residual-aware modality distribution aligner for compatible input token transformation.
Proposed a single-stage behavioral-semantic adaptive optimization strategy for efficient fine-tuning.
SBT-Rec shows improved recommendation accuracy compared to baseline models.
Significant enhancements in training efficiency are observed.
Demonstrated increased robustness against data noise.

Abstract

Generative recommendation systems based on Large Language Models leverage their reasoning capabilities to capture users’ latent interests. However, aligning continuous user behavioral embeddings with the discrete semantic space of LLMs remains a challenge. Direct alignment often leads to semantic mismatch and hallucination issues. Furthermore, existing methods typically rely on multi-stage training strategies to adapt to variations in feature distributions, thereby limiting training efficiency. To address the aforementioned issues, we propose SBT-Rec, a structured behavioral tokenization framework. Specifically, we first design a hierarchical discrete structure discovery module, utilizing a recursive residual quantization mechanism to decompose continuous behavioral vectors into discrete behavioral atoms to resolve modality discrepancies. Second, the multi-scale behavioral semantic reconstruction module reconstructs behavioral representations via residual superposition, thereby reducing data noise. Third, a residual-aware modality distribution aligner is introduced to transform behavioral features into input tokens compatible with the LLM via non-linear mapping. Finally, based on structured discrete representations, we propose a single-stage behavioral-semantic adaptive optimization strategy, achieving end-to-end parameter-efficient fine-tuning. Experiments on the MovieLens, LastFM, and Steam datasets demonstrate that SBT-Rec outperforms existing baseline models in terms of recommendation accuracy, training efficiency, and noise robustness.

SBT-Rec: A Structured Behavioral Tokenization Framework for LLM-Based Sequential Recommendation

Key Points

Abstract

Cite This Study