What question did this study set out to answer?

This study aims to enhance the performance of large language models in long-term multi-session conversations in Korean.

March 28, 2026 · Open Access

An Empirical Study on Enhancing Large Language Models for Long-Term Conversations in Korean

Key points

This study aims to enhance the performance of large language models in long-term multi-session conversations in Korean.
The authors constructed an extended Korean multi-session conversation dataset.
The dataset distinguishes persona memory from episode memory for better memory management.
LLM performance was evaluated on session summarization, memory update, and response generation tasks.
The compared methods include LoRA, DPO, MoE, CPT, Layer Tuning, and neuron-level tuning.
Korean multi-session conversations proved more challenging than English ones.
Neuron-level tuning outperformed other methods, showing improved performance and robustness.
Memory update and response generation tasks demand a high level of reasoning ability.
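
To make the persona/episode distinction in the points above concrete, here is a minimal sketch of a two-part memory store. It is an illustration only, not the paper's implementation: the class name `MemoryStore`, its fields, and the rule "persona facts overwrite, episode events append" are all assumptions made for this example.

```python
from dataclasses import dataclass, field


@dataclass
class MemoryStore:
    """Hypothetical two-part memory for one user across sessions."""
    # Long-term user attributes, keyed by attribute name (assumed layout).
    persona: dict[str, str] = field(default_factory=dict)
    # Short-term, event-driven entries stored as (session_id, event).
    episodes: list[tuple[int, str]] = field(default_factory=list)

    def update(self, session_id: int, persona_facts: dict[str, str],
               events: list[str]) -> None:
        # Assumed rule: a newer persona fact replaces the older value.
        self.persona.update(persona_facts)
        # Assumed rule: events are appended with their originating session.
        self.episodes.extend((session_id, e) for e in events)

    def context(self, max_episodes: int = 3) -> str:
        # Build a prompt prefix: all persona facts, only the latest events.
        persona_lines = [f"{k}: {v}" for k, v in self.persona.items()]
        recent = self.episodes[-max_episodes:]
        episode_lines = [f"[session {s}] {e}" for s, e in recent]
        return "\n".join(
            ["Persona:"] + persona_lines + ["Recent episodes:"] + episode_lines
        )


memory = MemoryStore()
memory.update(1, {"job": "teacher"}, ["caught a cold"])
memory.update(2, {"job": "nurse"}, ["recovered", "booked a trip"])
print(memory.context(max_episodes=2))
```

Keeping the two stores separate lets the memory update step treat stable attributes and transient events differently, which is the motivation the key points give for the distinction.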

Abstract

Large language models (LLMs) have shown strong performance in open-domain dialogue, yet they continue to struggle with long-term multi-session conversations (MSC), particularly in non-English languages such as Korean. In this work, we present a comprehensive empirical study on enhancing Korean MSC capabilities of LLMs through dataset construction, memory modeling, and parameter-efficient fine-tuning. We introduce an extended Korean MSC dataset that explicitly distinguishes between persona memory (long-term user attributes) and episode memory (short-term, event-driven information), enabling more effective memory management across sessions. Using this dataset, we evaluate LLM performance on three core MSC tasks: session summarization, memory update, and response generation. Our experiments reveal that Korean MSC is intrinsically more challenging than English MSC and that memory update and response generation require substantial reasoning ability. To address these challenges, we compare LoRA, DPO, MoE, CPT, Layer Tuning, and neuron-level tuning methods. Results consistently show that neuron tuning, guided by a novel language-specific neuron identification method based on activation scores and entropy, achieves superior performance and robustness, particularly in continual learning settings. Overall, our findings highlight neuron-level adaptation as an effective and interpretable approach for improving long-term conversational ability in low-resource languages.
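
The abstract says language-specific neurons are identified from activation scores and entropy but gives no formula. The sketch below shows one plausible reading, under stated assumptions: each neuron has a per-language activation rate, the rates are normalized into a distribution over languages, and a neuron counts as language-specific when that distribution has low entropy and the target language's rate is high. The function name `select_language_neurons` and the thresholds `max_entropy` and `min_prob` are hypothetical and are not taken from the paper.

```python
import math


def select_language_neurons(act_prob, target, max_entropy=0.5, min_prob=0.2):
    """Return indices of neurons that fire mostly for `target`.

    act_prob maps language -> list of per-neuron activation rates in [0, 1],
    e.g. the fraction of tokens on which the neuron's activation is positive.
    Thresholds are illustrative assumptions, not values from the paper.
    """
    langs = list(act_prob)
    n_neurons = len(act_prob[target])
    selected = []
    for j in range(n_neurons):
        rates = [act_prob[lang][j] for lang in langs]
        total = sum(rates)
        if total == 0:
            continue  # neuron never fires; nothing to attribute
        # Normalize rates into a distribution over languages.
        dist = [r / total for r in rates]
        # Shannon entropy (natural log); low entropy means one language dominates.
        entropy = -sum(p * math.log(p) for p in dist if p > 0)
        if entropy <= max_entropy and act_prob[target][j] >= min_prob:
            selected.append(j)
    return selected


# Toy activation rates for four neurons in two languages (made-up numbers).
rates = {"ko": [0.60, 0.30, 0.01, 0.0], "en": [0.02, 0.28, 0.50, 0.0]}
print(select_language_neurons(rates, "ko"))  # [0]
```

In a neuron-level tuning setup, only the parameters tied to the selected indices would be updated while the rest stay frozen. How the paper does this exactly is not described in the abstract.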

Cite this study

Kim et al. (2026) studied this question.

www.synapsesocial.com/papers/69c772818bbfbc51511e314f — DOI: https://doi.org/10.3390/app16073175

Authors

Ho Kim

Jeonghyun Kang

Journals

Applied Sciences

Institutions

Konkuk University

Electronics and Telecommunications Research Institute

Anyang University
