August 29, 2024Open Access

MemLong: Memory-Augmented Retrieval for Long Text Modeling

Puntos clave

Los puntos clave no están disponibles para este artículo en este momento.

Resumen

Recent advancements in Large Language Models (LLMs) have yielded remarkable success across diverse fields. However, handling long contexts remains a significant challenge for LLMs due to the quadratic time and space complexity of attention mechanisms and the growing memory consumption of the key-value cache during generation. This work introduces MemLong: Memory-Augmented Retrieval for Long Text Generation, a method designed to enhance the capabilities of long-context language modeling by utilizing an external retriever for historical information retrieval. MemLong combines a non-differentiable ``ret-mem'' module with a partially trainable decoder-only language model and introduces a fine-grained, controllable retrieval attention mechanism that leverages semantic-level relevant chunks. Comprehensive evaluations on multiple long-context language modeling benchmarks demonstrate that MemLong consistently outperforms other state-of-the-art LLMs. More importantly, MemLong can extend the context length on a single 3090 GPU from 4k up to 80k. Our code is available at https://github.com/Bui1dMySea/MemLong

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Liu et al. (Thu,) studied this question.

www.synapsesocial.com/papers/68e5a818b6db6435875424aa — DOI: https://doi.org/10.48550/arxiv.2408.16967

Authors

Weijie Liu

Zecheng Tang

Juntao Li

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

MemLong: Memory-Augmented Retrieval for Long Text Modeling

Puntos clave

Resumen

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion