What type of study is this?

This is a quantitative study.

October 20, 2025 · Open Access

LADM: Long-context Training Data Selection with Attention-based Dependency Measurement for LLMs

Key Points

LADM significantly improves LLM performance on long-context tasks with only 1B tokens for continual training.
The framework uses attention-based dependency measurement to effectively identify high-quality long-context data.
LADM systematically captures contextual dependencies to ensure comprehensive quality measurement of data.
The study highlights the importance of training data selection for optimizing long-context modeling in LLMs.

Abstract

Long-context modeling has drawn increasing attention in the area of Large Language Models (LLMs). Continual training with long-context data has become the de facto method for equipping LLMs with the ability to process long inputs. However, measuring the quality of long-context training data remains an open challenge. To address this issue, we propose a Long-context data selection framework with Attention-based Dependency Measurement (LADM), which can efficiently identify high-quality long-context data from a large-scale, multi-domain pre-training corpus. LADM leverages the retrieval capabilities of the attention mechanism to capture contextual dependencies, ensuring a comprehensive quality measurement of long-context data. Experimental results show that our LADM framework significantly boosts the performance of LLMs on multiple long-context tasks with only 1B tokens for continual training.
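
The idea of scoring a sample by how much its attention reaches back across the context can be illustrated with a toy sketch. This is not the LADM scoring function, which the abstract does not specify. It assumes a single causal attention map per sample and uses a hypothetical score: the average share of attention mass that each token places on tokens at least `min_distance` positions earlier. The names `causal_attention`, `long_range_dependency`, `select_top`, and `min_distance` are illustrative only.

```python
import math


def causal_attention(scores):
    """Row-wise softmax under a causal mask: token i sees only tokens 0..i."""
    attn = []
    for i, row in enumerate(scores):
        visible = row[: i + 1]
        peak = max(visible)  # subtract the max for numerical stability
        exps = [math.exp(s - peak) for s in visible]
        total = sum(exps)
        # Masked (future) positions get zero weight.
        attn.append([e / total for e in exps] + [0.0] * (len(row) - i - 1))
    return attn


def long_range_dependency(attn, min_distance):
    """Hypothetical score: mean share of attention on tokens >= min_distance back."""
    shares = []
    for i, row in enumerate(attn):
        if i < min_distance:
            continue  # no token is far enough back for this query
        shares.append(sum(row[: i - min_distance + 1]))
    return sum(shares) / len(shares) if shares else 0.0


def select_top(samples, k, min_distance):
    """Rank (name, attention map) pairs by score and keep the k best names."""
    ranked = sorted(
        samples,
        key=lambda s: long_range_dependency(s[1], min_distance),
        reverse=True,
    )
    return [name for name, _ in ranked[:k]]
```

Under this assumed score, a sample whose tokens attend mostly to themselves or their immediate neighbours ranks below one whose attention is spread over distant context, which is the kind of distinction a dependency-based selector would need to make.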

Authors

Jianghao Chen

Junhong Wu

Yangyifan Xu
