What type of study is this?

This is an experimental study.

October 9, 2025 · Open Access

ETT: Expanding the Long Context Understanding Capability of LLMs at Test-Time

Key points

ETT extends context length up to 32k tokens while maintaining a constant memory requirement.
Extending the context from 1k to 32k tokens improved model accuracy by up to 30 percent.
Fine-tuning only the second layer of the FFNs was more effective than full fine-tuning.
Evaluated on LongBench with GPT-Large and Phi-2, at constant memory and linear computation overhead.
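
To make the constant-memory point concrete, here is a rough back-of-the-envelope sketch of how many pairwise attention scores a single head needs for a full 32k-token pass versus one short chunk. The 1k chunk length is an assumption chosen for illustration, and the count ignores weights, activations, and optimizer state.

```python
def attention_entries(seq_len: int) -> int:
    """Pairwise attention scores for one head attending over seq_len tokens."""
    return seq_len * seq_len


FULL_CONTEXT = 32 * 1024  # 32k tokens, the extended length in the key points
CHUNK_LEN = 1024          # assumption: chunks no longer than a 1k base context

full_entries = attention_entries(FULL_CONTEXT)
chunk_entries = attention_entries(CHUNK_LEN)
ratio = full_entries // chunk_entries

print(f"full 32k attention: {full_entries:,} scores")
print(f"one 1k chunk:       {chunk_entries:,} scores")
print(f"ratio:              {ratio}x")
```

Because only one chunk is processed at a time, the per-step score count stays at the chunk-sized figure no matter how long the full input grows.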

Abstract

The computation and memory overhead of Transformer-based language models increases quadratically with sequence length. This quadratic cost poses challenges when employing LLMs to process long sequences. In this work, we introduce ETT (Extend at Test-Time), a method for extending the context length of short-context Transformer-based LLMs with a constant memory requirement and linear computation overhead. ETT enables the extension of the context length at test time by efficiently fine-tuning the model's parameters on the input context, chunked into small overlapping subsequences. We evaluate ETT on LongBench by extending the context length of GPT-Large and Phi-2 up to 32 times, from 1k to 32k tokens. This results in up to a 30 percent improvement in the models' accuracy. We also study how context can be stored in an LLM's weights effectively and efficiently. Through a detailed ablation study, we examine which Transformer modules are most beneficial to fine-tune at test time. Interestingly, we find that fine-tuning the second layer of the FFNs is more effective than full fine-tuning, leading to a further improvement in the models' accuracy.
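
The chunking step described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name `overlapping_chunks` and the `chunk_len` and `overlap` parameters are hypothetical, and the chunk size and overlap used in the paper are not stated here. The fine-tuning step itself is not shown.

```python
from typing import List, Sequence


def overlapping_chunks(
    tokens: Sequence[int], chunk_len: int, overlap: int
) -> List[List[int]]:
    """Split a token sequence into windows of at most chunk_len tokens,
    where consecutive windows share `overlap` tokens."""
    if chunk_len <= 0:
        raise ValueError("chunk_len must be positive")
    if not 0 <= overlap < chunk_len:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_len")
    if len(tokens) == 0:
        return []

    stride = chunk_len - overlap  # how far each window advances
    chunks: List[List[int]] = []
    start = 0
    while True:
        chunks.append(list(tokens[start:start + chunk_len]))
        if start + chunk_len >= len(tokens):
            break  # this window already reaches the end of the input
        start += stride
    return chunks


# Hypothetical usage: a 10-token "context" with 4-token windows overlapping by 2.
context = list(range(10))
for chunk in overlapping_chunks(context, chunk_len=4, overlap=2):
    print(chunk)
```

With a fixed `chunk_len`, the number of windows grows linearly with the input length while each window stays the same size, which is the intuition behind linear computation and constant memory.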

Cite this study

Zahirnia et al. studied this question.

www.synapsesocial.com/papers/68e8439a9989581a2fd4e24d — DOI: https://doi.org/10.48550/arxiv.2507.06313

Also consider

Synapse has enriched 5 closely related papers on similar research questions. Consider them for comparative context:

Training Ultra Long Context Language Model with Fully Pipelined Distributed Transformer · 2024
Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention· 2024 · 15 citations
Long-Context Language Modeling with Parallel Context Encoding· 2024 · 2 citations
DFMU: Distribution-based Framework for Modeling Aleatoric Uncertainty in Multimodal Sentiment Analysis· 2024 · 18 citations
Learning Neural Vocoder from Range-Null Space Decomposition

Authors

Kiarash Zahirnia

Zahra Golpayegani

Walid Ahmed
