September 10, 2025 (Open Access)

Analyzing and Adapting Large Language Models for Few-Shot Multilingual NLU: Are We There Yet?

Key points

Supervised instruction tuning offers an excellent trade-off between performance and resource costs in few-shot learning.
In-context learning, though increasingly popular, remains limited in language understanding for low-resource languages, even when target language adaptation improves generation.
A systematic comparison examines performance and costs of three approaches across multiple languages and NLU tasks.
Target language adaptation of pretrained models improves generation superficially but does not enhance language understanding effectively.

Abstract

Supervised fine-tuning (SFT), supervised instruction tuning (SIT), and in-context learning (ICL) are three alternative, de facto standard approaches to few-shot learning. ICL has gained popularity recently with the advent of LLMs due to its versatile simplicity and sample efficiency. Prior research has conducted only limited investigation into how these approaches work for multilingual few-shot learning, and the focus so far has been mostly on their performance. In this work, we present an extensive and systematic comparison of the three approaches, testing them on a variety of high- and low-resource languages over five different NLU tasks, and a myriad of language and domain setups. Importantly, performance is only one aspect of the comparison: we also analyze and discuss the approaches through the lens of their computational, inference, and financial costs. Among the highlighted findings is an excellent trade-off between performance and resource requirements/cost for SIT. We further analyze the impact of target language adaptation of pretrained LLMs and find that the standard adaptation approaches can (superficially) improve target language generation capabilities, but language understanding elicited through ICL does not improve accordingly and remains limited, especially for low-resource languages.

Authors

Evgeniia Razumovskaia

Ivan Vulić

Anna Korhonen

Journals

Transactions of the Association for Computational Linguistics

Institutions

University of Cambridge
