Abstract Supervised fine-tuning (SFT), supervised instruction tuning (SIT), and in-context learning (ICL) are three alternative, de facto standard approaches to few-shot learning. ICL has gained popularity recently with the advent of LLMs due to its versatile simplicity and sample efficiency. Prior research has conducted only limited investigation into how these approaches work for multilingual few-shot learning, and the focus so far has been mostly on their performance. In this work, we present an extensive and systematic comparison of the three approaches, testing them on a variety of high- and low-resource languages over five different NLU tasks, and a myriad of language and domain setups. Importantly, performance is only one aspect of the comparison, where we also analyze and discuss the approaches through the optics of their computational, inference and financial costs. Some of the highlighted findings concern an excellent trade-off between performance and resource requirements/cost for SIT. We further analyze the impact of target language adaptation of pretrained LLMs and find that the standard adaptation approaches can (superficially) improve target language generation capabilities, but language understanding elicited through ICL does not improve accordingly and remains limited, especially for low-resource languages.
Building similarity graph...
Analyzing shared references across papers
Loading...
Evgeniia Razumovskaia
Ivan Vulić
Anna Korhonen
Transactions of the Association for Computational Linguistics
University of Cambridge
Building similarity graph...
Analyzing shared references across papers
Loading...
Razumovskaia et al. (Wed,) studied this question.
www.synapsesocial.com/papers/68c18f399b7b07f3a06159e6 — DOI: https://doi.org/10.1162/tacl.a.33