We introduce LLaMA, a collection of foundation language models ranging from 7B to 65B parameters. We train our models on trillions of tokens, and show that it is possible to train state-of-the-art models using publicly available datasets exclusively, without resorting to proprietary and inaccessible datasets. In particular, LLaMA-13B outperforms GPT-3 (175B) on most benchmarks, and LLaMA-65B is competitive with the best models, Chinchilla-70B and PaLM-540B. We release all our models to the research community.
Hugo Touvron, Thibaut Lavril, Gautier Izacard, et al.
Touvron et al. (2023). LLaMA: Open and Efficient Foundation Language Models. DOI: https://doi.org/10.48550/arXiv.2302.13971