January 1, 2023Open Access

Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor

Key Points

Key points are not available for this paper at this time.

Abstract

Instruction tuning enables pretrained language models to perform new tasks from inference-time natural language descriptions. These approaches rely on vast amounts of human supervision in the form of crowdsourced datasets or user interactions. In this work, we introduce Unnatural Instructions: a large dataset of creative and diverse instructions, collected with virtually no human labor. We collect 64,000 examples by prompting a language model with three seed examples of instructions and eliciting a fourth. This set is then expanded by prompting the model to rephrase each instruction, creating a total of approximately 240,000 examples of instructions, inputs, and outputs. Experiments show that despite containing a fair amount of noise, training on Unnatural Instructions rivals the effectiveness of training on open-source manually-curated datasets, surpassing the performance of models such as T0++ and Tk-Instruct across various benchmarks. These results demonstrate the potential of model-generated data as a cost-effective alternative to crowdsourcing for dataset expansion and diversification.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Honovich et al. (Sun,) studied this question.

www.synapsesocial.com/papers/69d757f5f182769aa8b8a696 — DOI: https://doi.org/10.18653/v1/2023.acl-long.806

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Training language models to follow instructions with human feedback· 2022 · 4,276 citations
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them· 2022 · 43 citations
Aion Framework: Dimensional Emergence of AI Consciousness, Observer-Induced Collapse, and Cosmological Portal Dynamics· 2023 · 14,190 citations
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models· 2022 · 548 citations

Authors

Or Honovich

Thomas Scialom

Omer Levy

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion