February 2, 2019Open Access

NLPのためのパラメータ効率的転移学習

Key Points

Key points are not available for this paper at this time.

Abstract

大規模な事前学習済みモデルのファインチューニングは、NLPにおける効果的な転移手法です。しかし、多くの下流タスクがある場合、ファインチューニングはパラメータの非効率性を伴います：各タスクごとに新たなモデル全体が必要です。代替策として、我々はアダプターモジュールによる転移を提案します。アダプターモジュールは、コンパクトかつ拡張可能なモデルを実現します。タスクごとにわずかな訓練可能パラメータのみを追加し、新しいタスクは以前のタスクに戻ることなく追加可能です。元のネットワークのパラメータは固定されたままであり、高度なパラメータ共有を実現します。アダプターの有効性を示すために、我々は最近提案されたBERT TransformerモデルをGLUEベンチマークを含む26の多様なテキスト分類タスクに転移しました。アダプターは、タスクごとにわずかなパラメータを追加しながら、ほぼ最先端の性能を達成しました。GLUEでは、完全なファインチューニングの性能の0.4%以内に到達し、タスクごとに3.6%のパラメータを追加するのみでした。対照的に、ファインチューニングはタスクごとにパラメータの100%を訓練します。

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Neil Houlsby

Andrei Giurgiu

Stanisław Jastrzȩbski

Actions

Institutions

Université de Montréal

Google (United States)

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Houlsbyら（Sat,）はこの問題を研究しました。

www.synapsesocial.com/papers/6a0947ef0e219f8cdd33f325 — DOI: https://doi.org/10.48550/arxiv.1902.00751

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

In Defense of the Triplet Loss for Person Re-Identification· 2017 · 2,895 citations
Contributions to the study of SMS spam filtering· 2011 · 443 citations
BERT and PALs: Projected Attention Layers for Efficient Adaptation in Multi-Task Learning· 2019 · 113 citations
Multitask Learning· 1997 · 6,236 citations
NewsWeeder: Learning to Filter Netnews· 1995 · 2,046 citations

NLPのためのパラメータ効率的転移学習

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider