January 1, 2020Open Access

DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference

Key Points

Key points are not available for this paper at this time.

Abstract

Large-scale pre-trained language models such as BERT have brought significant improvements to NLP applications. However, they are also notorious for being slow in inference, which makes them difficult to deploy in realtime applications. We propose a simple but effective method, DeeBERT, to accelerate BERT inference. Our approach allows samples to exit earlier without passing through the entire model. Experiments show that DeeBERT is able to save up to 40% inference time with minimal degradation in model quality. Further analyses show different behaviors in the BERT transformer layers and also reveal their redundancy. Our work provides new ideas to efficiently apply deep transformer-based models to downstream tasks.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Xin et al. (Wed,) studied this question.

www.synapsesocial.com/papers/69dd605d80eea7d3f699c3eb — DOI: https://doi.org/10.18653/v1/2020.acl-main.204

Also consider

Synapse has enriched 4 closely related papers on similar clinical questions. Consider them for comparative context:

Natural Language Generation for Effective Knowledge Distillation· 2019 · 27 citations
TinyBERT: Distilling BERT for Natural Language Understanding· 2020 · 1,613 citations
Visualizing and Understanding Convolutional Networks· 2014 · 15,382 citations
Early exit optimizations for additive machine learned ranking systems· 2010 · 139 citations

Authors

Ji Xin

Raphael Tang

Jaejun Lee

Actions

Institutions

University of Waterloo

Vector Institute

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Authors

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion