January 1, 2019 · Open Access

BERT Rediscovers the Classical NLP Pipeline

Abstract

Pre-trained text encoders have rapidly advanced the state of the art on many NLP tasks. We focus on one such model, BERT, and aim to quantify where linguistic information is captured within the network. We find that the model represents the steps of the traditional NLP pipeline in an interpretable and localizable way, and that the regions responsible for each step appear in the expected sequence: POS tagging, parsing, NER, semantic roles, then coreference. Qualitative analysis reveals that the model can and often does adjust this pipeline dynamically, revising lower-level decisions on the basis of disambiguating information from higher-level representations.
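
To make "localizable" and "expected sequence" concrete, here is a minimal sketch of one way to summarize where in a layered encoder a task is handled. It assumes each task has one learned score per layer, normalizes those scores with a softmax, and reports the expected layer index. The function name `center_of_gravity`, the task keys, and the score values are all hypothetical and made up for illustration; they are not results from the paper, and this is not necessarily the exact procedure the authors used.

```python
import math


def center_of_gravity(layer_scores):
    """Expected layer index under softmax-normalized per-layer scores.

    layer_scores: one unnormalized score per layer (index 0 = lowest layer).
    """
    top = max(layer_scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in layer_scores]
    total = sum(exps)
    return sum(i * e / total for i, e in enumerate(exps))


# Made-up scores for a toy 5-layer encoder (assumption, not paper data).
toy_scores = {
    "pos": [2.0, 1.0, 0.0, -1.0, -2.0],    # weight concentrated in low layers
    "coref": [-2.0, -1.0, 0.0, 1.0, 2.0],  # weight concentrated in high layers
}

cog = {task: center_of_gravity(scores) for task, scores in toy_scores.items()}

# Sorting tasks by expected layer gives the order in which they are "resolved".
ordering = sorted(cog, key=cog.get)
print(ordering)
```

A task whose scores favor the lower layers gets a smaller expected layer index, so sorting tasks by this value yields a pipeline-like ordering that can be compared against the sequence named in the abstract.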

Authors

Ian Tenney

Dipanjan Das

Ellie Pavlick

Institutions

Google (United States)

Brown University
