Skip to content

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

← Back to topic

Authors: Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova
Year: 2018
Journal: NAACL
DOI: 10.18653/v1/N19-1423
Publisher: https://aclanthology.org/N19-1423/

Keywords: bert, pretraining, language model

Abstract

We introduce a new language representation model called BERT which stands for Bidirectional Encoder Representations from Transformers.

Cite this paper

bibtex
@misc{bert2018,
  title  = {BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding},
  author = {Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova},
  year   = {2018},
  journal = {NAACL},
  doi    = {10.18653/v1/N19-1423},
  url    = {https://doi.org/10.18653/v1/N19-1423},
}

Source files

Released under the MIT License.