BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Authors: Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova
Year: 2018
Journal: NAACL
DOI: 10.18653/v1/N19-1423
Publisher: https://aclanthology.org/N19-1423/
Keywords: bert, pretraining, language model
Abstract
We introduce a new language representation model called BERT, which stands for Bidirectional Encoder Representations from Transformers.
Cite this paper
bibtex
@misc{bert2018,
title = {BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding},
author = {Jacob Devlin and Ming-Wei Chang and Kenton Lee and Kristina Toutanova},
year = {2018},
journal = {NAACL},
doi = {10.18653/v1/N19-1423},
url = {https://doi.org/10.18653/v1/N19-1423},
}