Skip to content

DocBank: A Benchmark Dataset for Document Layout Analysis

← Back to topic

Authors: Bin Li, Mingxin Huang, Yijuan Lu
Year: 2017
Journal: arXiv
DOI: 10.48550/arXiv.2006.01038
Publisher: https://arxiv.org/abs/2006.01038

Keywords: docbank, document analysis

Abstract

We present DocBank a new large-scale dataset for document layout analysis with fine-grained token-level annotations.

Cite this paper

bibtex
@misc{docbank2017,
  title  = {DocBank: A Benchmark Dataset for Document Layout Analysis},
  author = {Bin Li, Mingxin Huang, Yijuan Lu},
  year   = {2017},
  journal = {arXiv},
  doi    = {10.48550/arXiv.2006.01038},
  url    = {https://doi.org/10.48550/arXiv.2006.01038},
}

Source files

Released under the MIT License.