Document Layout Analysis on PubLayNet val

Metric: Table (higher is better)

Results

#	Model	Table	Extra Data	Paper	Date	Code
1	VGT	0.981	No	Vision Grid Transformer for Document Layout Anal...	2023-08-29	Code
2	DETR	0.981	No	Bridging the Performance Gap between DETR and R-...	2023-06-23	-
3	LayoutLMv3-B	0.979	No	LayoutLMv3: Pre-training for Document AI with Un...	2022-04-18	Code
4	DiT-L	0.978	No	DiT: Self-supervised Pre-training for Document I...	2022-03-04	Code
5	CDeC-Net	0.978	No	CDeC-Net: Composite Deformable Cascade Network f...	2020-08-25	Code
6	DoPTA	0.977	No	DoPTA: Improving Document Layout Analysis using ...	2024-12-17	-
7	TRDLU	0.976	No	-	-	-
8	ResNext-101-32×8d	0.976	No	Vision Grid Transformer for Document Layout Anal...	2023-08-29	Code
9	VSR	0.974	No	VSR: A Unified Framework for Document Layout Ana...	2021-05-13	Code
10	UDoc	0.973	No	Unified Pretraining Framework for Document Under...	2022-04-22	-
11	BEiT-B	0.973	No	BEiT: BERT Pre-Training of Image Transformers	2021-06-15	Code
12	DeiT-B	0.972	No	Training data-efficient image transformers & dis...	2020-12-23	Code
13	Mask RCNN	0.96	No	PubLayNet: largest dataset ever for document lay...	2019-08-16	Code
14	Faster RCNN	0.954	No	PubLayNet: largest dataset ever for document lay...	2019-08-16	Code
15	GLAM	0.868	No	A Graphical Approach to Document Layout Analysis	2023-08-03	Code
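
Because the results are tab-separated, they are straightforward to load for further analysis. The following is a minimal sketch in Python, not part of the leaderboard itself: the `Entry` class, its field names, and the `earliest_best` helper are hypothetical names introduced for illustration, and only four columns (rank, model, score, date) from a handful of rows are copied in. It parses the rows and finds the top Table score, breaking ties by the earliest publication date.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional

# A few rows copied from the results table (tab-separated: rank, model, score, date).
# The column selection is an assumption made to keep the example short.
ROWS = """\
1\tVGT\t0.981\t2023-08-29
2\tDETR\t0.981\t2023-06-23
7\tTRDLU\t0.976\t-
13\tMask RCNN\t0.96\t2019-08-16
15\tGLAM\t0.868\t2023-08-03
"""


@dataclass
class Entry:  # hypothetical container, not a leaderboard API
    rank: int
    model: str
    table_score: float
    published: Optional[date]  # None when the leaderboard shows "-"


def parse_rows(text: str) -> List[Entry]:
    """Parse tab-separated leaderboard rows into Entry objects."""
    entries = []
    for line in text.strip().splitlines():
        rank, model, score, when = line.split("\t")
        published = None if when == "-" else date.fromisoformat(when)
        entries.append(Entry(int(rank), model, float(score), published))
    return entries


def earliest_best(entries: List[Entry]) -> Entry:
    """Return the top-scoring dated entry, breaking ties by earliest date."""
    dated = [e for e in entries if e.published is not None]
    return min(dated, key=lambda e: (-e.table_score, e.published))


entries = parse_rows(ROWS)
best = earliest_best(entries)
print(best.model, best.table_score)  # prints "DETR 0.981"
```

With these rows, VGT and DETR tie at 0.981, and the date tie-break selects DETR, which was published about two months earlier. Entries without a date, such as TRDLU, are skipped by the tie-break rather than guessed at.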

#1 VGT: 0.981 (Table, 2023-08-29)
Vision Grid Transformer for Document Layout Analysis (Code)

#2 DETR [SOTA]: 0.981 (Table, 2023-06-23)
Bridging the Performance Gap between DETR and R-CNN for Graphical Object Detection in Document Images

#3 LayoutLMv3-B [SOTA]: 0.979 (Table, 2022-04-18)
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking (Code)

#4 DiT-L: 0.978 (Table, 2022-03-04)
DiT: Self-supervised Pre-training for Document Image Transformer (Code)

#5 CDeC-Net [SOTA]: 0.978 (Table, 2020-08-25)
CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images (Code)

#6 DoPTA: 0.977 (Table, 2024-12-17)
DoPTA: Improving Document Layout Analysis using Patch-Text Alignment

#7 TRDLU: 0.976 (Table)
No paper

#8 ResNext-101-32×8d: 0.976 (Table, 2023-08-29)
Vision Grid Transformer for Document Layout Analysis (Code)

#9 VSR: 0.974 (Table, 2021-05-13)
VSR: A Unified Framework for Document Layout Analysis combining Vision, Semantics and Relations (Code)

#10 UDoc: 0.973 (Table, 2022-04-22)
Unified Pretraining Framework for Document Understanding

#11 BEiT-B: 0.973 (Table, 2021-06-15)
BEiT: BERT Pre-Training of Image Transformers (Code)

#12 DeiT-B: 0.972 (Table, 2020-12-23)
Training data-efficient image transformers & distillation through attention (Code)

#13 Mask RCNN [SOTA]: 0.96 (Table, 2019-08-16)
PubLayNet: largest dataset ever for document layout analysis (Code)

#14 Faster RCNN: 0.954 (Table, 2019-08-16)
PubLayNet: largest dataset ever for document layout analysis (Code)

#15 GLAM: 0.868 (Table, 2023-08-03)
A Graphical Approach to Document Layout Analysis (Code)