Document Layout Analysis on PubLayNet val

Metric: Title (higher is better)

Results

#	Model	Title	SOTA badge	Extra Data	Paper	Date	Code
1	VGT	0.939	Yes	No	Vision Grid Transformer for Document Layout Analysis	2023-08-29	Yes
2	VSR	0.931	Yes	No	VSR: A Unified Framework for Document Layout Analysis combining Vision, Semantics and Relations	2021-05-13	Yes
3	TRDLU	0.921	-	No	-	-	-
4	DETR	0.918	-	No	Bridging the Performance Gap between DETR and R-CNN for Graphical Object Detection in Document Images	2023-06-23	-
5	LayoutLMv3-B	0.906	-	No	LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking	2022-04-18	Yes
6	DoPTA	0.895	-	No	DoPTA: Improving Document Layout Analysis using Patch-Text Alignment	2024-12-17	-
7	DiT-L	0.893	-	No	DiT: Self-supervised Pre-training for Document Image Transformer	2022-03-04	Yes
8	UDoc	0.885	-	No	Unified Pretraining Framework for Document Understanding	2022-04-22	-
9	DeiT-B	0.874	Yes	No	Training data-efficient image transformers & distillation through attention	2020-12-23	Yes
10	BEiT-B	0.866	-	No	BEiT: BERT Pre-Training of Image Transformers	2021-06-15	Yes
11	ResNext-101-32×8d	0.862	-	No	Vision Grid Transformer for Document Layout Analysis	2023-08-29	Yes
12	Mask RCNN	0.84	Yes	No	PubLayNet: largest dataset ever for document layout analysis	2019-08-16	Yes
13	Faster RCNN	0.826	-	No	PubLayNet: largest dataset ever for document layout analysis	2019-08-16	Yes
14	GLAM	0.8	-	No	A Graphical Approach to Document Layout Analysis	2023-08-03	Yes