Document Layout Analysis on PubLayNet val

Metric: Text (higher is better)

LeaderboardDataset

Loading chart...

Results

Submit a result

Sort:

#	Model↕	Text▼	Extra Data	Paper	Date↕	Code
1	VSR	0.967	No	VSR: A Unified Framework for Document Layout Ana...	2021-05-13	Code
2	TRDLU	0.958	No	-	-	-
3	VGT	0.95	No	Vision Grid Transformer for Document Layout Anal...	2023-08-29	Code
4	DETR	0.947	No	Bridging the Performance Gap between DETR and R-...	2023-06-23	-
5	LayoutLMv3-B	0.945	No	LayoutLMv3: Pre-training for Document AI with Un...	2022-04-18	Code
6	DoPTA	0.944	No	DoPTA: Improving Document Layout Analysis using ...	2024-12-17	-
7	DiT-L	0.944	No	DiT: Self-supervised Pre-training for Document I...	2022-03-04	Code
8	UDoc	0.939	No	Unified Pretraining Framework for Document Under...	2022-04-22	-
9	DeiT-B	0.934	No	Training data-efficient image transformers & dis...	2020-12-23	Code
10	BEiT-B	0.934	No	BEiT: BERT Pre-Training of Image Transformers	2021-06-15	Code
11	ResNext-101-32×8d	0.93	No	Vision Grid Transformer for Document Layout Anal...	2023-08-29	Code
12	Mask RCNN	0.916	No	PubLayNet: largest dataset ever for document lay...	2019-08-16	Code
13	Faster RCNN	0.91	No	PubLayNet: largest dataset ever for document lay...	2019-08-16	Code
14	GLAM	0.878	No	A Graphical Approach to Document Layout Analysis	2023-08-03	Code

#1VSRSOTA
0.967
Text· 2021-05-13
VSR: A Unified Framework for Document Layout Analysis combining Vision, Semantics and Relations Code
#2TRDLU
0.958
Text
No paper
#3VGT
0.95
Text· 2023-08-29
Vision Grid Transformer for Document Layout Analysis Code
#4DETR
0.947
Text· 2023-06-23
Bridging the Performance Gap between DETR and R-CNN for Graphical Object Detection in Document Images
#5LayoutLMv3-B
0.945
Text· 2022-04-18
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking Code
#6DoPTA
0.944
Text· 2024-12-17
DoPTA: Improving Document Layout Analysis using Patch-Text Alignment
#7DiT-L
0.944
Text· 2022-03-04
DiT: Self-supervised Pre-training for Document Image Transformer Code
#8UDoc
0.939
Text· 2022-04-22
Unified Pretraining Framework for Document Understanding
#9DeiT-BSOTA
0.934
Text· 2020-12-23
Training data-efficient image transformers & distillation through attention Code
#10BEiT-B
0.934
Text· 2021-06-15
BEiT: BERT Pre-Training of Image Transformers Code
#11ResNext-101-32×8d
0.93
Text· 2023-08-29
Vision Grid Transformer for Document Layout Analysis Code
#12Mask RCNNSOTA
0.916
Text· 2019-08-16
PubLayNet: largest dataset ever for document layout analysis Code
#13Faster RCNN
0.91
Text· 2019-08-16
PubLayNet: largest dataset ever for document layout analysis Code
#14GLAM
0.878
Text· 2023-08-03
A Graphical Approach to Document Layout Analysis Code