Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Computer Vision
/
Document Layout Analysis
/
PubLayNet val
Document Layout Analysis on PubLayNet val
Metric: Table (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Export CSV
Sort:
Table (best first)
Table (worst first)
Date (newest first)
Date (oldest first)
Model name (A→Z)
#
Model
↕
Table
▼
Extra Data
Paper
Date
↕
Code
1
VGT
0.981
No
Vision Grid Transformer for Document Layout Anal...
2023-08-29
Code
2
DETR
0.981
No
Bridging the Performance Gap between DETR and R-...
2023-06-23
-
3
LayoutLMv3-B
0.979
No
LayoutLMv3: Pre-training for Document AI with Un...
2022-04-18
Code
4
DiT-L
0.978
No
DiT: Self-supervised Pre-training for Document I...
2022-03-04
Code
5
CDeC-Net
0.978
No
CDeC-Net: Composite Deformable Cascade Network f...
2020-08-25
Code
6
DoPTA
0.977
No
DoPTA: Improving Document Layout Analysis using ...
2024-12-17
-
7
TRDLU
0.976
No
-
-
-
8
ResNext-101-32×8d
0.976
No
Vision Grid Transformer for Document Layout Anal...
2023-08-29
Code
9
VSR
0.974
No
VSR: A Unified Framework for Document Layout Ana...
2021-05-13
Code
10
UDoc
0.973
No
Unified Pretraining Framework for Document Under...
2022-04-22
-
11
BEiT-B
0.973
No
BEiT: BERT Pre-Training of Image Transformers
2021-06-15
Code
12
DeiT-B
0.972
No
Training data-efficient image transformers & dis...
2020-12-23
Code
13
Mask RCNN
0.96
No
PubLayNet: largest dataset ever for document lay...
2019-08-16
Code
14
Faster RCNN
0.954
No
PubLayNet: largest dataset ever for document lay...
2019-08-16
Code
15
GLAM
0.868
No
A Graphical Approach to Document Layout Analysis
2023-08-03
Code
#1
VGT
0.981
Table
· 2023-08-29
Vision Grid Transformer for Document Layout Analysis
Code
#2
DETR
SOTA
0.981
Table
· 2023-06-23
Bridging the Performance Gap between DETR and R-CNN for Graphical Object Detection in Document Images
#3
LayoutLMv3-B
SOTA
0.979
Table
· 2022-04-18
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking
Code
#4
DiT-L
0.978
Table
· 2022-03-04
DiT: Self-supervised Pre-training for Document Image Transformer
Code
#5
CDeC-Net
SOTA
0.978
Table
· 2020-08-25
CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images
Code
#6
DoPTA
0.977
Table
· 2024-12-17
DoPTA: Improving Document Layout Analysis using Patch-Text Alignment
#7
TRDLU
0.976
Table
No paper
#8
ResNext-101-32×8d
0.976
Table
· 2023-08-29
Vision Grid Transformer for Document Layout Analysis
Code
#9
VSR
0.974
Table
· 2021-05-13
VSR: A Unified Framework for Document Layout Analysis combining Vision, Semantics and Relations
Code
#10
UDoc
0.973
Table
· 2022-04-22
Unified Pretraining Framework for Document Understanding
#11
BEiT-B
0.973
Table
· 2021-06-15
BEiT: BERT Pre-Training of Image Transformers
Code
#12
DeiT-B
0.972
Table
· 2020-12-23
Training data-efficient image transformers & distillation through attention
Code
#13
Mask RCNN
SOTA
0.96
Table
· 2019-08-16
PubLayNet: largest dataset ever for document layout analysis
Code
#14
Faster RCNN
0.954
Table
· 2019-08-16
PubLayNet: largest dataset ever for document layout analysis
Code
#15
GLAM
0.868
Table
· 2023-08-03
A Graphical Approach to Document Layout Analysis
Code