Scene Text Recognition on IIIT5k
Metric: Accuracy (higher is better)
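The page does not say how accuracy is computed. A common convention for scene text recognition benchmarks is word-level accuracy: the percentage of test images whose predicted string exactly matches the ground truth. The sketch below assumes that convention and case-insensitive matching by default. The function name `word_accuracy`, its `case_sensitive` parameter, and the toy strings are hypothetical and are not taken from this leaderboard or any listed paper.

```python
def word_accuracy(predictions, references, case_sensitive=False):
    """Percentage of predictions that exactly match their reference string.

    Assumption: exact string match per word image, optionally ignoring case.
    Individual papers may apply further normalization (e.g. character filtering).
    """
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        raise ValueError("at least one sample is required")

    def normalize(text):
        # Lowercase both sides unless a case-sensitive comparison is requested.
        return text if case_sensitive else text.lower()

    correct = sum(
        normalize(p) == normalize(r) for p, r in zip(predictions, references)
    )
    return 100.0 * correct / len(references)


# Hypothetical toy data, not drawn from IIIT5k.
preds = ["hello", "W0RLD", "text", "scene"]
refs = ["Hello", "WORLD", "text", "scene"]
score = word_accuracy(preds, refs)
```

Because reported scores can depend on case handling and the allowed character set, numbers from different papers may not be computed under identical rules.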
Results
| # | Model | Accuracy (%) | Extra Data | Paper | Date | Code Available |
|---|-------|--------------|------------|-------|------|----------------|
| 1 | CLIP4STR-L (DataComp-1B) | 99.6 | Yes | CLIP4STR: A Simple Baseline for Scene Text Recog... | 2023-05-23 | Yes |
| 2 | DTrOCR 105M | 99.6 | No | DTrOCR: Decoder-only Transformer for Optical Cha... | 2023-08-30 | Yes |
| 3 | CLIP4STR-L | 99.5 | Yes | CLIP4STR: A Simple Baseline for Scene Text Recog... | 2023-05-23 | Yes |
| 4 | CLIP4STR-B (DataComp-1B) | 99.5 | Yes | CLIP4STR: A Simple Baseline for Scene Text Recog... | 2023-05-23 | Yes |
| 5 | CPPD | 99.3 | Yes | Context Perception Parallel Decoder for Scene Te... | 2023-07-23 | Yes |
| 6 | CLIP4STR-B | 99.2 | Yes | CLIP4STR: A Simple Baseline for Scene Text Recog... | 2023-05-23 | Yes |
| 7 | MGP-STR | 98.8 | Yes | Multi-Granularity Prediction for Scene Text Reco... | 2022-09-08 | Yes |
| 8 | CCD-ViT-Small(ARD_2.8M) | 98 | Yes | Self-supervised Character-to-Character Distillat... | 2022-11-01 | Yes |
| 9 | CCD-ViT-Base(ARD_2.8M) | 98 | Yes | Self-supervised Character-to-Character Distillat... | 2022-11-01 | Yes |
| 10 | S-GTR | 97.5 | Yes | Visual Semantics Allow for Textual Reasoning Bet... | 2021-12-24 | Yes |
| 11 | DiffusionSTR | 97.3 | No | DiffusionSTR: Diffusion Model for Scene Text Rec... | 2023-06-29 | No |
| 12 | CCD-ViT-Tiny(ARD_2.8M) | 97.1 | Yes | Self-supervised Character-to-Character Distillat... | 2022-11-01 | Yes |
| 13 | SIGA_S | 96.9 | No | Self-supervised Implicit Glyph Attention for Tex... | 2022-03-07 | Yes |
| 14 | MATRN | 96.6 | No | Multi-modal Text Recognition Networks: Interacti... | 2021-11-30 | Yes |
| 15 | CDistNet (Ours) | 96.57 | No | CDistNet: Perceiving Multi-Domain Character Dist... | 2021-11-22 | Yes |
| 16 | DPAN | 96.2 | No | - | - | Yes |