TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Audio/2D Semantic Segmentation/IIIT5k

2D Semantic Segmentation on IIIT5k

Metric: Accuracy (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Accuracy▼Extra DataPaperDate↕Code
1CLIP4STR-L (DataComp-1B)99.6YesCLIP4STR: A Simple Baseline for Scene Text Recog...2023-05-23Code
2DTrOCR 105M99.6NoDTrOCR: Decoder-only Transformer for Optical Cha...2023-08-30Code
3CLIP4STR-L99.5YesCLIP4STR: A Simple Baseline for Scene Text Recog...2023-05-23Code
4CLIP4STR-B (DataComp-1B)99.5YesCLIP4STR: A Simple Baseline for Scene Text Recog...2023-05-23Code
5CPPD99.3YesContext Perception Parallel Decoder for Scene Te...2023-07-23Code
6CLIP4STR-B99.2YesCLIP4STR: A Simple Baseline for Scene Text Recog...2023-05-23Code
7MGP-STR98.8YesMulti-Granularity Prediction for Scene Text Reco...2022-09-08Code
8CCD-ViT-Small(ARD_2.8M)98YesSelf-supervised Character-to-Character Distillat...2022-11-01Code
9CCD-ViT-Base(ARD_2.8M)98YesSelf-supervised Character-to-Character Distillat...2022-11-01Code
10S-GTR97.5YesVisual Semantics Allow for Textual Reasoning Bet...2021-12-24Code
11DiffusionSTR97.3NoDiffusionSTR: Diffusion Model for Scene Text Rec...2023-06-29-
12CCD-ViT-Tiny(ARD_2.8M)97.1YesSelf-supervised Character-to-Character Distillat...2022-11-01Code
13SIGA_S96.9NoSelf-supervised Implicit Glyph Attention for Tex...2022-03-07Code
14MATRN96.6NoMulti-modal Text Recognition Networks: Interacti...2021-11-30Code
15CDistNet (Ours)96.57NoCDistNet: Perceiving Multi-Domain Character Dist...2021-11-22Code
16DPAN96.2No--Code