TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Audio/2D Semantic Segmentation/SVTP

2D Semantic Segmentation on SVTP

Metric: Accuracy (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Accuracy▼Extra DataPaperDate↕Code
1DTrOCR 105M98.6NoDTrOCR: Decoder-only Transformer for Optical Cha...2023-08-30Code
2MGP-STR98.3YesMulti-Granularity Prediction for Scene Text Reco...2022-09-08Code
3CLIP4STR-L*98.13YesAn Empirical Study of Scaling Law for OCR2023-12-29Code
4CLIP4STR-L (DataComp-1B)98.1NoCLIP4STR: A Simple Baseline for Scene Text Recog...2023-05-23Code
5CLIP4STR-L97.4NoCLIP4STR: A Simple Baseline for Scene Text Recog...2023-05-23Code
6CLIP4STR-B97.2YesCLIP4STR: A Simple Baseline for Scene Text Recog...2023-05-23Code
7CPPD96.7YesContext Perception Parallel Decoder for Scene Te...2023-07-23Code
8CCD-ViT-Base96.1YesSelf-supervised Character-to-Character Distillat...2022-11-01Code
9CCD-ViT-Small92.7YesSelf-supervised Character-to-Character Distillat...2022-11-01Code
10CCD-ViT-Tiny91.6YesSelf-supervised Character-to-Character Distillat...2022-11-01Code
11S-GTR90.6YesVisual Semantics Allow for Textual Reasoning Bet...2021-12-24Code
12MATRN90.6NoMulti-modal Text Recognition Networks: Interacti...2021-11-30Code
13SIGA_T90.5NoSelf-supervised Implicit Glyph Attention for Tex...2022-03-07Code
14CDistNet (Ours)89.77NoCDistNet: Perceiving Multi-Domain Character Dist...2021-11-22Code
15DiffusionSTR89.2NoDiffusionSTR: Diffusion Model for Scene Text Rec...2023-06-29-
16DPAN89No--Code