Open Vocabulary Semantic Segmentation on PASCAL Context-59
Metric: mIoU (higher is better)
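For readers unfamiliar with the metric, the following is a minimal sketch of how mean intersection-over-union (mIoU) can be computed for one set of predictions. It is not the evaluation code behind any entry on this leaderboard. The function name `mean_iou`, the flat label sequences, and the `ignore_index` value of 255 are assumptions made for illustration. Benchmark evaluations usually accumulate intersections and unions over the whole validation set before averaging over classes.

```python
from typing import Sequence


def mean_iou(
    pred: Sequence[int],
    target: Sequence[int],
    num_classes: int,
    ignore_index: int = 255,  # assumed "void" label, not taken from the source
) -> float:
    """Return mIoU as a percentage over classes present in pred or target."""
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")

    intersection = [0] * num_classes
    pred_area = [0] * num_classes
    target_area = [0] * num_classes

    for p, t in zip(pred, target):
        if t == ignore_index:
            continue  # pixels without a valid ground-truth label are skipped
        pred_area[p] += 1
        target_area[t] += 1
        if p == t:
            intersection[p] += 1

    ious = []
    for c in range(num_classes):
        union = pred_area[c] + target_area[c] - intersection[c]
        if union > 0:  # classes absent from both pred and target are excluded
            ious.append(intersection[c] / union)

    return 100.0 * sum(ious) / len(ious) if ious else 0.0


# Toy example with two classes and four pixels.
score = mean_iou(pred=[0, 0, 1, 1], target=[0, 1, 1, 1], num_classes=2)
print(f"mIoU: {score:.2f}")
```

For PASCAL Context-59, `num_classes` would be 59. Per-class IoU is intersection divided by union, and the reported number is the mean over classes, so a higher value is better.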
| # | Model | mIoU | Extra Data | Paper | Date | Code |
|---|-------|------|------------|-------|------|------|
| 1 | HyperSeg | 64.6 | Yes | HyperSeg: Towards Universal Visual Segmentation ... | 2024-11-26 | Code |
| 2 | SILC | 63.5 | No | SILC: Improving Vision Language Pretraining with... | 2023-10-20 | - |
| 3 | CAT-Seg | 63.3 | No | CAT-Seg: Cost Aggregation for Open-Vocabulary Se... | 2023-03-21 | Code |
| 4 | MaskCLIP++ | 62.5 | No | MaskCLIP++: A Mask-Based CLIP Fine-tuning Framew... | 2024-12-16 | Code |
| 5 | CLIPSelf | 62.3 | No | CLIPSelf: Vision Transformer Distills Itself for... | 2023-10-02 | Code |
| 6 | UMG-CLIP-L/14 | 61 | No | UMG-CLIP: A Unified Multi-Granularity Vision Gen... | 2024-01-12 | Code |
| 7 | SED | 60.6 | No | SED: A Simple Encoder-Decoder for Open-Vocabular... | 2023-11-27 | Code |
| 8 | Mask-Adapter | 60.4 | No | Mask-Adapter: The Devil is in the Masks for Open... | 2024-12-05 | Code |
| 9 | EBSeg-L | 60.2 | No | Open-Vocabulary Semantic Segmentation with Image... | 2024-06-14 | Code |
| 10 | MAFT+ | 59.4 | No | Collaborative Vision-Text Representation Optimiz... | 2024-08-01 | Code |
| 11 | SCAN | 59.3 | No | Open-Vocabulary Segmentation with Semantic-Assis... | 2023-12-07 | Code |
| 12 | MAFT-ViTL | 58.5 | No | Learning Mask-aware CLIP Representations for Zer... | 2023-09-30 | Code |
| 13 | FC-CLIP | 58.4 | No | Convolutions Die Hard: Open-Vocabulary Segmentat... | 2023-08-04 | Code |
| 14 | ODISE | 57.3 | No | Open-Vocabulary Panoptic Segmentation with Text-... | 2023-03-08 | Code |
| 15 | OVSeg Swin-B | 55.7 | No | Open-Vocabulary Semantic Segmentation with Mask-... | 2022-10-09 | Code |
| 16 | PACL | 50.1 | No | Open Vocabulary Semantic Segmentation with Patch... | 2022-12-09 | Code |
| 17 | SimSeg | 47.7 | No | A Simple Baseline for Open-Vocabulary Semantic S... | 2021-12-29 | Code |
| 18 | MaskCLIP | 45.9 | No | Open-Vocabulary Universal Image Segmentation wit... | 2022-08-18 | Code |
| 19 | TagAlign (trained with image-text pairs) | 37.6 | No | TagAlign: Improving Vision-Language Alignment wi... | 2023-12-21 | Code |
| 20 | TTD (TCL) | 37.4 | No | TTD: Text-Tag Self-Distillation Enhancing Image-... | 2024-03-30 | Code |
| 21 | LaVG | 34.7 | No | In Defense of Lazy Visual Grounding for Open-Voc... | 2024-08-09 | Code |
| 22 | TCL | 33.9 | No | Learning to Generate Text-grounded Mask for Open... | 2022-12-01 | Code |
| 23 | TTD (MaskCLIP) | 31 | No | TTD: Text-Tag Self-Distillation Enhancing Image-... | 2024-03-30 | Code |
| 24 | CLIP Surgery (original CLIP without any fine-tuning) | 29.3 | No | A Closer Look at the Explainability of Contrasti... | 2023-04-12 | Code |