Open Vocabulary Panoptic Segmentation on ADE20K

Metric: PQ (higher is better)

LeaderboardDataset

Loading chart...

Results

Submit a result

Sort:

#	Model↕	PQ▼	Extra Data	Paper	Date↕	Code
1	UMG-CLIP-E/14	31.6	No	UMG-CLIP: A Unified Multi-Granularity Vision Gen...	2024-01-12	Code
2	PosSAM	29.2	No	PosSAM: Panoptic Open-vocabulary Segment Anything	2024-03-14	Code
3	UMG-CLIP-L/14	29.1	No	UMG-CLIP: A Unified Multi-Granularity Vision Gen...	2024-01-12	Code
4	MAFT+	27.1	No	Collaborative Vision-Text Representation Optimiz...	2024-08-01	Code
5	FC-CLIP	26.8	No	Convolutions Die Hard: Open-Vocabulary Segmentat...	2023-08-04	Code
6	CLIPSelf	23.7	No	CLIPSelf: Vision Transformer Distills Itself for...	2023-10-02	Code
7	ODISE(Caption)	23.4	No	Open-Vocabulary Panoptic Segmentation with Text-...	2023-03-08	Code
8	ODISE (Label)	22.6	No	Open-Vocabulary Panoptic Segmentation with Text-...	2023-03-08	Code
9	FreeSeg	16.3	No	FreeSeg: Unified, Universal and Open-Vocabulary ...	2023-03-30	-
10	MaskCLIP	15.1	No	Extract Free Dense Labels from CLIP	2021-12-02	Code

#1UMG-CLIP-E/14SOTA
31.6
PQ· 2024-01-12
UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding Code
#2PosSAM
29.2
PQ· 2024-03-14
PosSAM: Panoptic Open-vocabulary Segment Anything Code
#3UMG-CLIP-L/14
29.1
PQ· 2024-01-12
UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding Code
#4MAFT+
27.1
PQ· 2024-08-01
Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation Code
#5FC-CLIPSOTA
26.8
PQ· 2023-08-04
Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP Code
#6CLIPSelf
23.7
PQ· 2023-10-02
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction Code
#7ODISE(Caption)SOTA
23.4
PQ· 2023-03-08
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models Code
#8ODISE (Label)
22.6
PQ· 2023-03-08
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models Code
#9FreeSeg
16.3
PQ· 2023-03-30
FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation
#10MaskCLIPSOTA
15.1
PQ· 2021-12-02
Extract Free Dense Labels from CLIP Code