Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Open Vocabulary Semantic Segmentation on PASCAL Context-59

Metric: mIoU (higher is better)
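
For readers unfamiliar with the metric, the following is a minimal sketch of how mean intersection-over-union can be computed from flat lists of predicted and ground-truth class ids. The function name `mean_iou`, the `ignore_index=255` convention, and the choice to skip classes absent from both prediction and ground truth are assumptions made for illustration. The leaderboard does not specify the exact evaluation protocol, and published results usually accumulate intersections and unions over the whole validation set before averaging.

```python
from collections import Counter


def mean_iou(pred, target, num_classes, ignore_index=255):
    """Return the mean IoU (in [0, 1]) over classes seen in pred or target.

    pred and target are equal-length flat sequences of integer class ids.
    Pixels whose ground-truth label equals ignore_index are skipped
    (255 is an assumed convention, not taken from the leaderboard).
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")

    intersection = Counter()  # pixels where prediction and label agree, per class
    pred_count = Counter()    # pixels predicted as each class
    target_count = Counter()  # pixels labelled as each class

    for p, t in zip(pred, target):
        if t == ignore_index:
            continue
        pred_count[p] += 1
        target_count[t] += 1
        if p == t:
            intersection[p] += 1

    ious = []
    for c in range(num_classes):
        union = pred_count[c] + target_count[c] - intersection[c]
        if union == 0:
            continue  # class absent from both prediction and ground truth
        ious.append(intersection[c] / union)

    return sum(ious) / len(ious) if ious else float("nan")


# Toy example with 3 classes; the last pixel is ignored.
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 255]
score = mean_iou(pred, target, num_classes=3)
print(f"mIoU: {100 * score:.1f}")
```

In the toy example the per-class IoUs are 1/2, 2/3, and 1, so the mean is about 0.722. The leaderboard values are on a 0 to 100 scale, which corresponds to multiplying this result by 100.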

Results

| # | Model | mIoU | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | HyperSeg | 64.6 | Yes | HyperSeg: Towards Universal Visual Segmentation ... | 2024-11-26 | Code |
| 2 | SILC | 63.5 | No | SILC: Improving Vision Language Pretraining with... | 2023-10-20 | - |
| 3 | CAT-Seg | 63.3 | No | CAT-Seg: Cost Aggregation for Open-Vocabulary Se... | 2023-03-21 | Code |
| 4 | MaskCLIP++ | 62.5 | No | MaskCLIP++: A Mask-Based CLIP Fine-tuning Framew... | 2024-12-16 | Code |
| 5 | CLIPSelf | 62.3 | No | CLIPSelf: Vision Transformer Distills Itself for... | 2023-10-02 | Code |
| 6 | UMG-CLIP-L/14 | 61 | No | UMG-CLIP: A Unified Multi-Granularity Vision Gen... | 2024-01-12 | Code |
| 7 | SED | 60.6 | No | SED: A Simple Encoder-Decoder for Open-Vocabular... | 2023-11-27 | Code |
| 8 | Mask-Adapter | 60.4 | No | Mask-Adapter: The Devil is in the Masks for Open... | 2024-12-05 | Code |
| 9 | EBSeg-L | 60.2 | No | Open-Vocabulary Semantic Segmentation with Image... | 2024-06-14 | Code |
| 10 | MAFT+ | 59.4 | No | Collaborative Vision-Text Representation Optimiz... | 2024-08-01 | Code |
| 11 | SCAN | 59.3 | No | Open-Vocabulary Segmentation with Semantic-Assis... | 2023-12-07 | Code |
| 12 | MAFT-ViTL | 58.5 | No | Learning Mask-aware CLIP Representations for Zer... | 2023-09-30 | Code |
| 13 | FC-CLIP | 58.4 | No | Convolutions Die Hard: Open-Vocabulary Segmentat... | 2023-08-04 | Code |
| 14 | ODISE | 57.3 | No | Open-Vocabulary Panoptic Segmentation with Text-... | 2023-03-08 | Code |
| 15 | OVSeg Swin-B | 55.7 | No | Open-Vocabulary Semantic Segmentation with Mask-... | 2022-10-09 | Code |
| 16 | PACL | 50.1 | No | Open Vocabulary Semantic Segmentation with Patch... | 2022-12-09 | Code |
| 17 | SimSeg | 47.7 | No | A Simple Baseline for Open-Vocabulary Semantic S... | 2021-12-29 | Code |
| 18 | MaskCLIP | 45.9 | No | Open-Vocabulary Universal Image Segmentation wit... | 2022-08-18 | Code |
| 19 | TagAlign (trained with image-text pairs) | 37.6 | No | TagAlign: Improving Vision-Language Alignment wi... | 2023-12-21 | Code |
| 20 | TTD (TCL) | 37.4 | No | TTD: Text-Tag Self-Distillation Enhancing Image-... | 2024-03-30 | Code |
| 21 | LaVG | 34.7 | No | In Defense of Lazy Visual Grounding for Open-Voc... | 2024-08-09 | Code |
| 22 | TCL | 33.9 | No | Learning to Generate Text-grounded Mask for Open... | 2022-12-01 | Code |
| 23 | TTD (MaskCLIP) | 31 | No | TTD: Text-Tag Self-Distillation Enhancing Image-... | 2024-03-30 | Code |
| 24 | CLIP Surgery (original CLIP without any fine-tuning) | 29.3 | No | A Closer Look at the Explainability of Contrasti... | 2023-04-12 | Code |