TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/UMG-CLIP-E/14

UMG-CLIP-E/14

Reported on 14 benchmarks across 5 tasks · 1 paper · 7 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Computer Vision8 results

  • Open Vocabulary Panoptic SegmentationonADE20K
    PQ· 2024-01-12
    31.6
    SOTA
    UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World UnderstandingarXiv:2401.06397
  • Open Vocabulary Semantic SegmentationonADE20K-847
    mIoU· 2024-01-12
    17.3
    SOTA
    UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World UnderstandingarXiv:2401.06397
  • Open Vocabulary Semantic SegmentationonPascalVOC-20b
    mIoU· 2024-01-12
    85.4
    SOTA
    UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World UnderstandingarXiv:2401.06397
  • Open Vocabulary Semantic SegmentationonADE20K-150
    mIoU· 2024-01-12
    38.2
    SOTA
    UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World UnderstandingarXiv:2401.06397
  • Panoptic SegmentationonCOCO minival
    mIoU· uses extra data· 2024-01-12
    69.7
    SOTA
    UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World UnderstandingarXiv:2401.06397
  • Open Vocabulary Semantic SegmentationonPASCAL Context-459
    mIoU· 2024-01-12
    25.2
    best: 25.8 (SILC)
    UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World UnderstandingarXiv:2401.06397
  • Panoptic SegmentationonCOCO minival
    AP· uses extra data· 2024-01-12
    50.7
    best: 53.2 (OpenSeeD (SwinL, single-scale))
    UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World UnderstandingarXiv:2401.06397
  • Panoptic SegmentationonCOCO minival
    PQ· uses extra data· 2024-01-12
    59.5
    best: 61.2 (HyperSeg (Swin-B))
    UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World UnderstandingarXiv:2401.06397

Medical3 results

  • Semantic SegmentationonCOCO minival
    mIoU· uses extra data· 2024-01-12
    69.7
    SOTA
    UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World UnderstandingarXiv:2401.06397
  • Semantic SegmentationonCOCO minival
    AP· uses extra data· 2024-01-12
    50.7
    best: 53.2 (OpenSeeD (SwinL, single-scale))
    UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World UnderstandingarXiv:2401.06397
  • Semantic SegmentationonCOCO minival
    PQ· uses extra data· 2024-01-12
    59.5
    best: 61.2 (HyperSeg (Swin-B))
    UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World UnderstandingarXiv:2401.06397

Audio3 results

  • 10-shot image generationonCOCO minival
    mIoU· uses extra data· 2024-01-12
    69.7
    SOTA
    UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World UnderstandingarXiv:2401.06397
  • 10-shot image generationonCOCO minival
    AP· uses extra data· 2024-01-12
    50.7
    best: 53.2 (OpenSeeD (SwinL, single-scale))
    UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World UnderstandingarXiv:2401.06397
  • 10-shot image generationonCOCO minival
    PQ· uses extra data· 2024-01-12
    59.5
    best: 61.2 (HyperSeg (Swin-B))
    UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World UnderstandingarXiv:2401.06397