Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Mask2Former (ResNet-50)

Reported on 10 benchmarks across 6 tasks · 3 papers

Note: results are matched by exact model name. Different papers may use the same name for different model variants.
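As a rough illustration of what exact-name matching implies, the sketch below groups result rows by the raw model-name string with no normalization. It is not the site's actual matching code: the record layout and the function name `group_by_exact_name` are assumptions, and the `Mask2Former (R50)` row is a hypothetical spelling variant added only to show that it would not be merged with `Mask2Former (ResNet-50)`.

```python
from collections import defaultdict

# Illustrative records. The first two rows reuse values listed on this page;
# the third row is a HYPOTHETICAL spelling variant invented for the example.
results = [
    {"model": "Mask2Former (ResNet-50)", "benchmark": "LaRS",
     "metric": "PQ", "value": 37.6},
    {"model": "Mask2Former (ResNet-50)", "benchmark": "Cityscapes val",
     "metric": "mask AP", "value": 37.4},
    {"model": "Mask2Former (R50)", "benchmark": "Cityscapes val",
     "metric": "mask AP", "value": 37.4},
]


def group_by_exact_name(rows):
    """Group result rows by the exact model-name string (no normalization)."""
    groups = defaultdict(list)
    for row in rows:
        groups[row["model"]].append(row)
    return dict(groups)


groups = group_by_exact_name(results)

# The two spellings end up in separate groups, even if they describe the
# same underlying model.
for name, rows in groups.items():
    print(name, len(rows))
```

The converse also holds: two papers that use the identical string for different variants would land in the same group, which is the caveat the note describes.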

Computer Vision · 7 results

  • Panoptic Segmentation on LaRS
    PQ · 2023-08-18
    37.6
    best: 41.7 (Mask2Former (Swin-B))
    LaRS: A Diverse Panoptic Maritime Obstacle Detection Dataset and Benchmark · arXiv:2308.09618
  • Video Instance Segmentation on YouTube-VIS validation
    AP50 · 2021-12-20
    68
    best: 89.3 (CAVIS (ViT-L, Online))
    Mask2Former for Video Instance Segmentation · arXiv:2112.10764
  • Video Instance Segmentation on YouTube-VIS validation
    AP75 · 2021-12-20
    50
    best: 76.2 (CAVIS (ViT-L, Online))
    Mask2Former for Video Instance Segmentation · arXiv:2112.10764
  • Video Instance Segmentation on YouTube-VIS validation
    mask AP · 2021-12-20
    46.4
    best: 68.9 (CAVIS (ViT-L, Online))
    Mask2Former for Video Instance Segmentation · arXiv:2112.10764
  • Instance Segmentation on Cityscapes val
    mask AP · 2021-12-02
    37.4
    best: 49 (ViT-P (OneFormer, ConvNeXt-L, single-scale, 512x1024, Mapillary Vistas-pretrained))
    Masked-attention Mask Transformer for Universal Image Segmentation · arXiv:2112.01527
  • Instance Segmentation on ADE20K val
    APL · 2021-12-02
    43.1
    best: 64.3 (OneFormer (InternImage-H, emb_dim=1024, single-scale, 896x896, COCO-Pretrained))
    Masked-attention Mask Transformer for Universal Image Segmentation · arXiv:2112.01527
  • Instance Segmentation on ADE20K val
    APM · 2021-12-02
    28.9
    best: 49.9 (OneFormer (InternImage-H, emb_dim=1024, single-scale, 896x896, COCO-Pretrained))
    Masked-attention Mask Transformer for Universal Image Segmentation · arXiv:2112.01527

Audio · 2 results

  • 10-shot image generation on LaRS
    PQ · 2023-08-18
    37.6
    best: 41.7 (Mask2Former (Swin-B))
    LaRS: A Diverse Panoptic Maritime Obstacle Detection Dataset and Benchmark · arXiv:2308.09618
  • 2D Semantic Segmentation on WildScenes
    mIoU · uses extra data · 2021-12-02
    43.71
    best: 47.85 (Mask2Former (Swin-L))
    Masked-attention Mask Transformer for Universal Image Segmentation · arXiv:2112.01527

Medical · 1 result

  • Semantic Segmentation on LaRS
    PQ · 2023-08-18
    37.6
    best: 41.7 (Mask2Former (Swin-B))
    LaRS: A Diverse Panoptic Maritime Obstacle Detection Dataset and Benchmark · arXiv:2308.09618