Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/Mask2Former (ResNet-50)

Mask2Former (ResNet-50)

Reported on 10 benchmarks across 6 tasks · 3 papers

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Computer Vision7 results

Panoptic SegmentationonLaRS
PQ· 2023-08-18
37.6
best: 41.7 (Mask2Former (Swin-B))
LaRS: A Diverse Panoptic Maritime Obstacle Detection Dataset and Benchmark arXiv:2308.09618
Video Instance SegmentationonYouTube-VIS validation
AP50· 2021-12-20
68
best: 89.3 (CAVIS(ViT-L, Online))
Mask2Former for Video Instance Segmentation arXiv:2112.10764
Video Instance SegmentationonYouTube-VIS validation
AP75· 2021-12-20
50
best: 76.2 (CAVIS(ViT-L, Online))
Mask2Former for Video Instance Segmentation arXiv:2112.10764
Video Instance SegmentationonYouTube-VIS validation
mask AP· 2021-12-20
46.4
best: 68.9 (CAVIS(ViT-L, Online))
Mask2Former for Video Instance Segmentation arXiv:2112.10764
Instance SegmentationonCityscapes val
mask AP· 2021-12-02
37.4
best: 49 (ViT-P (OneFormer, ConvNeXt-L, single-scale, 512x1024, Mapillary Vistas-pretrained))
Masked-attention Mask Transformer for Universal Image Segmentation arXiv:2112.01527
Instance SegmentationonADE20K val
APL· 2021-12-02
43.1
best: 64.3 (OneFormer (InternImage-H, emb_dim=1024, single-scale, 896x896, COCO-Pretrained))
Masked-attention Mask Transformer for Universal Image Segmentation arXiv:2112.01527
Instance SegmentationonADE20K val
APM· 2021-12-02
28.9
best: 49.9 (OneFormer (InternImage-H, emb_dim=1024, single-scale, 896x896, COCO-Pretrained))
Masked-attention Mask Transformer for Universal Image Segmentation arXiv:2112.01527

Audio2 results

10-shot image generationonLaRS
PQ· 2023-08-18
37.6
best: 41.7 (Mask2Former (Swin-B))
LaRS: A Diverse Panoptic Maritime Obstacle Detection Dataset and Benchmark arXiv:2308.09618
2D Semantic SegmentationonWildScenes
mIoU· uses extra data· 2021-12-02
43.71
best: 47.85 (Mask2Former (Swin-L))
Masked-attention Mask Transformer for Universal Image Segmentation arXiv:2112.01527

Medical1 result

Semantic SegmentationonLaRS
PQ· 2023-08-18
37.6
best: 41.7 (Mask2Former (Swin-B))
LaRS: A Diverse Panoptic Maritime Obstacle Detection Dataset and Benchmark arXiv:2308.09618