TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Video Instance Segmentation/YouTube-VIS validation

Video Instance Segmentation on YouTube-VIS validation

Metric: AP75 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕AP75▼Extra DataPaperDate↕Code
1CAVIS(ViT-L, Online)76.2YesContext-Aware Video Instance Segmentation2024-07-03Code
2DVIS++(ViT-L, Online)75.3YesDVIS++: Improved Decoupled Framework for Univers...2023-12-20Code
3DVIS72.7YesDVIS: Decoupled Video Instance Segmentation Fram...2023-06-06Code
4Tube-Link71.3NoTube-Link: A Flexible Cross Tube Framework for U...2023-03-22Code
5MinVIS (Swin-L)68.6NoMinVIS: A Minimal Video Instance Segmentation Fr...2022-08-03Code
6MDQE(Swin-L)67.3NoMDQE: Mining Discriminative Query Embeddings to ...2023-03-25Code
7Mask2Former (Swin-L)67NoMask2Former for Video Instance Segmentation2021-12-20Code
8SeqFormer (Swin-L)66.4YesSeqFormer: Sequential Transformer for Video Inst...2021-12-15Code
9DeVIS (Swin-L)66.3NoDeVIS: Making Deformable Transformers Work for V...2022-07-22Code
10TCIS (Swin-S)65.6No1st Place Solution for YouTubeVOS Challenge 2021...2021-06-12-
11UniVS(Swin-L)65.3YesUniVS: Unified and Universal Video Segmentation ...2024-02-28Code
12InstanceFormer(Swin-L)64.2YesInstanceFormer: An Online Video Instance Segment...2022-08-22Code
13Video K-Net (Swin-Base)59.6NoVideo K-Net: A Simple, Strong, and Unified Basel...2022-04-10Code
14NOVIS (ResNet-50)56.9YesNOVIS: A Case for End-to-End Near-Online Video I...2023-08-29-
15SeqFormer (ResNet-101)55.7YesSeqFormer: Sequential Transformer for Video Inst...2021-12-15Code
16MSN54.9NoMSN: Efficient Online Mask Selection Network for...2021-06-19Code
17Mask2Former (ResNet-101)54.2NoMask2Former for Video Instance Segmentation2021-12-20Code
18IDOL (ResNet-50)52.9NoIn Defense of Online Models for Video Instance S...2022-07-21Code
19SeqFormer (ResNet-50)51.8YesSeqFormer: Sequential Transformer for Video Inst...2021-12-15Code
20SeqFormer (ResNet-50)50.5NoSeqFormer: Sequential Transformer for Video Inst...2021-12-15Code
21Mask2Former (ResNet-50)50NoMask2Former for Video Instance Segmentation2021-12-20Code
22InstanceFormer(ResNet-50)49.6YesInstanceFormer: An Online Video Instance Segment...2022-08-22Code
23DeVIS (ResNet-50)48.6NoDeVIS: Making Deformable Transformers Work for V...2022-07-22Code
24IFC (ResNet-50)46.8NoVideo Instance Segmentation using Inter-Frame Co...2021-06-07Code
25VisTR(ResNet-101)45NoEnd-to-End Video Instance Segmentation with Tran...2020-11-30Code
26CrossVIS (ResNet-101)39.7NoCrossover Learning for Fast Online Video Instanc...2021-04-13Code
27PCAN(ResNet-50)39.4NoPrototypical Cross-Attention Networks for Multip...2021-06-22Code
28ObjProp (ResNet-50)39.2NoObject Propagation via Inter-Frame Attentions fo...2021-11-15Code
29STC (ResNet-50)38.6NoSTC: Spatio-Temporal Contrastive Learning for Vi...2022-02-08-
30CompFeat(ResNet-50)38.6NoCompFeat: Comprehensive Feature Aggregation for ...2020-12-07Code
31CSipMask38.1NoOccluded Video Instance Segmentation: A Benchmark2021-02-02Code
32STMask(R101-DCN-FPN)38NoSpatial Feature Calibration and Temporal Fusion ...2021-04-06Code
33STEm-Seg (ResNet-101)37.9NoSTEm-Seg: Spatio-temporal Embeddings for Instanc...2020-03-18Code
34STEm-Seg (ResNet-50)37.9NoSTEm-Seg: Spatio-temporal Embeddings for Instanc...2020-03-18Code
35VisTR(ResNet-50)36.9NoEnd-to-End Video Instance Segmentation with Tran...2020-11-30Code
36SipMask (ResNet-50, ms-train, single-scale test)35.8NoSipMask: Spatial Information Preservation for Fa...2020-07-29Code
37CMaskTrack R-CNN34.9NoOccluded Video Instance Segmentation: A Benchmark2021-02-02Code
38SipMask (ResNet-50, single-scale test)33.3NoSipMask: Spatial Information Preservation for Fa...2020-07-29Code
39OSMN33.1NoEfficient Video Object Segmentation via Network ...2018-02-04Code
40TraDeS32.8NoTrack to Detect and Segment: An Online Multi-Obj...2021-03-16Code
41MaskTrack R-CNN (ResNet-50, single-scale training and test)32.6NoVideo Instance Segmentation2019-05-12Code