Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Video Instance Segmentation on YouTube-VIS validation

Metric: AP50 (higher is better)
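
As a rough illustration of what the 50 in AP50 refers to: a predicted instance track is typically counted as a match when its mask IoU with a ground-truth track reaches 0.5, where for video the intersection and union are accumulated over all frames of the clip. The sketch below shows only that matching criterion, not the full average-precision computation. The names `video_mask_iou` and `is_match_at_50` and the pixel-set mask representation are assumptions made for this example, not part of any official evaluation toolkit.

```python
from typing import List, Set, Tuple

Pixel = Tuple[int, int]
# One pixel set per frame; an empty set means the instance is absent in that frame.
MaskSequence = List[Set[Pixel]]


def video_mask_iou(pred: MaskSequence, truth: MaskSequence) -> float:
    """Spatio-temporal IoU: intersections and unions are summed over all frames."""
    if len(pred) != len(truth):
        raise ValueError("both sequences must cover the same frames")
    intersection = sum(len(p & t) for p, t in zip(pred, truth))
    union = sum(len(p | t) for p, t in zip(pred, truth))
    # Two tracks that are empty in every frame have no overlap to measure.
    return intersection / union if union else 0.0


def is_match_at_50(pred: MaskSequence, truth: MaskSequence) -> bool:
    """Hypothetical helper: match criterion at the 0.5 IoU threshold."""
    return video_mask_iou(pred, truth) >= 0.5


# Two-frame toy example: 2 shared pixels, 4 pixels in the union overall.
pred = [{(0, 0), (0, 1)}, {(1, 1)}]
truth = [{(0, 0)}, {(1, 1), (1, 2)}]
print(video_mask_iou(pred, truth), is_match_at_50(pred, truth))
```

AP50 is then the average precision obtained when matches are decided at this single threshold, so a higher value means more tracks are both found and segmented with at least that overlap.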

Results

| # | Model | AP50 | Extra Data | Paper | Date | Code |
|---|-------|------|------------|-------|------|------|
| 1 | CAVIS (ViT-L, Online) | 89.3 | Yes | Context-Aware Video Instance Segmentation | 2024-07-03 | Code |
| 2 | DVIS++ (ViT-L, Online) | 88.8 | Yes | DVIS++: Improved Decoupled Framework for Univers... | 2023-12-20 | Code |
| 3 | DVIS | 88 | Yes | DVIS: Decoupled Video Instance Segmentation Fram... | 2023-06-06 | Code |
| 4 | Tube-Link | 86.6 | No | Tube-Link: A Flexible Cross Tube Framework for U... | 2023-03-22 | Code |
| 5 | MDQE (Swin-L) | 84.9 | No | MDQE: Mining Discriminative Query Embeddings to ... | 2023-03-25 | Code |
| 6 | Mask2Former (Swin-L) | 84.4 | No | Mask2Former for Video Instance Segmentation | 2021-12-20 | Code |
| 7 | MinVIS (Swin-L) | 83.3 | No | MinVIS: A Minimal Video Instance Segmentation Fr... | 2022-08-03 | Code |
| 8 | UniVS (Swin-L) | 82.1 | Yes | UniVS: Unified and Universal Video Segmentation ... | 2024-02-28 | Code |
| 9 | SeqFormer (Swin-L) | 82.1 | Yes | SeqFormer: Sequential Transformer for Video Inst... | 2021-12-15 | Code |
| 10 | DeVIS (Swin-L) | 80.8 | No | DeVIS: Making Deformable Transformers Work for V... | 2022-07-22 | Code |
| 11 | Video K-Net (Swin-Base) | 79 | No | Video K-Net: A Simple, Strong, and Unified Basel... | 2022-04-10 | Code |
| 12 | InstanceFormer (Swin-L) | 78 | Yes | InstanceFormer: An Online Video Instance Segment... | 2022-08-22 | Code |
| 13 | TCIS (Swin-S) | 76.6 | No | 1st Place Solution for YouTubeVOS Challenge 2021... | 2021-06-12 | - |
| 14 | NOVIS (ResNet-50) | 75.7 | Yes | NOVIS: A Case for End-to-End Near-Online Video I... | 2023-08-29 | - |
| 15 | IDOL (ResNet-50) | 74 | No | In Defense of Online Models for Video Instance S... | 2022-07-21 | Code |
| 16 | Mask2Former (ResNet-101) | 72.8 | No | Mask2Former for Video Instance Segmentation | 2021-12-20 | Code |
| 17 | SeqFormer (ResNet-101) | 71.1 | Yes | SeqFormer: Sequential Transformer for Video Inst... | 2021-12-15 | Code |
| 18 | SeqFormer (ResNet-50) | 69.8 | Yes | SeqFormer: Sequential Transformer for Video Inst... | 2021-12-15 | Code |
| 19 | MSN | 69.4 | No | MSN: Efficient Online Mask Selection Network for... | 2021-06-19 | Code |
| 20 | InstanceFormer (ResNet-50) | 68.6 | Yes | InstanceFormer: An Online Video Instance Segment... | 2022-08-22 | Code |
| 21 | Mask2Former (ResNet-50) | 68 | No | Mask2Former for Video Instance Segmentation | 2021-12-20 | Code |
| 22 | SeqFormer (ResNet-50) | 66.9 | No | SeqFormer: Sequential Transformer for Video Inst... | 2021-12-15 | Code |
| 23 | DeVIS (ResNet-50) | 66.7 | No | DeVIS: Making Deformable Transformers Work for V... | 2022-07-22 | Code |
| 24 | IFC (ResNet-50) | 65.8 | No | Video Instance Segmentation using Inter-Frame Co... | 2021-06-07 | Code |
| 25 | VisTR (ResNet-101) | 64 | No | End-to-End Video Instance Segmentation with Tran... | 2020-11-30 | Code |
| 26 | VisTR (ResNet-50) | 59.8 | No | End-to-End Video Instance Segmentation with Tran... | 2020-11-30 | Code |
| 27 | ObjProp (ResNet-50) | 59.4 | No | Object Propagation via Inter-Frame Attentions fo... | 2021-11-15 | Code |
| 28 | CrossVIS (ResNet-101) | 57.3 | No | Crossover Learning for Fast Online Video Instanc... | 2021-04-13 | Code |
| 29 | STC (ResNet-50) | 57.2 | No | STC: Spatio-Temporal Contrastive Learning for Vi... | 2022-02-08 | - |
| 30 | STMask (R101-DCN-FPN) | 56.8 | No | Spatial Feature Calibration and Temporal Fusion ... | 2021-04-06 | Code |
| 31 | CompFeat (ResNet-50) | 56 | No | CompFeat: Comprehensive Feature Aggregation for ... | 2020-12-07 | Code |
| 32 | STEm-Seg (ResNet-101) | 55.8 | No | STEm-Seg: Spatio-temporal Embeddings for Instanc... | 2020-03-18 | Code |
| 33 | CSipMask | 55.6 | No | Occluded Video Instance Segmentation: A Benchmark | 2021-02-02 | Code |
| 34 | PCAN (ResNet-50) | 54.9 | No | Prototypical Cross-Attention Networks for Multip... | 2021-06-22 | Code |
| 35 | SipMask (ResNet-50, ms-train, single-scale test) | 54.1 | No | SipMask: Spatial Information Preservation for Fa... | 2020-07-29 | Code |
| 36 | SipMask (ResNet-50, single-scale test) | 53 | No | SipMask: Spatial Information Preservation for Fa... | 2020-07-29 | Code |
| 37 | CMaskTrack R-CNN | 52.8 | No | Occluded Video Instance Segmentation: A Benchmark | 2021-02-02 | Code |
| 38 | TraDeS | 52.6 | No | Track to Detect and Segment: An Online Multi-Obj... | 2021-03-16 | Code |
| 39 | MaskTrack R-CNN (ResNet-50, single-scale training and test) | 51.1 | No | Video Instance Segmentation | 2019-05-12 | Code |
| 40 | STEm-Seg (ResNet-50) | 50.7 | No | STEm-Seg: Spatio-temporal Embeddings for Instanc... | 2020-03-18 | Code |
| 41 | DeepSORT | 31.3 | No | Simple Online and Realtime Tracking with a Deep ... | 2017-03-21 | Code |
| 42 | OSMN | 28.6 | No | Efficient Video Object Segmentation via Network ... | 2018-02-04 | Code |