TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Instance Segmentation/A2D Sentences

Instance Segmentation on A2D Sentences

Metric: IoU mean (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕IoU mean▼Extra DataPaperDate↕Code
1SOC (Video-Swin-B)0.725YesSOC: Semantic-Assisted Object Cluster for Referr...2023-05-26Code
2SgMg (Video-Swin-B)0.72YesSpectrum-guided Multi-granularity Referring Vide...2023-07-25Code
3ReferFormer (Video-Swin-B)0.703YesLanguage as Queries for Referring Video Object S...2022-01-03Code
4SOC (Video-Swin-T)0.669NoSOC: Semantic-Assisted Object Cluster for Referr...2023-05-26Code
5ClawCraneNet0.655NoClawCraneNet: Leveraging Object-level Relation f...2021-03-19-
6MTTR (w=10)0.64NoEnd-to-End Referring Video Object Segmentation w...2021-11-29Code
7MANET0.632NoMulti-Attention Network for Compressed Video Ref...2022-07-26Code
8MTTR (w=8)0.618NoEnd-to-End Referring Video Object Segmentation w...2021-11-29Code
9RefVOS0.599NoRefVOS: A Closer Look at Referring Expressions f...2020-10-01Code
10VLIDE0.598NoDeeply Interleaved Two-Stream Encoder for Referr...2022-03-30-
11Locater0.597NoLocal-Global Context Aware Transformer for Langu...2022-03-18Code
12CMPC-V (I3D)0.573NoCross-Modal Progressive Comprehension for Referr...2021-05-15Code
13Hui et al.0.561NoCollaborative Spatial-Temporal Modeling for Lang...2021-05-14-
14mmmmtbvs0.558NoModeling Motion with Multi-Modal Features for Te...2022-04-06Code
15AAMN0.552NoActor and Action Modular Network for Text-based ...2020-11-02-
16CMDy0.531No---
17PRPE0.529No---
18HINet0.529No---
19CMPC-V (R2D)0.515NoCross-Modal Progressive Comprehension for Referr...2021-05-15Code
20RefVOS0.497No---
21ACGA0.49No--Code
22VT-Capsule0.46No---
23CMSA+CFSA0.432NoReferring Segmentation in Images and Videos with...2021-02-09-
24Gavriluyk el al. (Optical flow)0.426NoActor and Action Video Segmentation from a Sente...2018-03-20Code
25Gavriluyk el al.0.421NoActor and Action Video Segmentation from a Sente...2018-03-20Code
26Li et al.0.354No---
27Hu et al.0.35NoSegmentation from Natural Language Expressions2016-03-20Code