Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

LWL

Reported on 28 benchmarks across 3 tasks · 2 papers · 3 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Computer Vision · 28 results

  • Video on DAVIS (no YouTube-VOS training)
    D17 val (J) · 2020-03-25
    72.2
    best: 77.7 (HMMN)
    SOTA
    Learning What to Learn for Video Object Segmentation · arXiv:2003.11540
  • Video Object Segmentation on DAVIS (no YouTube-VOS training)
    D17 val (J) · 2020-03-25
    72.2
    best: 77.7 (HMMN)
    SOTA
    Learning What to Learn for Video Object Segmentation · arXiv:2003.11540
  • Semi-Supervised Video Object Segmentation on DAVIS (no YouTube-VOS training)
    D17 val (J) · 2020-03-25
    72.2
    best: 77.7 (HMMN)
    SOTA
    Learning What to Learn for Video Object Segmentation · arXiv:2003.11540
  • Video on YouTube-VOS 2018
    F-Measure (Seen) · 2022-08-01
    84.9
    best: 91 (Cutie+ (base, MEGA))
    BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation · arXiv:2208.01159
  • Video on YouTube-VOS 2018
    F-Measure (Unseen) · 2022-08-01
    84.4
    best: 90.2 (XMem (BL30K, MS))
    BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation · arXiv:2208.01159
  • Video on YouTube-VOS 2018
    Jaccard (Seen) · 2022-08-01
    80.4
    best: 86.6 (Cutie+ (base, MEGA))
    BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation · arXiv:2208.01159
  • Video on YouTube-VOS 2018
    Jaccard (Unseen) · 2022-08-01
    76.4
    best: 82.2 (Cutie+ (base, MEGA))
    BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation · arXiv:2208.01159
  • Video on YouTube-VOS 2018
    Mean Jaccard & F-Measure · 2022-08-01
    81.5
    best: 86.9 (XMem (BL30K, MS))
    BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation · arXiv:2208.01159
  • Video on DAVIS 2017 (val)
    F-measure · 2022-08-01
    84.1
    best: 92.6 (XMem (BLK30K, MS))
    BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation · arXiv:2208.01159
  • Video on DAVIS 2017 (val)
    Jaccard · 2022-08-01
    79.1
    best: 86.3 (XMem (BLK30K, MS))
    BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation · arXiv:2208.01159
  • Video on DAVIS 2017 (val)
    Mean Jaccard & F-Measure · 2022-08-01
    81.6
    best: 89.5 (XMem (BLK30K, MS))
    BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation · arXiv:2208.01159
  • Video Object Segmentation on YouTube-VOS 2018
    F-Measure (Seen) · 2022-08-01
    84.9
    best: 91 (Cutie+ (base, MEGA))
    BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation · arXiv:2208.01159
  • Video Object Segmentation on YouTube-VOS 2018
    F-Measure (Unseen) · 2022-08-01
    84.4
    best: 90.2 (XMem (BL30K, MS))
    BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation · arXiv:2208.01159
  • Video Object Segmentation on YouTube-VOS 2018
    Jaccard (Seen) · 2022-08-01
    80.4
    best: 86.6 (Cutie+ (base, MEGA))
    BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation · arXiv:2208.01159
  • Video Object Segmentation on YouTube-VOS 2018
    Jaccard (Unseen) · 2022-08-01
    76.4
    best: 82.2 (Cutie+ (base, MEGA))
    BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation · arXiv:2208.01159
  • Video Object Segmentation on YouTube-VOS 2018
    Mean Jaccard & F-Measure · 2022-08-01
    81.5
    best: 86.9 (XMem (BL30K, MS))
    BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation · arXiv:2208.01159
  • Video Object Segmentation on DAVIS 2017 (val)
    F-measure · 2022-08-01
    84.1
    best: 92.6 (XMem (BLK30K, MS))
    BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation · arXiv:2208.01159
  • Video Object Segmentation on DAVIS 2017 (val)
    Jaccard · 2022-08-01
    79.1
    best: 86.3 (XMem (BLK30K, MS))
    BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation · arXiv:2208.01159
  • Video Object Segmentation on DAVIS 2017 (val)
    Mean Jaccard & F-Measure · 2022-08-01
    81.6
    best: 89.5 (XMem (BLK30K, MS))
    BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation · arXiv:2208.01159
  • Video on DAVIS (no YouTube-VOS training)
    D17 val (F) · 2020-03-25
    76.3
    best: 83.1 (HMMN)
    Learning What to Learn for Video Object Segmentation · arXiv:2003.11540
  • Video on DAVIS (no YouTube-VOS training)
    D17 val (G) · 2020-03-25
    74.3
    best: 80.4 (HMMN)
    Learning What to Learn for Video Object Segmentation · arXiv:2003.11540
  • Video on DAVIS (no YouTube-VOS training)
    FPS · 2020-03-25
    14
    best: 50.1 (TBD)
    Learning What to Learn for Video Object Segmentation · arXiv:2003.11540
  • Video Object Segmentation on DAVIS (no YouTube-VOS training)
    D17 val (F) · 2020-03-25
    76.3
    best: 83.1 (HMMN)
    Learning What to Learn for Video Object Segmentation · arXiv:2003.11540
  • Video Object Segmentation on DAVIS (no YouTube-VOS training)
    D17 val (G) · 2020-03-25
    74.3
    best: 80.4 (HMMN)
    Learning What to Learn for Video Object Segmentation · arXiv:2003.11540
  • Video Object Segmentation on DAVIS (no YouTube-VOS training)
    FPS · 2020-03-25
    14
    best: 50.1 (TBD)
    Learning What to Learn for Video Object Segmentation · arXiv:2003.11540
  • Semi-Supervised Video Object Segmentation on DAVIS (no YouTube-VOS training)
    D17 val (F) · 2020-03-25
    76.3
    best: 83.1 (HMMN)
    Learning What to Learn for Video Object Segmentation · arXiv:2003.11540
  • Semi-Supervised Video Object Segmentation on DAVIS (no YouTube-VOS training)
    D17 val (G) · 2020-03-25
    74.3
    best: 80.4 (HMMN)
    Learning What to Learn for Video Object Segmentation · arXiv:2003.11540
  • Semi-Supervised Video Object Segmentation on DAVIS (no YouTube-VOS training)
    FPS · 2020-03-25
    14
    best: 50.1 (TBD)
    Learning What to Learn for Video Object Segmentation · arXiv:2003.11540
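
Note on the mean scores: the combined metrics listed above (D17 val (G) and Mean Jaccard & F-Measure) are conventionally the arithmetic mean of the Jaccard and F-measure components, taken over the seen and unseen splits on YouTube-VOS. The sketch below is an illustrative consistency check on the LWL numbers from this page, not part of the site or of either paper; the helper names `mean` and `consistent` and the 0.1 tolerance (to absorb one-decimal rounding of the published scores) are assumptions made for the example.

```python
# LWL scores copied from the entries listed above.
# Each record: (component scores, reported combined score).
lwl_scores = {
    "DAVIS (no YouTube-VOS training)": ([72.2, 76.3], 74.3),   # J, F -> G
    "DAVIS 2017 (val)": ([79.1, 84.1], 81.6),                  # J, F -> J&F
    "YouTube-VOS 2018": ([80.4, 76.4, 84.9, 84.4], 81.5),      # J/F seen+unseen -> overall
}


def mean(values):
    """Arithmetic mean of a non-empty sequence of numbers."""
    values = list(values)
    return sum(values) / len(values)


def consistent(components, reported, tol=0.1):
    """True if the mean of the components matches the reported score within tol.

    The tolerance is an assumption: published scores are rounded to one
    decimal, so the recomputed mean may differ slightly from the reported one.
    """
    return abs(mean(components) - reported) <= tol


# Map each benchmark to whether its combined score agrees with its components.
checks = {name: consistent(parts, total) for name, (parts, total) in lwl_scores.items()}

for name, ok in checks.items():
    print(f"{name}: {'consistent' if ok else 'mismatch'}")
```

A check like this is mainly useful for spotting transcription errors when results from different papers are merged under one model name.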