Ye Yu, Jialin Yuan, Gaurav Mittal, Li Fuxin, Mei Chen
Video Object Segmentation (VOS) is fundamental to video understanding. Transformer-based methods show significant performance improvement on semi-supervised VOS. However, existing work faces challenges segmenting visually similar objects in close proximity of each other. In this paper, we propose a novel Bilateral Attention Transformer in Motion-Appearance Neighboring space (BATMAN) for semi-supervised VOS. It captures object motion in the video via a novel optical flow calibration module that fuses the segmentation mask with optical flow estimation to improve within-object optical flow smoothness and reduce noise at object boundaries. This calibrated optical flow is then employed in our novel bilateral attention, which computes the correspondence between the query and reference frames in the neighboring bilateral space considering both motion and appearance. Extensive experiments validate the effectiveness of BATMAN architecture by outperforming all existing state-of-the-art on all four popular VOS benchmarks: Youtube-VOS 2019 (85.0%), Youtube-VOS 2018 (85.3%), DAVIS 2017Val/Testdev (86.2%/82.2%), and DAVIS 2016 (92.5%).
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Video | YouTube-VOS 2019 | F-Measure (Seen) | 89.3 | BATMAN |
| Video | YouTube-VOS 2019 | F-Measure (Unseen) | 87.2 | BATMAN |
| Video | YouTube-VOS 2019 | Jaccard (Seen) | 84.5 | BATMAN |
| Video | YouTube-VOS 2019 | Jaccard (Unseen) | 79 | BATMAN |
| Video | YouTube-VOS 2019 | Mean Jaccard & F-Measure | 85 | BATMAN |
| Video | YouTube-VOS 2019 | Mean Jaccard & F-Measure | 84.1 | AOT |
| Video | YouTube-VOS 2019 | F-Measure (Seen) | 85.4 | STCN |
| Video | YouTube-VOS 2019 | F-Measure (Seen) | 85.1 | CFBI |
| Video | YouTube-VOS 2019 | F-Measure (Unseen) | 83 | CFBI |
| Video | YouTube-VOS 2019 | Jaccard (Seen) | 80.6 | CFBI |
| Video | YouTube-VOS 2019 | Jaccard (Unseen) | 75.2 | CFBI |
| Video | YouTube-VOS 2019 | Mean Jaccard & F-Measure | 81 | CFBI |
| Video | DAVIS 2016 | F-Score | 94.2 | BATMAN (val) |
| Video | DAVIS 2016 | J&F | 92.5 | BATMAN (val) |
| Video | DAVIS 2016 | Jaccard (Mean) | 90.7 | BATMAN (val) |
| Video | DAVIS 2016 | F-Score | 92.5 | STCN (val) |
| Video | DAVIS 2016 | J&F | 91.6 | STCN (val) |
| Video | DAVIS 2016 | Jaccard (Mean) | 90.8 | STCN (val) |
| Video | DAVIS 2016 | F-Score | 92.1 | AOT (val) |
| Video | DAVIS 2016 | J&F | 91.1 | AOT (val) |
| Video | DAVIS 2016 | Jaccard (Mean) | 90.1 | AOT (val) |
| Video | DAVIS 2016 | F-Score | 91.4 | LCM (val) |
| Video | DAVIS 2016 | J&F | 90.7 | LCM (val) |
| Video | DAVIS 2016 | Jaccard (Mean) | 89.9 | LCM (val) |
| Video | DAVIS 2016 | F-Score | 94 | RPCMVOS (val) |
| Video | DAVIS 2016 | J&F | 90.6 | RPCMVOS (val) |
| Video | DAVIS 2016 | Jaccard (Mean) | 87.1 | RPCMVOS (val) |
| Video | DAVIS 2016 | F-Score | 91.5 | KMN (val) |
| Video | DAVIS 2016 | J&F | 90.5 | KMN (val) |
| Video | DAVIS 2016 | Jaccard (Mean) | 89.5 | KMN (val) |
| Video | DAVIS 2016 | F-Score | 91.2 | TransVOS (val) |
| Video | DAVIS 2016 | J&F | 90.5 | TransVOS (val) |
| Video | DAVIS 2016 | Jaccard (Mean) | 89.8 | TransVOS (val) |
| Video | DAVIS 2016 | F-Score | 91.1 | CFBI+ (val) |
| Video | DAVIS 2016 | J&F | 89.9 | CFBI+ (val) |
| Video | DAVIS 2016 | Jaccard (Mean) | 88.7 | CFBI+ (val) |
| Video | DAVIS 2016 | F-Score | 90.5 | CFBI (val) |
| Video | DAVIS 2016 | J&F | 89.4 | CFBI (val) |
| Video | DAVIS 2016 | Jaccard (Mean) | 88.3 | CFBI (val) |
| Video | DAVIS 2016 | F-Score | 88.7 | RMN (val) |
| Video | DAVIS 2016 | J&F | 88.8 | RMN (val) |
| Video | DAVIS 2016 | Jaccard (Mean) | 88.9 | RMN (val) |
| Video | DAVIS 2016 | F-Score | 89.9 | STM (val) |
| Video | DAVIS 2016 | Jaccard (Mean) | 88.7 | STM (val) |
| Video | DAVIS 2017 (test-dev) | F-measure | 86.1 | BATMAN |
| Video | DAVIS 2017 (test-dev) | Jaccard | 78.4 | BATMAN |
| Video | DAVIS 2017 (test-dev) | Mean Jaccard & F-Measure | 82.2 | BATMAN |
| Video | DAVIS 2017 (test-dev) | F-measure | 81.8 | LCM |
| Video | DAVIS 2017 (test-dev) | Jaccard | 74.4 | LCM |
| Video | DAVIS 2017 (test-dev) | Mean Jaccard & F-Measure | 78.1 | LCM |
| Video | DAVIS 2017 (test-dev) | F-measure | 80.9 | TransVOS |
| Video | DAVIS 2017 (test-dev) | Jaccard | 73 | TransVOS |
| Video | DAVIS 2017 (test-dev) | Mean Jaccard & F-Measure | 76.9 | TransVOS |
| Video | DAVIS 2017 (test-dev) | F-measure | 79.6 | STCN |
| Video | DAVIS 2017 (test-dev) | Jaccard | 72.7 | STCN |
| Video | DAVIS 2017 (test-dev) | Mean Jaccard & F-Measure | 76.1 | STCN |
| Video | DAVIS 2017 (test-dev) | F-measure | 78.1 | RMN |
| Video | DAVIS 2017 (test-dev) | Jaccard | 71.9 | RMN |
| Video | DAVIS 2017 (test-dev) | Jaccard | 71.6 | CFBI+ |
| Video | DAVIS 2017 (test-dev) | Mean Jaccard & F-Measure | 75.6 | CFBI+ |
| Video | DAVIS 2017 (test-dev) | F-measure | 78.7 | CFBI |
| Video | DAVIS 2017 (test-dev) | Jaccard | 71.4 | CFBI |
| Video | DAVIS 2017 (test-dev) | Mean Jaccard & F-Measure | 75 | CFBI |
| Video | YouTube-VOS 2018 | F-Measure (Seen) | 88.5 | AOT |
| Video | YouTube-VOS 2018 | F-Measure (Unseen) | 86.1 | AOT |
| Video | YouTube-VOS 2018 | Jaccard (Seen) | 83.7 | AOT |
| Video | YouTube-VOS 2018 | Jaccard (Unseen) | 78.1 | AOT |
| Video | YouTube-VOS 2018 | Mean Jaccard & F-Measure | 84.1 | AOT |
| Video | YouTube-VOS 2018 | F-Measure (Seen) | 86.5 | STCN |
| Video | YouTube-VOS 2018 | F-Measure (Unseen) | 85.7 | STCN |
| Video | YouTube-VOS 2018 | Jaccard (Seen) | 81.9 | STCN |
| Video | YouTube-VOS 2018 | Jaccard (Unseen) | 77.9 | STCN |
| Video | YouTube-VOS 2018 | Mean Jaccard & F-Measure | 83 | STCN |
| Video | YouTube-VOS 2018 | Jaccard (Seen) | 82.2 | LCM |
| Video | YouTube-VOS 2018 | Mean Jaccard & F-Measure | 82 | LCM |
| Video | YouTube-VOS 2018 | F-Measure (Seen) | 86.7 | TransVOS |
| Video | YouTube-VOS 2018 | F-Measure (Unseen) | 83.4 | TransVOS |
| Video | YouTube-VOS 2018 | Jaccard (Seen) | 82 | TransVOS |
| Video | YouTube-VOS 2018 | Jaccard (Unseen) | 75 | TransVOS |
| Video | YouTube-VOS 2018 | Mean Jaccard & F-Measure | 81.8 | TransVOS |
| Video | YouTube-VOS 2018 | Jaccard (Seen) | 81.2 | SST |
| Video | YouTube-VOS 2018 | Jaccard (Unseen) | 76 | SST |
| Video | YouTube-VOS 2018 | Mean Jaccard & F-Measure | 81.7 | SST |
| Video | YouTube-VOS 2018 | F-Measure (Seen) | 84.9 | LWL |
| Video | YouTube-VOS 2018 | F-Measure (Unseen) | 84.4 | LWL |
| Video | YouTube-VOS 2018 | Jaccard (Seen) | 80.4 | LWL |
| Video | YouTube-VOS 2018 | Jaccard (Unseen) | 76.4 | LWL |
| Video | YouTube-VOS 2018 | Mean Jaccard & F-Measure | 81.5 | LWL |
| Video | YouTube-VOS 2018 | Jaccard (Unseen) | 75.3 | KMN |
| Video | YouTube-VOS 2018 | F-Measure (Seen) | 84.2 | STM |
| Video | YouTube-VOS 2018 | F-Measure (Unseen) | 80.9 | STM |
| Video | YouTube-VOS 2018 | Jaccard (Seen) | 79.7 | STM |
| Video | YouTube-VOS 2018 | Jaccard (Unseen) | 72.8 | STM |
| Video | YouTube-VOS 2018 | Mean Jaccard & F-Measure | 79.4 | STM |
| Video | YouTube-VOS 2018 | F-Measure (Seen) | 85.7 | RMN |
| Video | YouTube-VOS 2018 | F-Measure (Unseen) | 82.4 | RMN |
| Video | YouTube-VOS 2018 | Jaccard (Seen) | 82.1 | RMN |
| Video | YouTube-VOS 2018 | Jaccard (Unseen) | 75.7 | RMN |
| Video | DAVIS 2017 (val) | F-measure | 89.3 | BATMAN |
| Video | DAVIS 2017 (val) | Mean Jaccard & F-Measure | 86.2 | BATMAN |
| Video | DAVIS 2017 (val) | Jaccard | 82.2 | STCN |
| Video | DAVIS 2017 (val) | Mean Jaccard & F-Measure | 85.4 | STCN |
| Video | DAVIS 2017 (val) | F-measure | 87.5 | AOT |
| Video | DAVIS 2017 (val) | Jaccard | 82.3 | AOT |
| Video | DAVIS 2017 (val) | Mean Jaccard & F-Measure | 84.9 | AOT |
| Video | DAVIS 2017 (val) | F-measure | 86.4 | TransVOS |
| Video | DAVIS 2017 (val) | Jaccard | 81.4 | TransVOS |
| Video | DAVIS 2017 (val) | Mean Jaccard & F-Measure | 83.9 | TransVOS |
| Video | DAVIS 2017 (val) | F-measure | 86 | RMN |
| Video | DAVIS 2017 (val) | Mean Jaccard & F-Measure | 83.5 | RMN |
| Video | DAVIS 2017 (val) | F-measure | 85.1 | SST |
| Video | DAVIS 2017 (val) | Jaccard | 79.9 | SST |
| Video | DAVIS 2017 (val) | Mean Jaccard & F-Measure | 82.5 | SST |
| Video | DAVIS 2017 (val) | F-measure | 84.5 | CFBI |
| Video | DAVIS 2017 (val) | Jaccard | 79.3 | CFBI |
| Video | DAVIS 2017 (val) | F-measure | 84.1 | LWL |
| Video | DAVIS 2017 (val) | Jaccard | 79.1 | LWL |
| Video | DAVIS 2017 (val) | Mean Jaccard & F-Measure | 81.6 | LWL |
| Video | DAVIS 2017 (val) | F-measure | 86.5 | LCM |
| Video | DAVIS 2017 (val) | Jaccard | 80.5 | LCM |
| Video | YouTube-VOS 2018 | Jaccard (Unseen) | 75.3 | KMN |
| Video | YouTube-VOS 2018 | F-Measure (Unseen) | 83.4 | CFBI |
| Object Tracking | YouTube-VOS 2018 | Jaccard (Unseen) | 75.7 | RMN |
| Object Tracking | YouTube-VOS 2018 | Jaccard (Unseen) | 75.3 | KMN |
| Object Tracking | YouTube-VOS 2018 | F-Measure (Seen) | 86.7 | TransVOS |
| Object Tracking | YouTube-VOS 2018 | F-Measure (Unseen) | 83.4 | TransVOS |
| Object Tracking | YouTube-VOS 2018 | F-Measure (Unseen) | 83.4 | CFBI |
| Video Object Segmentation | YouTube-VOS 2019 | F-Measure (Seen) | 89.3 | BATMAN |
| Video Object Segmentation | YouTube-VOS 2019 | F-Measure (Unseen) | 87.2 | BATMAN |
| Video Object Segmentation | YouTube-VOS 2019 | Jaccard (Seen) | 84.5 | BATMAN |
| Video Object Segmentation | YouTube-VOS 2019 | Jaccard (Unseen) | 79 | BATMAN |
| Video Object Segmentation | YouTube-VOS 2019 | Mean Jaccard & F-Measure | 85 | BATMAN |
| Video Object Segmentation | YouTube-VOS 2019 | Mean Jaccard & F-Measure | 84.1 | AOT |
| Video Object Segmentation | YouTube-VOS 2019 | F-Measure (Seen) | 85.4 | STCN |
| Video Object Segmentation | YouTube-VOS 2019 | F-Measure (Seen) | 85.1 | CFBI |
| Video Object Segmentation | YouTube-VOS 2019 | F-Measure (Unseen) | 83 | CFBI |
| Video Object Segmentation | YouTube-VOS 2019 | Jaccard (Seen) | 80.6 | CFBI |
| Video Object Segmentation | YouTube-VOS 2019 | Jaccard (Unseen) | 75.2 | CFBI |
| Video Object Segmentation | YouTube-VOS 2019 | Mean Jaccard & F-Measure | 81 | CFBI |
| Video Object Segmentation | DAVIS 2016 | F-Score | 94.2 | BATMAN (val) |
| Video Object Segmentation | DAVIS 2016 | J&F | 92.5 | BATMAN (val) |
| Video Object Segmentation | DAVIS 2016 | Jaccard (Mean) | 90.7 | BATMAN (val) |
| Video Object Segmentation | DAVIS 2016 | F-Score | 92.5 | STCN (val) |
| Video Object Segmentation | DAVIS 2016 | J&F | 91.6 | STCN (val) |
| Video Object Segmentation | DAVIS 2016 | Jaccard (Mean) | 90.8 | STCN (val) |
| Video Object Segmentation | DAVIS 2016 | F-Score | 92.1 | AOT (val) |
| Video Object Segmentation | DAVIS 2016 | J&F | 91.1 | AOT (val) |
| Video Object Segmentation | DAVIS 2016 | Jaccard (Mean) | 90.1 | AOT (val) |
| Video Object Segmentation | DAVIS 2016 | F-Score | 91.4 | LCM (val) |
| Video Object Segmentation | DAVIS 2016 | J&F | 90.7 | LCM (val) |
| Video Object Segmentation | DAVIS 2016 | Jaccard (Mean) | 89.9 | LCM (val) |
| Video Object Segmentation | DAVIS 2016 | F-Score | 94 | RPCMVOS (val) |
| Video Object Segmentation | DAVIS 2016 | J&F | 90.6 | RPCMVOS (val) |
| Video Object Segmentation | DAVIS 2016 | Jaccard (Mean) | 87.1 | RPCMVOS (val) |
| Video Object Segmentation | DAVIS 2016 | F-Score | 91.5 | KMN (val) |
| Video Object Segmentation | DAVIS 2016 | J&F | 90.5 | KMN (val) |
| Video Object Segmentation | DAVIS 2016 | Jaccard (Mean) | 89.5 | KMN (val) |
| Video Object Segmentation | DAVIS 2016 | F-Score | 91.2 | TransVOS (val) |
| Video Object Segmentation | DAVIS 2016 | J&F | 90.5 | TransVOS (val) |
| Video Object Segmentation | DAVIS 2016 | Jaccard (Mean) | 89.8 | TransVOS (val) |
| Video Object Segmentation | DAVIS 2016 | F-Score | 91.1 | CFBI+ (val) |
| Video Object Segmentation | DAVIS 2016 | J&F | 89.9 | CFBI+ (val) |
| Video Object Segmentation | DAVIS 2016 | Jaccard (Mean) | 88.7 | CFBI+ (val) |
| Video Object Segmentation | DAVIS 2016 | F-Score | 90.5 | CFBI (val) |
| Video Object Segmentation | DAVIS 2016 | J&F | 89.4 | CFBI (val) |
| Video Object Segmentation | DAVIS 2016 | Jaccard (Mean) | 88.3 | CFBI (val) |
| Video Object Segmentation | DAVIS 2016 | F-Score | 88.7 | RMN (val) |
| Video Object Segmentation | DAVIS 2016 | J&F | 88.8 | RMN (val) |
| Video Object Segmentation | DAVIS 2016 | Jaccard (Mean) | 88.9 | RMN (val) |
| Video Object Segmentation | DAVIS 2016 | F-Score | 89.9 | STM (val) |
| Video Object Segmentation | DAVIS 2016 | Jaccard (Mean) | 88.7 | STM (val) |
| Video Object Segmentation | DAVIS 2017 (test-dev) | F-measure | 86.1 | BATMAN |
| Video Object Segmentation | DAVIS 2017 (test-dev) | Jaccard | 78.4 | BATMAN |
| Video Object Segmentation | DAVIS 2017 (test-dev) | Mean Jaccard & F-Measure | 82.2 | BATMAN |
| Video Object Segmentation | DAVIS 2017 (test-dev) | F-measure | 81.8 | LCM |
| Video Object Segmentation | DAVIS 2017 (test-dev) | Jaccard | 74.4 | LCM |
| Video Object Segmentation | DAVIS 2017 (test-dev) | Mean Jaccard & F-Measure | 78.1 | LCM |
| Video Object Segmentation | DAVIS 2017 (test-dev) | F-measure | 80.9 | TransVOS |
| Video Object Segmentation | DAVIS 2017 (test-dev) | Jaccard | 73 | TransVOS |
| Video Object Segmentation | DAVIS 2017 (test-dev) | Mean Jaccard & F-Measure | 76.9 | TransVOS |
| Video Object Segmentation | DAVIS 2017 (test-dev) | F-measure | 79.6 | STCN |
| Video Object Segmentation | DAVIS 2017 (test-dev) | Jaccard | 72.7 | STCN |
| Video Object Segmentation | DAVIS 2017 (test-dev) | Mean Jaccard & F-Measure | 76.1 | STCN |
| Video Object Segmentation | DAVIS 2017 (test-dev) | F-measure | 78.1 | RMN |
| Video Object Segmentation | DAVIS 2017 (test-dev) | Jaccard | 71.9 | RMN |
| Video Object Segmentation | DAVIS 2017 (test-dev) | Jaccard | 71.6 | CFBI+ |
| Video Object Segmentation | DAVIS 2017 (test-dev) | Mean Jaccard & F-Measure | 75.6 | CFBI+ |
| Video Object Segmentation | DAVIS 2017 (test-dev) | F-measure | 78.7 | CFBI |
| Video Object Segmentation | DAVIS 2017 (test-dev) | Jaccard | 71.4 | CFBI |
| Video Object Segmentation | DAVIS 2017 (test-dev) | Mean Jaccard & F-Measure | 75 | CFBI |
| Video Object Segmentation | YouTube-VOS 2018 | F-Measure (Seen) | 88.5 | AOT |
| Video Object Segmentation | YouTube-VOS 2018 | F-Measure (Unseen) | 86.1 | AOT |
| Video Object Segmentation | YouTube-VOS 2018 | Jaccard (Seen) | 83.7 | AOT |
| Video Object Segmentation | YouTube-VOS 2018 | Jaccard (Unseen) | 78.1 | AOT |
| Video Object Segmentation | YouTube-VOS 2018 | Mean Jaccard & F-Measure | 84.1 | AOT |
| Video Object Segmentation | YouTube-VOS 2018 | F-Measure (Seen) | 86.5 | STCN |
| Video Object Segmentation | YouTube-VOS 2018 | F-Measure (Unseen) | 85.7 | STCN |
| Video Object Segmentation | YouTube-VOS 2018 | Jaccard (Seen) | 81.9 | STCN |
| Video Object Segmentation | YouTube-VOS 2018 | Jaccard (Unseen) | 77.9 | STCN |
| Video Object Segmentation | YouTube-VOS 2018 | Mean Jaccard & F-Measure | 83 | STCN |
| Video Object Segmentation | YouTube-VOS 2018 | Jaccard (Seen) | 82.2 | LCM |
| Video Object Segmentation | YouTube-VOS 2018 | Mean Jaccard & F-Measure | 82 | LCM |
| Video Object Segmentation | YouTube-VOS 2018 | F-Measure (Seen) | 86.7 | TransVOS |
| Video Object Segmentation | YouTube-VOS 2018 | F-Measure (Unseen) | 83.4 | TransVOS |
| Video Object Segmentation | YouTube-VOS 2018 | Jaccard (Seen) | 82 | TransVOS |
| Video Object Segmentation | YouTube-VOS 2018 | Jaccard (Unseen) | 75 | TransVOS |
| Video Object Segmentation | YouTube-VOS 2018 | Mean Jaccard & F-Measure | 81.8 | TransVOS |
| Video Object Segmentation | YouTube-VOS 2018 | Jaccard (Seen) | 81.2 | SST |
| Video Object Segmentation | YouTube-VOS 2018 | Jaccard (Unseen) | 76 | SST |
| Video Object Segmentation | YouTube-VOS 2018 | Mean Jaccard & F-Measure | 81.7 | SST |
| Video Object Segmentation | YouTube-VOS 2018 | F-Measure (Seen) | 84.9 | LWL |
| Video Object Segmentation | YouTube-VOS 2018 | F-Measure (Unseen) | 84.4 | LWL |
| Video Object Segmentation | YouTube-VOS 2018 | Jaccard (Seen) | 80.4 | LWL |
| Video Object Segmentation | YouTube-VOS 2018 | Jaccard (Unseen) | 76.4 | LWL |
| Video Object Segmentation | YouTube-VOS 2018 | Mean Jaccard & F-Measure | 81.5 | LWL |
| Video Object Segmentation | YouTube-VOS 2018 | Jaccard (Unseen) | 75.3 | KMN |
| Video Object Segmentation | YouTube-VOS 2018 | F-Measure (Seen) | 84.2 | STM |
| Video Object Segmentation | YouTube-VOS 2018 | F-Measure (Unseen) | 80.9 | STM |
| Video Object Segmentation | YouTube-VOS 2018 | Jaccard (Seen) | 79.7 | STM |
| Video Object Segmentation | YouTube-VOS 2018 | Jaccard (Unseen) | 72.8 | STM |
| Video Object Segmentation | YouTube-VOS 2018 | Mean Jaccard & F-Measure | 79.4 | STM |
| Video Object Segmentation | YouTube-VOS 2018 | F-Measure (Seen) | 85.7 | RMN |
| Video Object Segmentation | YouTube-VOS 2018 | F-Measure (Unseen) | 82.4 | RMN |
| Video Object Segmentation | YouTube-VOS 2018 | Jaccard (Seen) | 82.1 | RMN |
| Video Object Segmentation | YouTube-VOS 2018 | Jaccard (Unseen) | 75.7 | RMN |
| Video Object Segmentation | DAVIS 2017 (val) | F-measure | 89.3 | BATMAN |
| Video Object Segmentation | DAVIS 2017 (val) | Mean Jaccard & F-Measure | 86.2 | BATMAN |
| Video Object Segmentation | DAVIS 2017 (val) | Jaccard | 82.2 | STCN |
| Video Object Segmentation | DAVIS 2017 (val) | Mean Jaccard & F-Measure | 85.4 | STCN |
| Video Object Segmentation | DAVIS 2017 (val) | F-measure | 87.5 | AOT |
| Video Object Segmentation | DAVIS 2017 (val) | Jaccard | 82.3 | AOT |
| Video Object Segmentation | DAVIS 2017 (val) | Mean Jaccard & F-Measure | 84.9 | AOT |
| Video Object Segmentation | DAVIS 2017 (val) | F-measure | 86.4 | TransVOS |
| Video Object Segmentation | DAVIS 2017 (val) | Jaccard | 81.4 | TransVOS |
| Video Object Segmentation | DAVIS 2017 (val) | Mean Jaccard & F-Measure | 83.9 | TransVOS |
| Video Object Segmentation | DAVIS 2017 (val) | F-measure | 86 | RMN |
| Video Object Segmentation | DAVIS 2017 (val) | Mean Jaccard & F-Measure | 83.5 | RMN |
| Video Object Segmentation | DAVIS 2017 (val) | F-measure | 85.1 | SST |
| Video Object Segmentation | DAVIS 2017 (val) | Jaccard | 79.9 | SST |
| Video Object Segmentation | DAVIS 2017 (val) | Mean Jaccard & F-Measure | 82.5 | SST |
| Video Object Segmentation | DAVIS 2017 (val) | F-measure | 84.5 | CFBI |
| Video Object Segmentation | DAVIS 2017 (val) | Jaccard | 79.3 | CFBI |
| Video Object Segmentation | DAVIS 2017 (val) | F-measure | 84.1 | LWL |
| Video Object Segmentation | DAVIS 2017 (val) | Jaccard | 79.1 | LWL |
| Video Object Segmentation | DAVIS 2017 (val) | Mean Jaccard & F-Measure | 81.6 | LWL |
| Video Object Segmentation | DAVIS 2017 (val) | F-measure | 86.5 | LCM |
| Video Object Segmentation | DAVIS 2017 (val) | Jaccard | 80.5 | LCM |
| Video Object Segmentation | YouTube-VOS 2018 | Jaccard (Unseen) | 75.3 | KMN |
| Video Object Segmentation | YouTube-VOS 2018 | F-Measure (Unseen) | 83.4 | CFBI |
| Semi-Supervised Video Object Segmentation | YouTube-VOS 2018 | Jaccard (Unseen) | 75.3 | KMN |
| Semi-Supervised Video Object Segmentation | YouTube-VOS 2018 | F-Measure (Unseen) | 83.4 | CFBI |
| Visual Object Tracking | YouTube-VOS 2018 | Jaccard (Unseen) | 75.7 | RMN |
| Visual Object Tracking | YouTube-VOS 2018 | Jaccard (Unseen) | 75.3 | KMN |
| Visual Object Tracking | YouTube-VOS 2018 | F-Measure (Seen) | 86.7 | TransVOS |
| Visual Object Tracking | YouTube-VOS 2018 | F-Measure (Unseen) | 83.4 | TransVOS |
| Visual Object Tracking | YouTube-VOS 2018 | F-Measure (Unseen) | 83.4 | CFBI |