TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/BURST: A Benchmark for Unifying Object Recognition, Segmen...

BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video

Ali Athar, Jonathon Luiten, Paul Voigtlaender, Tarasha Khurana, Achal Dave, Bastian Leibe, Deva Ramanan

2022-09-25Long-tail Video Object SegmentationMulti-Object Tracking and SegmentationMulti-Object TrackingObject RecognitionSegmentationSemantic SegmentationVideo Object SegmentationObject TrackingVideo Semantic Segmentation
PaperPDFCode(official)

Abstract

Multiple existing benchmarks involve tracking and segmenting objects in video e.g., Video Object Segmentation (VOS) and Multi-Object Tracking and Segmentation (MOTS), but there is little interaction between them due to the use of disparate benchmark datasets and metrics (e.g. J&F, mAP, sMOTSA). As a result, published works usually target a particular benchmark, and are not easily comparable to each another. We believe that the development of generalized methods that can tackle multiple tasks requires greater cohesion among these research sub-communities. In this paper, we aim to facilitate this by proposing BURST, a dataset which contains thousands of diverse videos with high-quality object masks, and an associated benchmark with six tasks involving object tracking and segmentation in video. All tasks are evaluated using the same data and comparable metrics, which enables researchers to consider them in unison, and hence, more effectively pool knowledge from different methods across different tasks. Additionally, we demonstrate several baselines for all tasks and show that approaches for one task can be applied to another with a quantifiable and explainable performance difference. Dataset annotations and evaluation code is available at: https://github.com/Ali2500/BURST-benchmark.

Results

TaskDatasetMetricValueModel
VideoBURST-valHOTA (all)8.2Box Tracker
VideoBURST-valHOTA (com)27Box Tracker
VideoBURST-valHOTA (unc)3.6Box Tracker
VideoBURST-valmAP (all)1.4Box Tracker
VideoBURST-valmAP (com)3Box Tracker
VideoBURST-valmAP (unc)0.9Box Tracker
VideoBURST-valHOTA (all)5.5STCN Tracker
VideoBURST-valHOTA (com)17.5STCN Tracker
VideoBURST-valHOTA (unc)2.5STCN Tracker
VideoBURST-valmAP (all)0.9STCN Tracker
VideoBURST-valmAP (com)0.7STCN Tracker
VideoBURST-valmAP (unc)0.6STCN Tracker
Video Object SegmentationBURST-valHOTA (all)8.2Box Tracker
Video Object SegmentationBURST-valHOTA (com)27Box Tracker
Video Object SegmentationBURST-valHOTA (unc)3.6Box Tracker
Video Object SegmentationBURST-valmAP (all)1.4Box Tracker
Video Object SegmentationBURST-valmAP (com)3Box Tracker
Video Object SegmentationBURST-valmAP (unc)0.9Box Tracker
Video Object SegmentationBURST-valHOTA (all)5.5STCN Tracker
Video Object SegmentationBURST-valHOTA (com)17.5STCN Tracker
Video Object SegmentationBURST-valHOTA (unc)2.5STCN Tracker
Video Object SegmentationBURST-valmAP (all)0.9STCN Tracker
Video Object SegmentationBURST-valmAP (com)0.7STCN Tracker
Video Object SegmentationBURST-valmAP (unc)0.6STCN Tracker

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21MVA 2025 Small Multi-Object Tracking for Spotting Birds Challenge: Dataset, Methods, and Results2025-07-17Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17