TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Associate Everything Detected: Facilitating Tracking-by-De...

Associate Everything Detected: Facilitating Tracking-by-Detection to the Unknown

Zimeng Fang, Chao Liang, Xue Zhou, Shuyuan Zhu, Xi Li

2024-09-14Multi-Object TrackingObject TrackingMultiple Object TrackingVideo Object Tracking
PaperPDFCode(official)

Abstract

Multi-object tracking (MOT) emerges as a pivotal and highly promising branch in the field of computer vision. Classical closed-vocabulary MOT (CV-MOT) methods aim to track objects of predefined categories. Recently, some open-vocabulary MOT (OV-MOT) methods have successfully addressed the problem of tracking unknown categories. However, we found that the CV-MOT and OV-MOT methods each struggle to excel in the tasks of the other. In this paper, we present a unified framework, Associate Everything Detected (AED), that simultaneously tackles CV-MOT and OV-MOT by integrating with any off-the-shelf detector and supports unknown categories. Different from existing tracking-by-detection MOT methods, AED gets rid of prior knowledge (e.g. motion cues) and relies solely on highly robust feature learning to handle complex trajectories in OV-MOT tasks while keeping excellent performance in CV-MOT tasks. Specifically, we model the association task as a similarity decoding problem and propose a sim-decoder with an association-centric learning mechanism. The sim-decoder calculates similarities in three aspects: spatial, temporal, and cross-clip. Subsequently, association-centric learning leverages these threefold similarities to ensure that the extracted features are appropriate for continuous tracking and robust enough to generalize to unknown categories. Compared with existing powerful OV-MOT and CV-MOT methods, AED achieves superior performance on TAO, SportsMOT, and DanceTrack without any prior knowledge. Our code is available at https://github.com/balabooooo/AED.

Results

TaskDatasetMetricValueModel
VideoSportsMOTAssA70.1AED
VideoSportsMOTDetA89.4AED
VideoSportsMOTHOTA79.1AED
VideoSportsMOTIDF181.8AED
VideoSportsMOTMOTA97.1AED
Multi-Object TrackingTAOAssocA52.4AED (Co-DETR)
Multi-Object TrackingTAOClsA41.7AED (Co-DETR)
Multi-Object TrackingTAOLocA71.8AED (Co-DETR)
Multi-Object TrackingTAOTETA55.3AED (Co-DETR)
Multi-Object TrackingTAOAssocA38.1AED (RegionCLIP)
Multi-Object TrackingTAOClsA16.2AED (RegionCLIP)
Multi-Object TrackingTAOLocA56.7AED (RegionCLIP)
Multi-Object TrackingTAOTETA37AED (RegionCLIP)
Multi-Object TrackingDanceTrackAssA54.3AED
Multi-Object TrackingDanceTrackDetA82AED
Multi-Object TrackingDanceTrackHOTA66.6AED
Multi-Object TrackingDanceTrackIDF169.7AED
Multi-Object TrackingDanceTrackMOTA92.2AED
Multi-Object TrackingSportsMOTAssA70.1AED
Multi-Object TrackingSportsMOTDetA89.4AED
Multi-Object TrackingSportsMOTHOTA79.1AED
Multi-Object TrackingSportsMOTIDF181.8AED
Multi-Object TrackingSportsMOTMOTA97.1AED
Object TrackingTAOAssocA52.4AED (Co-DETR)
Object TrackingTAOClsA41.7AED (Co-DETR)
Object TrackingTAOLocA71.8AED (Co-DETR)
Object TrackingTAOTETA55.3AED (Co-DETR)
Object TrackingTAOAssocA38.1AED (RegionCLIP)
Object TrackingTAOClsA16.2AED (RegionCLIP)
Object TrackingTAOLocA56.7AED (RegionCLIP)
Object TrackingTAOTETA37AED (RegionCLIP)
Object TrackingDanceTrackAssA54.3AED
Object TrackingDanceTrackDetA82AED
Object TrackingDanceTrackHOTA66.6AED
Object TrackingDanceTrackIDF169.7AED
Object TrackingDanceTrackMOTA92.2AED
Object TrackingSportsMOTAssA70.1AED
Object TrackingSportsMOTDetA89.4AED
Object TrackingSportsMOTHOTA79.1AED
Object TrackingSportsMOTIDF181.8AED
Object TrackingSportsMOTMOTA97.1AED
Object TrackingSportsMOTAssA70.1AED
Object TrackingSportsMOTDetA89.4AED
Object TrackingSportsMOTHOTA79.1AED
Object TrackingSportsMOTIDF181.8AED
Object TrackingSportsMOTMOTA97.1AED
Multiple Object TrackingSportsMOTAssA70.1AED
Multiple Object TrackingSportsMOTDetA89.4AED
Multiple Object TrackingSportsMOTHOTA79.1AED
Multiple Object TrackingSportsMOTIDF181.8AED
Multiple Object TrackingSportsMOTMOTA97.1AED

Related Papers

MVA 2025 Small Multi-Object Tracking for Spotting Birds Challenge: Dataset, Methods, and Results2025-07-17YOLOv8-SMOT: An Efficient and Robust Framework for Real-Time Small Object Tracking via Slice-Assisted Training and Adaptive Association2025-07-16HiM2SAM: Enhancing SAM2 with Hierarchical Motion Estimation and Memory Optimization towards Long-term Tracking2025-07-10Robustifying 3D Perception through Least-Squares Multi-Agent Graphs Object Tracking2025-07-07UMDATrack: Unified Multi-Domain Adaptive Tracking Under Adverse Weather Conditions2025-07-01Mamba-FETrack V2: Revisiting State Space Model for Frame-Event based Visual Object Tracking2025-06-30Visual and Memory Dual Adapter for Multi-Modal Object Tracking2025-06-30R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement Learning2025-06-27