TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Multiple Object Tracking as ID Prediction

Multiple Object Tracking as ID Prediction

Ruopeng Gao, Yijun Zhang, LiMin Wang

2024-03-25CVPR 2025 1Multi-Object TrackingPredictionObject TrackingMultiple Object Trackingobject-detectionObject Detection
PaperPDFCode(official)

Abstract

In Multiple Object Tracking (MOT), tracking-by-detection methods have stood the test for a long time, which split the process into two parts according to the definition: object detection and association. They leverage robust single-frame detectors and treat object association as a post-processing step through hand-crafted heuristic algorithms and surrogate tasks. However, the nature of heuristic techniques prevents end-to-end exploitation of training data, leading to increasingly cumbersome and challenging manual modification while facing complicated or novel scenarios. In this paper, we regard this object association task as an End-to-End in-context ID prediction problem and propose a streamlined baseline called MOTIP. Specifically, we form the target embeddings into historical trajectory information while considering the corresponding IDs as in-context prompts, then directly predict the ID labels for the objects in the current frame. Thanks to this end-to-end process, MOTIP can learn tracking capabilities straight from training data, freeing itself from burdensome hand-crafted algorithms. Without bells and whistles, our method achieves impressive state-of-the-art performance in complex scenarios like DanceTrack and SportsMOT, and it performs competitively with other transformer-based methods on MOT17. We believe that MOTIP demonstrates remarkable potential and can serve as a starting point for future research. The code is available at https://github.com/MCG-NJU/MOTIP.

Results

TaskDatasetMetricValueModel
VideoSportsMOTAssA65.4MOTIP (Deformable DETR, with SportsMOT val)
VideoSportsMOTDetA86.5MOTIP (Deformable DETR, with SportsMOT val)
VideoSportsMOTHOTA75.2MOTIP (Deformable DETR, with SportsMOT val)
VideoSportsMOTIDF178.2MOTIP (Deformable DETR, with SportsMOT val)
VideoSportsMOTMOTA96.1MOTIP (Deformable DETR, with SportsMOT val)
VideoSportsMOTAssA62MOTIP (Deformable DETR)
VideoSportsMOTDetA83.4MOTIP (Deformable DETR)
VideoSportsMOTHOTA71.9MOTIP (Deformable DETR)
VideoSportsMOTIDF175MOTIP (Deformable DETR)
VideoSportsMOTMOTA92.9MOTIP (Deformable DETR)
Multi-Object TrackingMOT17HOTA59.2MOTIP (Deformable-DETR)
Multi-Object TrackingDanceTrackAssA65.9MOTIP (Deformable DETR, with DanceTrack val and CrowdHuman)
Multi-Object TrackingDanceTrackDetA82.6MOTIP (Deformable DETR, with DanceTrack val and CrowdHuman)
Multi-Object TrackingDanceTrackHOTA73.7MOTIP (Deformable DETR, with DanceTrack val and CrowdHuman)
Multi-Object TrackingDanceTrackIDF178.4MOTIP (Deformable DETR, with DanceTrack val and CrowdHuman)
Multi-Object TrackingDanceTrackMOTA92.7MOTIP (Deformable DETR, with DanceTrack val and CrowdHuman)
Multi-Object TrackingDanceTrackAssA62.8MOTIP (Deformable DETR, with CrowdHuman)
Multi-Object TrackingDanceTrackDetA81.3MOTIP (Deformable DETR, with CrowdHuman)
Multi-Object TrackingDanceTrackHOTA71.4MOTIP (Deformable DETR, with CrowdHuman)
Multi-Object TrackingDanceTrackIDF176.3MOTIP (Deformable DETR, with CrowdHuman)
Multi-Object TrackingDanceTrackMOTA91.6MOTIP (Deformable DETR, with CrowdHuman)
Multi-Object TrackingDanceTrackAssA60.8MOTIP (DAB-Deformable DETR)
Multi-Object TrackingDanceTrackDetA80.8MOTIP (DAB-Deformable DETR)
Multi-Object TrackingDanceTrackHOTA70MOTIP (DAB-Deformable DETR)
Multi-Object TrackingDanceTrackIDF175.1MOTIP (DAB-Deformable DETR)
Multi-Object TrackingDanceTrackMOTA91MOTIP (DAB-Deformable DETR)
Multi-Object TrackingDanceTrackAssA57.6MOTIP (Deformable DETR)
Multi-Object TrackingDanceTrackDetA79.4MOTIP (Deformable DETR)
Multi-Object TrackingDanceTrackHOTA67.5MOTIP (Deformable DETR)
Multi-Object TrackingDanceTrackIDF172.2MOTIP (Deformable DETR)
Multi-Object TrackingDanceTrackMOTA90.3MOTIP (Deformable DETR)
Object TrackingMOT17HOTA59.2MOTIP (Deformable-DETR)
Object TrackingDanceTrackAssA65.9MOTIP (Deformable DETR, with DanceTrack val and CrowdHuman)
Object TrackingDanceTrackDetA82.6MOTIP (Deformable DETR, with DanceTrack val and CrowdHuman)
Object TrackingDanceTrackHOTA73.7MOTIP (Deformable DETR, with DanceTrack val and CrowdHuman)
Object TrackingDanceTrackIDF178.4MOTIP (Deformable DETR, with DanceTrack val and CrowdHuman)
Object TrackingDanceTrackMOTA92.7MOTIP (Deformable DETR, with DanceTrack val and CrowdHuman)
Object TrackingDanceTrackAssA62.8MOTIP (Deformable DETR, with CrowdHuman)
Object TrackingDanceTrackDetA81.3MOTIP (Deformable DETR, with CrowdHuman)
Object TrackingDanceTrackHOTA71.4MOTIP (Deformable DETR, with CrowdHuman)
Object TrackingDanceTrackIDF176.3MOTIP (Deformable DETR, with CrowdHuman)
Object TrackingDanceTrackMOTA91.6MOTIP (Deformable DETR, with CrowdHuman)
Object TrackingDanceTrackAssA60.8MOTIP (DAB-Deformable DETR)
Object TrackingDanceTrackDetA80.8MOTIP (DAB-Deformable DETR)
Object TrackingDanceTrackHOTA70MOTIP (DAB-Deformable DETR)
Object TrackingDanceTrackIDF175.1MOTIP (DAB-Deformable DETR)
Object TrackingDanceTrackMOTA91MOTIP (DAB-Deformable DETR)
Object TrackingDanceTrackAssA57.6MOTIP (Deformable DETR)
Object TrackingDanceTrackDetA79.4MOTIP (Deformable DETR)
Object TrackingDanceTrackHOTA67.5MOTIP (Deformable DETR)
Object TrackingDanceTrackIDF172.2MOTIP (Deformable DETR)
Object TrackingDanceTrackMOTA90.3MOTIP (Deformable DETR)
Object TrackingSportsMOTAssA65.4MOTIP (Deformable DETR, with SportsMOT val)
Object TrackingSportsMOTDetA86.5MOTIP (Deformable DETR, with SportsMOT val)
Object TrackingSportsMOTHOTA75.2MOTIP (Deformable DETR, with SportsMOT val)
Object TrackingSportsMOTIDF178.2MOTIP (Deformable DETR, with SportsMOT val)
Object TrackingSportsMOTMOTA96.1MOTIP (Deformable DETR, with SportsMOT val)
Object TrackingSportsMOTAssA62MOTIP (Deformable DETR)
Object TrackingSportsMOTDetA83.4MOTIP (Deformable DETR)
Object TrackingSportsMOTHOTA71.9MOTIP (Deformable DETR)
Object TrackingSportsMOTIDF175MOTIP (Deformable DETR)
Object TrackingSportsMOTMOTA92.9MOTIP (Deformable DETR)
Multiple Object TrackingSportsMOTAssA65.4MOTIP (Deformable DETR, with SportsMOT val)
Multiple Object TrackingSportsMOTDetA86.5MOTIP (Deformable DETR, with SportsMOT val)
Multiple Object TrackingSportsMOTHOTA75.2MOTIP (Deformable DETR, with SportsMOT val)
Multiple Object TrackingSportsMOTIDF178.2MOTIP (Deformable DETR, with SportsMOT val)
Multiple Object TrackingSportsMOTMOTA96.1MOTIP (Deformable DETR, with SportsMOT val)
Multiple Object TrackingSportsMOTAssA62MOTIP (Deformable DETR)
Multiple Object TrackingSportsMOTDetA83.4MOTIP (Deformable DETR)
Multiple Object TrackingSportsMOTHOTA71.9MOTIP (Deformable DETR)
Multiple Object TrackingSportsMOTIDF175MOTIP (Deformable DETR)
Multiple Object TrackingSportsMOTMOTA92.9MOTIP (Deformable DETR)

Related Papers

Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction2025-07-21MVA 2025 Small Multi-Object Tracking for Spotting Birds Challenge: Dataset, Methods, and Results2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17YOLOv8-SMOT: An Efficient and Robust Framework for Real-Time Small Object Tracking via Slice-Assisted Training and Adaptive Association2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16