TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/CoTracker: It is Better to Track Together

CoTracker: It is Better to Track Together

Nikita Karaev, Ignacio Rocco, Benjamin Graham, Natalia Neverova, Andrea Vedaldi, Christian Rupprecht

2023-07-14Point TrackingOptical Flow Estimationmotion predictionObject Tracking
PaperPDFCodeCode(official)

Abstract

We introduce CoTracker, a transformer-based model that tracks a large number of 2D points in long video sequences. Differently from most existing approaches that track points independently, CoTracker tracks them jointly, accounting for their dependencies. We show that joint tracking significantly improves tracking accuracy and robustness, and allows CoTracker to track occluded points and points outside of the camera view. We also introduce several innovations for this class of trackers, including using token proxies that significantly improve memory efficiency and allow CoTracker to track 70k points jointly and simultaneously at inference on a single GPU. CoTracker is an online algorithm that operates causally on short windows. However, it is trained utilizing unrolled windows as a recurrent network, maintaining tracks for long periods of time even when points are occluded or leave the field of view. Quantitatively, CoTracker substantially outperforms prior trackers on standard point-tracking benchmarks.

Results

TaskDatasetMetricValueModel
Visual TrackingTAP-Vid-DAVIS-FirstAverage Jaccard62.2CoTracker
Visual TrackingTAP-Vid-DAVIS-FirstAverage PCK75.7CoTracker
Visual TrackingTAP-Vid-DAVIS-FirstOcclusion Accuracy89.3CoTracker
Visual TrackingTAP-Vid-DAVISAverage Jaccard65.9CoTracker
Visual TrackingTAP-Vid-DAVISAverage PCK79.4CoTracker
Visual TrackingTAP-Vid-DAVISOcclusion Accuracy89.9CoTracker
Visual TrackingTAP-Vid-Kinetics-FirstAverage Jaccard48.8CoTracker
Visual TrackingTAP-Vid-Kinetics-FirstAverage PCK64.5CoTracker
Visual TrackingTAP-Vid-Kinetics-FirstOcclusion Accuracy85.8CoTracker
Point TrackingTAP-Vid-DAVIS-FirstAverage Jaccard62.2CoTracker
Point TrackingTAP-Vid-DAVIS-FirstAverage PCK75.7CoTracker
Point TrackingTAP-Vid-DAVIS-FirstOcclusion Accuracy89.3CoTracker
Point TrackingTAP-Vid-DAVISAverage Jaccard65.9CoTracker
Point TrackingTAP-Vid-DAVISAverage PCK79.4CoTracker
Point TrackingTAP-Vid-DAVISOcclusion Accuracy89.9CoTracker
Point TrackingTAP-Vid-Kinetics-FirstAverage Jaccard48.8CoTracker
Point TrackingTAP-Vid-Kinetics-FirstAverage PCK64.5CoTracker
Point TrackingTAP-Vid-Kinetics-FirstOcclusion Accuracy85.8CoTracker

Related Papers

Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17MVA 2025 Small Multi-Object Tracking for Spotting Birds Challenge: Dataset, Methods, and Results2025-07-17Integrated Switched Capacitor Array and Synchronous Charge Extraction with Adaptive Hybrid MPPT for Piezoelectric Harvesters2025-07-16SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16YOLOv8-SMOT: An Efficient and Robust Framework for Real-Time Small Object Tracking via Slice-Assisted Training and Adaptive Association2025-07-16CharaConsist: Fine-Grained Consistent Character Generation2025-07-15MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second2025-07-14An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan2025-07-11