TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Local All-Pair Correspondence for Point Tracking

Local All-Pair Correspondence for Point Tracking

Seokju Cho, Jiahui Huang, Jisu Nam, Honggyu An, Seungryong Kim, Joon-Young Lee

2024-07-22Visual TrackingPoint TrackingAll
PaperPDFCodeCode(official)

Abstract

We introduce LocoTrack, a highly accurate and efficient model designed for the task of tracking any point (TAP) across video sequences. Previous approaches in this task often rely on local 2D correlation maps to establish correspondences from a point in the query image to a local region in the target image, which often struggle with homogeneous regions or repetitive features, leading to matching ambiguities. LocoTrack overcomes this challenge with a novel approach that utilizes all-pair correspondences across regions, i.e., local 4D correlation, to establish precise correspondences, with bidirectional correspondence and matching smoothness significantly enhancing robustness against ambiguities. We also incorporate a lightweight correlation encoder to enhance computational efficiency, and a compact Transformer architecture to integrate long-term temporal information. LocoTrack achieves unmatched accuracy on all TAP-Vid benchmarks and operates at a speed almost 6 times faster than the current state-of-the-art.

Results

TaskDatasetMetricValueModel
Visual TrackingTAP-Vid-DAVIS-FirstAverage Jaccard64.8LocoTrack-B
Visual TrackingTAP-Vid-DAVIS-FirstAverage PCK77.4LocoTrack-B
Visual TrackingTAP-Vid-DAVIS-FirstOcclusion Accuracy86.2LocoTrack-B
Visual TrackingTAP-Vid-KineticsAverage Jaccard59.1LocoTrack-B
Visual TrackingTAP-Vid-KineticsAverage PCK72.5LocoTrack-B
Visual TrackingTAP-Vid-KineticsOcclusion Accuracy85.7LocoTrack-B
Visual TrackingTAP-Vid-DAVISAverage Jaccard69.4LocoTrack-B
Visual TrackingTAP-Vid-DAVISAverage PCK81.3LocoTrack-B
Visual TrackingTAP-Vid-DAVISOcclusion Accuracy88.6LocoTrack-B
Visual TrackingTAP-Vid-Kinetics-FirstAverage Jaccard52.3LocoTrack-B
Visual TrackingTAP-Vid-Kinetics-FirstAverage PCK66.4LocoTrack-B
Visual TrackingTAP-Vid-Kinetics-FirstOcclusion Accuracy82.1LocoTrack-B
Visual TrackingTAP-Vid-RGB-StackingAverage Jaccard70.8LocoTrack-B
Visual TrackingTAP-Vid-RGB-StackingAverage PCK83.2LocoTrack-B
Visual TrackingTAP-Vid-RGB-StackingOcclusion Accuracy84.1LocoTrack-B
Point TrackingTAP-Vid-DAVIS-FirstAverage Jaccard64.8LocoTrack-B
Point TrackingTAP-Vid-DAVIS-FirstAverage PCK77.4LocoTrack-B
Point TrackingTAP-Vid-DAVIS-FirstOcclusion Accuracy86.2LocoTrack-B
Point TrackingTAP-Vid-KineticsAverage Jaccard59.1LocoTrack-B
Point TrackingTAP-Vid-KineticsAverage PCK72.5LocoTrack-B
Point TrackingTAP-Vid-KineticsOcclusion Accuracy85.7LocoTrack-B
Point TrackingTAP-Vid-DAVISAverage Jaccard69.4LocoTrack-B
Point TrackingTAP-Vid-DAVISAverage PCK81.3LocoTrack-B
Point TrackingTAP-Vid-DAVISOcclusion Accuracy88.6LocoTrack-B
Point TrackingTAP-Vid-Kinetics-FirstAverage Jaccard52.3LocoTrack-B
Point TrackingTAP-Vid-Kinetics-FirstAverage PCK66.4LocoTrack-B
Point TrackingTAP-Vid-Kinetics-FirstOcclusion Accuracy82.1LocoTrack-B
Point TrackingTAP-Vid-RGB-StackingAverage Jaccard70.8LocoTrack-B
Point TrackingTAP-Vid-RGB-StackingAverage PCK83.2LocoTrack-B
Point TrackingTAP-Vid-RGB-StackingOcclusion Accuracy84.1LocoTrack-B

Related Papers

Integrated Switched Capacitor Array and Synchronous Charge Extraction with Adaptive Hybrid MPPT for Piezoelectric Harvesters2025-07-16SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16CharaConsist: Fine-Grained Consistent Character Generation2025-07-15Modeling Code: Is Text All You Need?2025-07-15All Eyes, no IMU: Learning Flight Attitude from Vision Alone2025-07-15MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second2025-07-14What You Have is What You Track: Adaptive and Robust Multimodal Tracking2025-07-08Learning to Track Any Points from Human Motion2025-07-08