TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/One Homography is All You Need: IMM-based Joint Homography...

One Homography is All You Need: IMM-based Joint Homography and Multiple Object State Estimation

Paul Johannes Claasen, Johan Pieter de Villiers

2024-09-04Motion CompensationMulti-Object TrackingAllNoise EstimationMultiple Object Tracking
PaperPDFCode(official)

Abstract

A novel online MOT algorithm, IMM Joint Homography State Estimation (IMM-JHSE), is proposed. IMM-JHSE uses an initial homography estimate as the only additional 3D information, whereas other 3D MOT methods use regular 3D measurements. By jointly modelling the homography matrix and its dynamics as part of track state vectors, IMM-JHSE removes the explicit influence of camera motion compensation techniques on predicted track position states, which was prevalent in previous approaches. Expanding upon this, static and dynamic camera motion models are combined using an IMM filter. A simple bounding box motion model is used to predict bounding box positions to incorporate image plane information. In addition to applying an IMM to camera motion, a non-standard IMM approach is applied where bounding-box-based BIoU scores are mixed with ground-plane-based Mahalanobis distances in an IMM-like fashion to perform association only, making IMM-JHSE robust to motion away from the ground plane. Finally, IMM-JHSE makes use of dynamic process and measurement noise estimation techniques. IMM-JHSE improves upon related techniques, including UCMCTrack, OC-SORT, C-BIoU and ByteTrack on the DanceTrack and KITTI-car datasets, increasing HOTA by 2.64 and 2.11, respectively, while offering competitive performance on the MOT17, MOT20 and KITTI-pedestrian datasets. Using publicly available detections, IMM-JHSE outperforms almost all other 2D MOT methods and is outperformed only by 3D MOT methods -- some of which are offline -- on the KITTI-car dataset. Compared to tracking-by-attention methods, IMM-JHSE shows remarkably similar performance on the DanceTrack dataset and outperforms them on the MOT17 dataset. The code is publicly available: https://github.com/Paulkie99/imm-jhse.

Results

TaskDatasetMetricValueModel
VideoKITTI Test (Online Methods)HOTA79.21IMM-JHSE
VideoKITTI Test (Online Methods)IDSW177IMM-JHSE
VideoKITTI Test (Online Methods)MOTA89.8IMM-JHSE
Multi-Object TrackingMOT20AssA61.56IMM-JHSE
Multi-Object TrackingMOT20HOTA60.87IMM-JHSE
Multi-Object TrackingMOT20IDF174.64IMM-JHSE
Multi-Object TrackingMOT20MOTA72.82IMM-JHSE
Multi-Object TrackingMOT17AssA65.65IMM-JHSE
Multi-Object TrackingMOT17HOTA64.9IMM-JHSE
Multi-Object TrackingMOT17IDF180.11IMM-JHSE
Multi-Object TrackingMOT17MOTA79.54IMM-JHSE
Multi-Object TrackingDanceTrackAssA55.41IMM-JHSE
Multi-Object TrackingDanceTrackHOTA66.24IMM-JHSE
Multi-Object TrackingDanceTrackIDF171.72IMM-JHSE
Multi-Object TrackingDanceTrackMOTA89.95IMM-JHSE
Object TrackingMOT20AssA61.56IMM-JHSE
Object TrackingMOT20HOTA60.87IMM-JHSE
Object TrackingMOT20IDF174.64IMM-JHSE
Object TrackingMOT20MOTA72.82IMM-JHSE
Object TrackingMOT17AssA65.65IMM-JHSE
Object TrackingMOT17HOTA64.9IMM-JHSE
Object TrackingMOT17IDF180.11IMM-JHSE
Object TrackingMOT17MOTA79.54IMM-JHSE
Object TrackingDanceTrackAssA55.41IMM-JHSE
Object TrackingDanceTrackHOTA66.24IMM-JHSE
Object TrackingDanceTrackIDF171.72IMM-JHSE
Object TrackingDanceTrackMOTA89.95IMM-JHSE
Object TrackingKITTI Test (Online Methods)HOTA79.21IMM-JHSE
Object TrackingKITTI Test (Online Methods)IDSW177IMM-JHSE
Object TrackingKITTI Test (Online Methods)MOTA89.8IMM-JHSE
Multiple Object TrackingKITTI Test (Online Methods)HOTA79.21IMM-JHSE
Multiple Object TrackingKITTI Test (Online Methods)IDSW177IMM-JHSE
Multiple Object TrackingKITTI Test (Online Methods)MOTA89.8IMM-JHSE

Related Papers

MVA 2025 Small Multi-Object Tracking for Spotting Birds Challenge: Dataset, Methods, and Results2025-07-17YOLOv8-SMOT: An Efficient and Robust Framework for Real-Time Small Object Tracking via Slice-Assisted Training and Adaptive Association2025-07-16Modeling Code: Is Text All You Need?2025-07-15All Eyes, no IMU: Learning Flight Attitude from Vision Alone2025-07-15Is Diversity All You Need for Scalable Robotic Manipulation?2025-07-08DESIGN AND IMPLEMENTATION OF ONLINE CLEARANCE REPORT.2025-07-07Is Reasoning All You Need? Probing Bias in the Age of Reasoning Language Models2025-07-03Prompt2SegCXR:Prompt to Segment All Organs and Diseases in Chest X-rays2025-07-01