Pierre Baqué, François Fleuret, Pascal Fua
People detection in single 2D images has improved greatly in recent years. However, comparatively little of this progress has percolated into multi-camera multi-people tracking algorithms, whose performance still degrades severely when scenes become very crowded. In this work, we introduce a new architecture that combines Convolutional Neural Nets and Conditional Random Fields to explicitly model those ambiguities. One of its key ingredients are high-order CRF terms that model potential occlusions and give our approach its robustness even when many people are present. Our model is trained end-to-end and we show that it outperforms several state-of-art algorithms on challenging scenes.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Object Detection | Wildtrack | MODA | 74.1 | Deep-Occlusion |
| Object Detection | Wildtrack | MODP | 53.8 | Deep-Occlusion |
| Object Detection | MultiviewX | MODA | 75.2 | Deep-Occulsion |
| Object Detection | MultiviewX | MODP | 54.7 | Deep-Occulsion |
| 3D | Wildtrack | MODA | 74.1 | Deep-Occlusion |
| 3D | Wildtrack | MODP | 53.8 | Deep-Occlusion |
| 3D | MultiviewX | MODA | 75.2 | Deep-Occulsion |
| 3D | MultiviewX | MODP | 54.7 | Deep-Occulsion |
| 3D Object Detection | Wildtrack | MODA | 74.1 | Deep-Occlusion |
| 3D Object Detection | Wildtrack | MODP | 53.8 | Deep-Occlusion |
| 3D Object Detection | MultiviewX | MODA | 75.2 | Deep-Occulsion |
| 3D Object Detection | MultiviewX | MODP | 54.7 | Deep-Occulsion |
| 2D Classification | Wildtrack | MODA | 74.1 | Deep-Occlusion |
| 2D Classification | Wildtrack | MODP | 53.8 | Deep-Occlusion |
| 2D Classification | MultiviewX | MODA | 75.2 | Deep-Occulsion |
| 2D Classification | MultiviewX | MODP | 54.7 | Deep-Occulsion |
| 2D Object Detection | Wildtrack | MODA | 74.1 | Deep-Occlusion |
| 2D Object Detection | Wildtrack | MODP | 53.8 | Deep-Occlusion |
| 2D Object Detection | MultiviewX | MODA | 75.2 | Deep-Occulsion |
| 2D Object Detection | MultiviewX | MODP | 54.7 | Deep-Occulsion |
| 16k | Wildtrack | MODA | 74.1 | Deep-Occlusion |
| 16k | Wildtrack | MODP | 53.8 | Deep-Occlusion |
| 16k | MultiviewX | MODA | 75.2 | Deep-Occulsion |
| 16k | MultiviewX | MODP | 54.7 | Deep-Occulsion |