Metric: val mAP (higher is better)
| # | Model↕ | val mAP▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | VideoMAE V2-g | 42.6 | Yes | VideoMAE V2: Scaling Video Masked Autoencoders w... | 2023-03-29 | Code |
| 2 | STAR/L | 41.7 | Yes | End-to-End Spatio-Temporal Action Localisation w... | 2023-04-24 | - |
| 3 | InternVideo | 41.01 | Yes | InternVideo: General Video Foundation Models via... | 2022-12-06 | Code |
| 4 | RM (multi-scale, ensemble) | 40.52 | Yes | Relation Modeling in Spatio-Temporal Action Loca... | 2021-06-15 | - |
| 5 | ACAR (multi-scale, ensemble) | 40.49 | Yes | Actor-Context-Actor Relation Network for Spatio-... | 2020-06-14 | Code |
| 6 | RM (multi-scale, ir-CSN-152) | 37.95 | No | Relation Modeling in Spatio-Temporal Action Loca... | 2021-06-15 | - |
| 7 | ACAR (multi-scale, R-101, 8 × 8) | 36.36 | No | Actor-Context-Actor Relation Network for Spatio-... | 2020-06-14 | Code |