Metric: Top-1 Accuracy (higher is better)
| # | Model↕ | Top-1 Accuracy▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | Florence | 87.8 | No | Florence: A New Foundation Model for Computer Vi... | 2021-11-22 | Code |
| 2 | CVRL (R3D-152 2x) | 72.9 | No | Spatiotemporal Contrastive Video Representation ... | 2020-08-09 | Code |
| 3 | CVRL (R3D-101) | 71.6 | No | Spatiotemporal Contrastive Video Representation ... | 2020-08-09 | Code |
| 4 | BraVe:V-FA (TSM-50x2) | 71.4 | No | Broaden Your Views for Self-Supervised Video Lea... | 2021-03-30 | Code |
| 5 | CVRL (R3D-50) | 70.4 | No | Spatiotemporal Contrastive Video Representation ... | 2020-08-09 | Code |
| 6 | MMV | 55.5 | No | Self-Supervised MultiModal Versatile Networks | 2020-06-29 | Code |