Metric: Top-1 accuracy % (higher is better)
| # | Model | Top-1 accuracy % | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | XKD (ViT-B/112/16) | 77.6 | No | XKD: Cross-modal Knowledge Distillation with Dom... | 2022-11-25 | Code |
| 2 | CVRL (R3D-152 2x; K600 pretrain) | 71.6 | Yes | Spatiotemporal Contrastive Video Representation ... | 2020-08-09 | Code |
| 3 | CVRL (R3D-101) | 67.6 | No | Spatiotemporal Contrastive Video Representation ... | 2020-08-09 | Code |
| 4 | CVRL (R3D-50) | 66.1 | No | Spatiotemporal Contrastive Video Representation ... | 2020-08-09 | Code |