Zero-Shot Video Retrieval on MSR-VTT
Metric: text-to-video Mean Rank (lower is better; the table below is sorted in descending order)
Results
| # | Model | text-to-video Mean Rank | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | MMT | 148.1 | Yes | Multi-modal Transformer for Video Retrieval | 2020-07-21 | Code |
| 2 | CLIP4Clip | 34 | No | CLIP4Clip: An Empirical Study of CLIP for End to... | 2021-04-18 | Code |
| 3 | MIL-NCE | 29.5 | No | End-to-End Learning of Visual Representations fr... | 2019-12-13 | Code |
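
For readers unfamiliar with the metric, here is a minimal sketch of how text-to-video Mean Rank is commonly computed. It is not taken from any of the listed papers. The function name `text_to_video_mean_rank` and the input layout are assumptions made for illustration: `sim[i][j]` is a similarity score between text query `i` and video `j`, and the correct video for query `i` is assumed to be video `i`.

```python
def text_to_video_mean_rank(sim):
    """Average rank of the ground-truth video over all text queries.

    Assumption: sim is a square list of lists where sim[i][j] scores
    text query i against video j, and video i is the correct match.
    """
    ranks = []
    for i, row in enumerate(sim):
        gt_score = row[i]
        # Rank 1 means no other video scored strictly higher than the match.
        rank = 1 + sum(1 for score in row if score > gt_score)
        ranks.append(rank)
    return sum(ranks) / len(ranks)


# Hypothetical scores for three queries and three videos.
example_sim = [
    [0.9, 0.1, 0.2],
    [0.8, 0.3, 0.5],
    [0.1, 0.2, 0.7],
]
mean_rank = text_to_video_mean_rank(example_sim)
```

A perfect retriever places every ground-truth video first and scores 1.0, which is why smaller values indicate better retrieval.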