Metric: Frame accuracy (higher is better)
| # | Model↕ | Frame accuracy▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | UnLoc-L | 72.8 | No | UnLoc: A Unified Framework for Video Localizatio... | 2023-08-21 | Code |
| 2 | Univl | 70 | Yes | UniVL: A Unified Video and Language Pre-Training... | 2020-02-15 | Code |
| 3 | Norton | 69.8 | Yes | Multi-granularity Correspondence Learning from L... | 2024-01-30 | Code |
| 4 | VideoClip | 68.7 | Yes | VideoCLIP: Contrastive Pre-training for Zero-sho... | 2021-09-28 | Code |
| 5 | VLM | 68.4 | Yes | VLM: Task-agnostic Video-Language Model Pre-trai... | 2021-05-20 | Code |
| 6 | TACo | 68.4 | No | TACo: Token-aware Cascade Contrastive Learning f... | 2021-08-23 | - |
| 7 | MIL-NCE | 61 | No | End-to-End Learning of Visual Representations fr... | 2019-12-13 | Code |
| 8 | ActBERT | 57 | No | ActBERT: Learning Global-Local Video-Text Repres... | 2020-11-14 | Code |
| 9 | CBT | 53.9 | No | End-to-End Learning of Visual Representations fr... | 2019-12-13 | Code |