Metric: R@50,IoU=0.3 (higher is better)
| # | Model↕ | R@50,IoU=0.3▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | VLG-Net + Guidance Model | 39.77 | No | Localizing Moments in Long Video Via Multimodal ... | 2023-02-26 | Code |
| 2 | VLG-Net | 33.68 | No | MAD: A Scalable Dataset for Language Grounding i... | 2021-12-01 | Code |
| 3 | Zero-Shot CLIP + Guidance Model | 32.23 | No | Localizing Moments in Long Video Via Multimodal ... | 2023-02-26 | Code |
| 4 | CLIP | 28.71 | No | MAD: A Scalable Dataset for Language Grounding i... | 2021-12-01 | Code |
| 5 | Random Chance | 1.92 | No | MAD: A Scalable Dataset for Language Grounding i... | 2021-12-01 | Code |