Metric: R@10,IoU=0.5 (higher is better)
| # | Model | R@10,IoU=0.5 | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | VLG-Net + Guidance Model | 13.72 | No | Localizing Moments in Long Video Via Multimodal ... | 2023-02-26 | Code |
| 2 | Zero-Shot CLIP + Guidance Model | 11.09 | No | Localizing Moments in Long Video Via Multimodal ... | 2023-02-26 | Code |
| 3 | VLG-Net | 10.18 | No | MAD: A Scalable Dataset for Language Grounding i... | 2021-12-01 | Code |
| 4 | CLIP | 8.38 | No | MAD: A Scalable Dataset for Language Grounding i... | 2021-12-01 | Code |
| 5 | Random Chance | 0.14 | No | MAD: A Scalable Dataset for Language Grounding i... | 2021-12-01 | Code |
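
For readers unfamiliar with the metric, the following is a minimal sketch of how R@10,IoU=0.5 is commonly computed for moment localization. It is not the benchmark's official evaluation code. The function names `temporal_iou` and `recall_at_k` are hypothetical. The sketch assumes one ground-truth segment per query, predictions already ranked by confidence, a hit when IoU is at least the threshold, and scores reported as percentages.

```python
from typing import Sequence, Tuple

# A temporal segment as (start_seconds, end_seconds).
Segment = Tuple[float, float]


def temporal_iou(a: Segment, b: Segment) -> float:
    """Intersection over union of two 1-D time segments."""
    inter = max(0.0, min(a[1], b[1]) - max(a[0], b[0]))
    union = (a[1] - a[0]) + (b[1] - b[0]) - inter
    return inter / union if union > 0 else 0.0


def recall_at_k(
    predictions: Sequence[Sequence[Segment]],
    ground_truths: Sequence[Segment],
    k: int = 10,
    iou_threshold: float = 0.5,
) -> float:
    """Percentage of queries with at least one top-k prediction
    whose IoU with the ground truth meets the threshold.

    Assumption: `predictions[i]` is sorted best-first for query i.
    """
    if not ground_truths:
        return 0.0
    hits = 0
    for ranked, gt in zip(predictions, ground_truths):
        # Only the k highest-ranked proposals count toward a hit.
        if any(temporal_iou(p, gt) >= iou_threshold for p in ranked[:k]):
            hits += 1
    return 100.0 * hits / len(ground_truths)


# Toy usage with made-up segments (not data from the leaderboard).
toy_predictions = [[(0.0, 10.0)], [(100.0, 110.0)]]
toy_ground_truths = [(2.0, 10.0), (0.0, 5.0)]
toy_score = recall_at_k(toy_predictions, toy_ground_truths, k=10)
```

In the toy example the first query is a hit (IoU 0.8) and the second is a miss (no overlap), so `toy_score` is 50.0.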