Metric: BLEU-3 (higher is better)
| # | Model↕ | BLEU-3▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | COOT (ae-test split) - Only Appearance features | 17.43 | No | COOT: Cooperative Hierarchical Transformer for V... | 2020-11-01 | Code |
| 2 | TSP | 4.16 | No | TSP: Temporally-Sensitive Pretraining of Video E... | 2020-11-23 | Code |
| 3 | BMT | 3.84 | No | A Better Use of Audio-Visual Cues: Dense Video C... | 2020-05-17 | Code |
| 4 | iPerceive (Chadha et al., 2020) | 2.93 | No | iPerceive: Applying Common-Sense Reasoning to Mu... | 2020-11-16 | - |
| 5 | MDVC | 2.6 | No | Multi-modal Dense Video Captioning | 2020-03-17 | Code |