Video Captioning on TVC
Metric: CIDEr (higher is better)
Results
| # | Model | CIDEr | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | VAST | 74.1 | Yes | VAST: A Vision-Audio-Subtitle-Text Omni-Modality... | 2023-05-29 | Code |
| 2 | COSA | 70.7 | Yes | COSA: Concatenated Sample Pretrained Vision-Lang... | 2023-06-15 | Code |