Video Captioning on ViTT

Metric: METEOR (higher is better)

LeaderboardDataset
Loading chart...
#ModelMETEORExtra DataPaperDateCode
1HiCM²9.6YesHiCM$^2$: Hierarchical Compact Memory Modeling f...2024-12-19Code
2Vid2Seq (VidChapters-7M PT)9.5Yes---
3Vid2Seq8.5YesVid2Seq: Large-Scale Pretraining of a Visual Lan...2023-02-27Code
4E2ESG8.1YesEnd-to-end Dense Video Captioning as Sequence Ge...2022-04-18-
5Vid2Seq (VidChapters-7M PT)6.7Yes---