TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Video Captioning/YouCook2

Video Captioning on YouCook2

Metric: SODA (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕SODA▼Extra DataPaperDate↕Code
1HiCM²10.73YesHiCM$^2$: Hierarchical Compact Memory Modeling f...2024-12-19Code
2Vid2Seq (HowTo100M+VidChapters-7M PT)10.3Yes---
3Vid2Seq7.9YesVid2Seq: Large-Scale Pretraining of a Visual Lan...2023-02-27Code
4CM²5.34NoDo You Remember? Dense Video Captioning with Cro...2024-04-11Code
5GVL4.91NoLearning Grounded Vision-Language Representation...2023-03-11Code
6PDVC (TSN features, no SCST)4.42NoEnd-to-End Dense Video Captioning with Parallel ...2021-08-17Code
7Vid2Seq (HowTo100M+VidChapters-7M PT)3.9Yes---