TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Video Captioning/ActivityNet Captions

Video Captioning on ActivityNet Captions

Metric: BLEU4 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕BLEU4▼Extra DataPaperDate↕Code
1VideoCoCa14.7YesVideoCoCa: Video-Text Modeling with Zero-Shot Tr...2022-12-09-
2VLTinT (ae-test split) C3D/Ling14.5NoVLTinT: Visual-Linguistic Transformer-in-Transfo...2022-11-28Code
3VLCap (ae-test split) - Appearance + Language13.38NoVLCap: Vision-Language with Contrastive Learning...2022-06-26Code
4COOT (ae-test split) - Only Appearance features10.85NoCOOT: Cooperative Hierarchical Transformer for V...2020-11-01Code
5MART (ae-test split) - Appearance + Flow10.33NoMART: Memory-Augmented Recurrent Transformer for...2020-05-11Code
6CM²2.38NoDo You Remember? Dense Video Captioning with Cro...2024-04-11Code