Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Vid2Seq (VidChapters-7M PT)

Reported on 6 benchmarks across 2 tasks

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Computer Vision: 12 results

  • Video Captioning on ViTT
    CIDEr · uses extra data
    50.9
    best: 51.2 (HiCM²)
  • Video Captioning on ViTT
    METEOR · uses extra data
    9.5
    best: 9.6 (HiCM²)
  • Video Captioning on ViTT
    SODA · uses extra data
    0.151
    best: 9.1
  • Video Captioning on ViTT
    CIDEr · uses extra data
    30.2
    best: 51.2 (HiCM²)
  • Video Captioning on ViTT
    METEOR · uses extra data
    6.7
    best: 9.6 (HiCM²)
  • Video Captioning on ViTT
    SODA · uses extra data
    9.1
  • Dense Video Captioning on ViTT
    CIDEr · uses extra data
    50.9
    best: 51.2 (HiCM²)
  • Dense Video Captioning on ViTT
    METEOR · uses extra data
    9.5
    best: 9.6 (HiCM²)
  • Dense Video Captioning on ViTT
    SODA · uses extra data
    0.151
    best: 9.1
  • Dense Video Captioning on ViTT
    CIDEr · uses extra data
    30.2
    best: 51.2 (HiCM²)
  • Dense Video Captioning on ViTT
    METEOR · uses extra data
    6.7
    best: 9.6 (HiCM²)
  • Dense Video Captioning on ViTT
    SODA · uses extra data
    9.1