Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Shotluck-Holmes (3.1B)

Reported on 12 benchmarks across 3 tasks · 1 paper · 9 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Computer Vision · 12 results

  • Video on Shot2Story20K
    CIDEr · 2024-05-31
    152.3
    SOTA
    Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization · arXiv:2405.20648
  • Video on Shot2Story20K
    METEOR · 2024-05-31
    23.2
    SOTA
    Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization · arXiv:2405.20648
  • Video on Shot2Story20K
    ROUGE · 2024-05-31
    43
    SOTA
    Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization · arXiv:2405.20648
  • Video Captioning on Shot2Story20K
    CIDEr · 2024-05-31
    63.2
    SOTA
    Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization · arXiv:2405.20648
  • Video Captioning on Shot2Story20K
    METEOR · 2024-05-31
    25.7
    SOTA
    Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization · arXiv:2405.20648
  • Video Captioning on Shot2Story20K
    ROUGE · 2024-05-31
    36.2
    SOTA
    Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization · arXiv:2405.20648
  • Video Summarization on Shot2Story20K
    CIDEr · 2024-05-31
    152.3
    SOTA
    Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization · arXiv:2405.20648
  • Video Summarization on Shot2Story20K
    METEOR · 2024-05-31
    23.2
    SOTA
    Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization · arXiv:2405.20648
  • Video Summarization on Shot2Story20K
    ROUGE · 2024-05-31
    43
    SOTA
    Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization · arXiv:2405.20648
  • Video on Shot2Story20K
    BLEU-4 · 2024-05-31
    7.67
    best: 11.7 (SUM-shot)
    Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization · arXiv:2405.20648
  • Video Captioning on Shot2Story20K
    BLEU-4 · 2024-05-31
    8.7
    best: 10.7 (Shot2Story)
    Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization · arXiv:2405.20648
  • Video Summarization on Shot2Story20K
    BLEU-4 · 2024-05-31
    7.67
    best: 11.7 (SUM-shot)
    Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization · arXiv:2405.20648