Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/Shotluck-Holmes (3.1B)

Shotluck-Holmes (3.1B)

Reported on 12 benchmarks across 3 tasks · 1 paper · 9 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Computer Vision12 results

VideoonShot2Story20K
CIDEr· 2024-05-31
152.3
SOTA
Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization arXiv:2405.20648
VideoonShot2Story20K
METEOR· 2024-05-31
23.2
SOTA
Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization arXiv:2405.20648
VideoonShot2Story20K
ROUGE· 2024-05-31
43
SOTA
Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization arXiv:2405.20648
Video CaptioningonShot2Story20K
CIDEr· 2024-05-31
63.2
SOTA
Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization arXiv:2405.20648
Video CaptioningonShot2Story20K
METEOR· 2024-05-31
25.7
SOTA
Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization arXiv:2405.20648
Video CaptioningonShot2Story20K
ROUGE· 2024-05-31
36.2
SOTA
Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization arXiv:2405.20648
Video SummarizationonShot2Story20K
CIDEr· 2024-05-31
152.3
SOTA
Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization arXiv:2405.20648
Video SummarizationonShot2Story20K
METEOR· 2024-05-31
23.2
SOTA
Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization arXiv:2405.20648
Video SummarizationonShot2Story20K
ROUGE· 2024-05-31
43
SOTA
Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization arXiv:2405.20648
VideoonShot2Story20K
BLEU-4· 2024-05-31
7.67
best: 11.7 (SUM-shot)
Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization arXiv:2405.20648
Video CaptioningonShot2Story20K
BLEU-4· 2024-05-31
8.7
best: 10.7 (Shot2Story)
Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization arXiv:2405.20648
Video SummarizationonShot2Story20K
BLEU-4· 2024-05-31
7.67
best: 11.7 (SUM-shot)
Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization arXiv:2405.20648