Image Captioning on WHOOPS!
Metric: CIDEr (higher is better)
LeaderboardDataset
Loading chart...
Results
Submit a result| # | Model↕ | CIDEr▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | BLIP2 FlanT5-XXL (Fine-tuned) | 177 | Yes | Breaking Common Sense: WHOOPS! A Vision-and-Lang... | 2023-03-13 | - |
| 2 | BLIP2 FlanT5-XL (Fine-tuned) | 174 | Yes | Breaking Common Sense: WHOOPS! A Vision-and-Lang... | 2023-03-13 | - |
| 3 | BLIP2 FlanT5-XXL (Zero-Shot) | 120 | No | Breaking Common Sense: WHOOPS! A Vision-and-Lang... | 2023-03-13 | - |
| 4 | CoCa ViT-L-14 MSCOCO | 102 | No | Breaking Common Sense: WHOOPS! A Vision-and-Lang... | 2023-03-13 | - |
| 5 | BLIP Large | 65 | No | Breaking Common Sense: WHOOPS! A Vision-and-Lang... | 2023-03-13 | - |
| 6 | OFA Large | 0 | No | - | - | - |