Gemini + DDCoT

Reported on 3 benchmarks across 1 task · 1 paper

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Reasoning3 results

Visual ReasoningonWinoground
Group Score· 2024-01-05
23.75
best: 58.75 (GPT-4V (CoT, pick b/w two options))
CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal Models with Multiple Image Inputs arXiv:2401.02582
Visual ReasoningonWinoground
Image Score· 2024-01-05
25
best: 68.75 (GPT-4V (CoT, pick b/w two options))
CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal Models with Multiple Image Inputs arXiv:2401.02582
Visual ReasoningonWinoground
Text Score· 2024-01-05
45
best: 75.5 (GPT-4o + CA)
CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal Models with Multiple Image Inputs arXiv:2401.02582