RLAIF-V 7B
Reported on 6 benchmarks across 3 tasks · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing6 results
- Hallucination Rate· 2024-05-2729.2
- Score· 2024-05-273.06best: 3.36 (RLAIF-V 12B)
- chair_i· 2024-05-274.3best: 7.5 (RLHF-V)
- chair_s· 2024-05-278.5best: 12.2 (RLHF-V)
- Hallucination Rate· 2024-05-2729.2
- Score· 2024-05-273.06best: 3.36 (RLAIF-V 12B)