Image Captioning on Object HalBench

Metric: chair_s (higher is better)

LeaderboardDataset
Loading chart...
#Modelchair_sExtra DataPaperDateCode
1RLHF-V12.2NoRLHF-V: Towards Trustworthy MLLMs via Behavior A...2023-12-01Code
2RLAIF-V 7B8.5NoRLAIF-V: Open-Source AI Feedback Leads to Super ...2024-05-27Code
3RLAIF-V 12B3.3NoRLAIF-V: Open-Source AI Feedback Leads to Super ...2024-05-27Code