Tianyu Yu, Haoye Zhang, Qiming Li, Qixin Xu, Yuan YAO, Da Chen, Xiaoman Lu, Ganqu Cui, Yunkai Dang, Taiwen He, Xiaocheng Feng, Jun Song, Bo Zheng, Zhiyuan Liu, Tat-Seng Chua, Maosong Sun
Traditional feedback learning for hallucination reduction relies on labor-intensive manual labeling or expensive proprietary models. This leaves the community without foundational knowledge about how to build high-quality feedback with open-source MLLMs. In this work, we introduce RLAIF-V, a novel framework that aligns MLLMs in a fully open-source paradigm. RLAIF-V maximally explores open-source MLLMs from two perspectives, including high-quality feedback data generation for preference learning and self-feedback guidance for inference-time scaling. Extensive experiments on six benchmarks in both automatic and human evaluation show that RLAIF-V substantially enhances the trustworthiness of models at both preference learning and inference time. RLAIF-V 7B reduces object hallucination by 80.7\% and overall hallucination by 33.7\%. Remarkably, RLAIF-V 12B further reveals the self-alignment potential of open-source MLLMs, where the model can learn from feedback of itself to achieve super GPT-4V trustworthiness.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Visual Question Answering (VQA) | MMHal-Bench | Hallucination Rate | 29.2 | RLAIF-V 12B |
| Visual Question Answering (VQA) | MMHal-Bench | Score | 3.36 | RLAIF-V 12B |
| Visual Question Answering (VQA) | MMHal-Bench | Hallucination Rate | 29.2 | RLAIF-V 7B |
| Visual Question Answering (VQA) | MMHal-Bench | Score | 3.06 | RLAIF-V 7B |
| Visual Question Answering (VQA) | AMBER | Accuracy | 88 | RLAIF-V 12B |
| Visual Question Answering (VQA) | AMBER | F1 | 90.9 | RLAIF-V 12B |
| Image Captioning | Object HalBench | chair_i | 4.3 | RLAIF-V 7B |
| Image Captioning | Object HalBench | chair_s | 8.5 | RLAIF-V 7B |
| Image Captioning | Object HalBench | chair_i | 1.8 | RLAIF-V 12B |
| Image Captioning | Object HalBench | chair_s | 3.3 | RLAIF-V 12B |
| Visual Question Answering | MMHal-Bench | Hallucination Rate | 29.2 | RLAIF-V 12B |
| Visual Question Answering | MMHal-Bench | Score | 3.36 | RLAIF-V 12B |
| Visual Question Answering | MMHal-Bench | Hallucination Rate | 29.2 | RLAIF-V 7B |
| Visual Question Answering | MMHal-Bench | Score | 3.06 | RLAIF-V 7B |
| Visual Question Answering | AMBER | Accuracy | 88 | RLAIF-V 12B |
| Visual Question Answering | AMBER | F1 | 90.9 | RLAIF-V 12B |