HRS-Bench

Holistic, Reliable, and Scalable Benchmark

Images · Texts · Introduced 2023-04-11

HRS-Bench is an evaluation benchmark for text-to-image (T2I) models, designed to be holistic, reliable, and scalable. It measures 13 skills grouped into five major categories: accuracy, robustness, generalization, fairness, and bias. In addition, HRS-Bench covers 50 scenarios, including fashion, animals, transportation, food, and clothes.

Source: HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models