BLUEX

Introduced 2023-07-11

BLUEX is a valuable benchmark dataset designed to evaluate language models in Portuguese. Despite Portuguese being the fifth most widely spoken language, there is a scarcity of freely available resources for assessing language models in this language. The BLUEX dataset addresses this gap by providing a multimodal collection of questions from the two leading university entrance exams conducted in Brazil: Convest (Unicamp) and Fuvest (USP). These exams span from 2018 to 2024 and cover a total of 1260 questions. Notably, 724 of these questions do not have accompanying images.