ScandiQA

Introduced 2023-04-03

ScandiQA is a question-answering dataset built for the Mainland Scandinavian languages: Danish, Norwegian, and Swedish. It was developed as part of the ScandEval benchmarking platform and contains questions and answers in each of these languages. Its purpose is to evaluate how well language models comprehend and answer questions posed in the Scandinavian languages, in line with the ScandEval project's broader aim of advancing natural language processing for them.
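As a rough illustration of what evaluating a model on a question-answering dataset can involve, the sketch below scores a predicted answer against a reference answer using exact match and token-level F1. These metrics are common in question answering generally; the description above does not state that ScandEval uses them for ScandiQA, so treat that as an assumption. The record layout, field names, and the Danish example are hypothetical and are not taken from the dataset.

```python
from collections import Counter


def normalize(text: str) -> list[str]:
    """Lowercase and split on whitespace (a deliberately simple normalisation)."""
    return text.lower().split()


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalised token sequences are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def token_f1(prediction: str, reference: str) -> float:
    """Harmonic mean of token precision and recall between the two answers."""
    pred, ref = normalize(prediction), normalize(reference)
    # Multiset intersection counts each shared token at most as often
    # as it appears in both answers.
    overlap = sum((Counter(pred) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


# Hypothetical record: the field names are assumptions, not the real schema.
example = {
    "language": "da",
    "question": "Hvad er hovedstaden i Danmark?",
    "context": "København er hovedstaden i Danmark.",
    "answer": "København",
}

em = exact_match("københavn", example["answer"])
f1 = token_f1("byen København", example["answer"])
print(f"exact match: {em}, token F1: {f1:.3f}")
```

A real evaluation would average such scores over every question in a language split; the normalisation here is intentionally minimal and does not strip punctuation.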