TyDiQA

Typologically Diverse Question Answering

TextsUnknownIntroduced 2020-03-10

TyDi QA is a question answering dataset covering 11 typologically diverse languages with 200K question-answer pairs. The languages of TyDi QA are diverse with regard to their typology — the set of linguistic features that each language expresses — such that the authors expect models performing well on this set to generalize across a large number of the languages in the world.

Source: Google Research