NQiI
Natural Questions In Icelandic
Natural Questions in Icelandic (NQiI) is a valuable dataset designed for extractive question answering (QA) in the Icelandic language. Let me provide you with some details about this dataset:
-
Purpose and Importance:
- The NQiI dataset was created to facilitate the development and evaluation of Icelandic QA systems.
- It also supports the development of QA methods that need to work across a wide range of morphologically and grammatically diverse languages in a multilingual context¹.
-
Dataset Creation:
- Contributors were asked to come up with questions they were genuinely interested in knowing the answers to.
- Later, they were tasked with finding answers to each other's questions using a previously published methodology.
- The questions in NQiI are "natural" in the sense that they arise from genuine curiosity¹.
-
Dataset Details:
- The complete NQiI dataset contains 18,000 labeled entries.
- Among these, 5,568 entries are directly suitable for training an extractive QA system specifically for Icelandic¹.
-
Resource and Evaluation:
- NQiI serves as a valuable resource for Icelandic language research.
- Researchers have used it to create and evaluate systems capable of extractive QA in Icelandic¹.
(1) Natural Questions in Icelandic - ACL Anthology. https://aclanthology.org/2022.lrec-1.477/. (2) NQiI - Natural Questions In Icelandic - v1.0 - CLARIN. https://repository.clarin.is/repository/xmlui/handle/20.500.12537/143. (3) Natural Questions in Icelandic - ACL Anthology. https://aclanthology.org/2022.lrec-1.477/. (4) Natural Questions in Icelandic - ACL Anthology. https://aclanthology.org/2022.lrec-1.477.pdf. (5) undefined. https://aclanthology.org/2022.lrec-1.477.