QALD-9-plus: A Multilingual Dataset for Question Answering over DBpedia and Wikidata Translated by Native Speakers

Aleksandr Perevalov, Dennis Diefenbach, Ricardo Usbeck, Andreas Both

2022-01-31Question Answering Graph Question Answering

Abstract

The ability to have the same experience for different user groups (i.e., accessibility) is one of the most important characteristics of Web-based systems. The same is true for Knowledge Graph Question Answering (KGQA) systems that provide the access to Semantic Web data via natural language interface. While following our research agenda on the multilingual aspect of accessibility of KGQA systems, we identified several ongoing challenges. One of them is the lack of multilingual KGQA benchmarks. In this work, we extend one of the most popular KGQA benchmarks - QALD-9 by introducing high-quality questions' translations to 8 languages provided by native speakers, and transferring the SPARQL queries of QALD-9 from DBpedia to Wikidata, s.t., the usability and relevance of the dataset is strongly increased. Five of the languages - Armenian, Ukrainian, Lithuanian, Bashkir and Belarusian - to our best knowledge were never considered in KGQA research community before. The latter two of the languages are considered as "endangered" by UNESCO. We call the extended dataset QALD-9-plus and made it available online https://github.com/Perevalov/qald_9_plus.

Results

Task	Dataset	Metric	Value	Model
Question Answering	QALD-9-Plus	Macro F1	0.4459	QAnswer-Wikidata-English
Question Answering	QALD-9-Plus	Macro F1	0.3171	QAnswer-Wikidata-German
Question Answering	QALD-9-Plus	Macro F1	0.3039	QAnswer-DBpedia-English
Question Answering	QALD-9-Plus	Macro F1	0.23	QAnswer-Wikidata-French
Question Answering	QALD-9-Plus	Macro F1	0.2143	QAnswer-Wikidata-Russian
Question Answering	QALD-9-Plus	Macro F1	0.1998	QAnswer-DBpedia-German
Question Answering	QALD-9-Plus	Macro F1	0.1506	QAnswer-DBpedia-French
Question Answering	QALD-9-Plus	Macro F1	0.1503	Platypus-Wikidata-English
Question Answering	QALD-9-Plus	Macro F1	0.124	DeepPavlov-Wikidata-English
Question Answering	QALD-9-Plus	Macro F1	0.0957	QAnswer-DBpedia-Russian
Question Answering	QALD-9-Plus	Macro F1	0.087	DeepPavlov-Wikidata-Russian
Question Answering	QALD-9-Plus	Macro F1	0.0417	Platypus-Wikidata-French

QALD-9-plus: A Multilingual Dataset for Question Answering over DBpedia and Wikidata Translated by Native Speakers

Abstract

Results

Related Papers

QALD-9-plus: A Multilingual Dataset for Question Answering over DBpedia and Wikidata Translated by Native Speakers

Abstract

Results

Related Papers