TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/QALD-9-plus: A Multilingual Dataset for Question Answering...

QALD-9-plus: A Multilingual Dataset for Question Answering over DBpedia and Wikidata Translated by Native Speakers

Aleksandr Perevalov, Dennis Diefenbach, Ricardo Usbeck, Andreas Both

2022-01-31Question AnsweringGraph Question Answering
PaperPDFCode(official)

Abstract

The ability to have the same experience for different user groups (i.e., accessibility) is one of the most important characteristics of Web-based systems. The same is true for Knowledge Graph Question Answering (KGQA) systems that provide the access to Semantic Web data via natural language interface. While following our research agenda on the multilingual aspect of accessibility of KGQA systems, we identified several ongoing challenges. One of them is the lack of multilingual KGQA benchmarks. In this work, we extend one of the most popular KGQA benchmarks - QALD-9 by introducing high-quality questions' translations to 8 languages provided by native speakers, and transferring the SPARQL queries of QALD-9 from DBpedia to Wikidata, s.t., the usability and relevance of the dataset is strongly increased. Five of the languages - Armenian, Ukrainian, Lithuanian, Bashkir and Belarusian - to our best knowledge were never considered in KGQA research community before. The latter two of the languages are considered as "endangered" by UNESCO. We call the extended dataset QALD-9-plus and made it available online https://github.com/Perevalov/qald_9_plus.

Results

TaskDatasetMetricValueModel
Question AnsweringQALD-9-PlusMacro F10.4459QAnswer-Wikidata-English
Question AnsweringQALD-9-PlusMacro F10.3171QAnswer-Wikidata-German
Question AnsweringQALD-9-PlusMacro F10.3039QAnswer-DBpedia-English
Question AnsweringQALD-9-PlusMacro F10.23QAnswer-Wikidata-French
Question AnsweringQALD-9-PlusMacro F10.2143QAnswer-Wikidata-Russian
Question AnsweringQALD-9-PlusMacro F10.1998QAnswer-DBpedia-German
Question AnsweringQALD-9-PlusMacro F10.1506QAnswer-DBpedia-French
Question AnsweringQALD-9-PlusMacro F10.1503Platypus-Wikidata-English
Question AnsweringQALD-9-PlusMacro F10.124DeepPavlov-Wikidata-English
Question AnsweringQALD-9-PlusMacro F10.0957QAnswer-DBpedia-Russian
Question AnsweringQALD-9-PlusMacro F10.087DeepPavlov-Wikidata-Russian
Question AnsweringQALD-9-PlusMacro F10.0417Platypus-Wikidata-French

Related Papers

From Roots to Rewards: Dynamic Tree Reasoning with RL2025-07-17Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering2025-07-17Vision-and-Language Training Helps Deploy Taxonomic Knowledge but Does Not Fundamentally Alter It2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17Describe Anything Model for Visual Question Answering on Text-rich Images2025-07-16Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility2025-07-16Warehouse Spatial Question Answering with LLM Agent2025-07-14Evaluating Attribute Confusion in Fashion Text-to-Image Generation2025-07-09