TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/BERT Based Multilingual Machine Comprehension in English a...

BERT Based Multilingual Machine Comprehension in English and Hindi

Somil Gupta, Nilesh Khade

2020-06-02Reading ComprehensionQuestion AnsweringMultilingual Machine Comprehension in English Hindi
PaperPDFCode(official)Code

Abstract

Multilingual Machine Comprehension (MMC) is a Question-Answering (QA) sub-task that involves quoting the answer for a question from a given snippet, where the question and the snippet can be in different languages. Recently released multilingual variant of BERT (m-BERT), pre-trained with 104 languages, has performed well in both zero-shot and fine-tuned settings for multilingual tasks; however, it has not been used for English-Hindi MMC yet. We, therefore, present in this article, our experiments with m-BERT for MMC in zero-shot, mono-lingual (e.g. Hindi Question-Hindi Snippet) and cross-lingual (e.g. English QuestionHindi Snippet) fine-tune setups. These model variants are evaluated on all possible multilingual settings and results are compared against the current state-of-the-art sequential QA system for these languages. Experiments show that m-BERT, with fine-tuning, improves performance on all evaluation settings across both the datasets used by the prior model, therefore establishing m-BERT based MMC as the new state-of-the-art for English and Hindi. We also publish our results on an extended version of the recently released XQuAD dataset, which we propose to use as the evaluation benchmark for future research.

Results

TaskDatasetMetricValueModel
Question AnsweringExtended XQuADEM(QE-PE)64.29m-BERT augmented with Hindi QA
Question AnsweringExtended XQuADEM(QE-PH)44.71m-BERT augmented with Hindi QA
Question AnsweringExtended XQuADEM(QH-PE)41.01m-BERT augmented with Hindi QA
Question AnsweringExtended XQuADEM(QH-PH)45.63m-BERT augmented with Hindi QA
Question AnsweringExtended XQuADF1 (QE-PE)76.51m-BERT augmented with Hindi QA
Question AnsweringExtended XQuADF1 (QE-PH)57.31m-BERT augmented with Hindi QA
Question AnsweringExtended XQuADF1(QH-PE)51.04m-BERT augmented with Hindi QA
Question AnsweringExtended XQuADF1(QH-PH)59.8m-BERT augmented with Hindi QA
Multilingual Machine Comprehension in English HindiExtended XQuADEM(QE-PE)64.29m-BERT augmented with Hindi QA
Multilingual Machine Comprehension in English HindiExtended XQuADEM(QE-PH)44.71m-BERT augmented with Hindi QA
Multilingual Machine Comprehension in English HindiExtended XQuADEM(QH-PE)41.01m-BERT augmented with Hindi QA
Multilingual Machine Comprehension in English HindiExtended XQuADEM(QH-PH)45.63m-BERT augmented with Hindi QA
Multilingual Machine Comprehension in English HindiExtended XQuADF1 (QE-PE)76.51m-BERT augmented with Hindi QA
Multilingual Machine Comprehension in English HindiExtended XQuADF1 (QE-PH)57.31m-BERT augmented with Hindi QA
Multilingual Machine Comprehension in English HindiExtended XQuADF1(QH-PE)51.04m-BERT augmented with Hindi QA
Multilingual Machine Comprehension in English HindiExtended XQuADF1(QH-PH)59.8m-BERT augmented with Hindi QA

Related Papers

From Roots to Rewards: Dynamic Tree Reasoning with RL2025-07-17Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering2025-07-17Vision-and-Language Training Helps Deploy Taxonomic Knowledge but Does Not Fundamentally Alter It2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17Describe Anything Model for Visual Question Answering on Text-rich Images2025-07-16Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility2025-07-16Warehouse Spatial Question Answering with LLM Agent2025-07-14Evaluating Attribute Confusion in Fashion Text-to-Image Generation2025-07-09