TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Universal and Independent: Multilingual Probing Framework ...

Universal and Independent: Multilingual Probing Framework for Exhaustive Model Interpretation and Evaluation

Oleg Serikov, Vitaly Protasov, Ekaterina Voloshina, Viktoria Knyazkova, Tatiana Shavrina

2022-10-24Probing Language Models
PaperPDFCodeCode(official)

Abstract

Linguistic analysis of language models is one of the ways to explain and describe their reasoning, weaknesses, and limitations. In the probing part of the model interpretability research, studies concern individual languages as well as individual linguistic structures. The question arises: are the detected regularities linguistically coherent, or on the contrary, do they dissonate at the typological scale? Moreover, the majority of studies address the inherent set of languages and linguistic structures, leaving the actual typological diversity knowledge out of scope. In this paper, we present and apply the GUI-assisted framework allowing us to easily probe a massive number of languages for all the morphosyntactic features present in the Universal Dependencies data. We show that reflecting the anglo-centric trend in NLP over the past years, most of the regularities revealed in the mBERT model are typical for the western-European languages. Our framework can be integrated with the existing probing toolboxes, model cards, and leaderboards, allowing practitioners to use and share their standard probing methods to interpret multilingual models. Thus we propose a toolkit to systematize the multilingual flaws in multilingual models, providing a reproducible experimental setup for 104 languages and 80 morphosyntactic features. https://github.com/AIRI-Institute/Probing_framework

Related Papers

Linguistically Grounded Analysis of Language Models using Shapley Head Values2024-10-17Probing Language Models on Their Knowledge Source2024-10-08Probing Language Models for Pre-training Data Detection2024-06-03Unveiling LLMs: The Evolution of Latent Representations in a Dynamic Knowledge Graph2024-04-04Probing Language Models' Gesture Understanding for Enhanced Human-AI Interaction2024-01-31Social Bias Probing: Fairness Benchmarking for Language Models2023-11-15Probing Representations for Document-level Event Extraction2023-10-23Table-GPT: Table-tuned GPT for Diverse Table Tasks2023-10-13