Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

XLM

Natural Language Processing | Introduced 2019 | 57 papers

Description

XLM is a Transformer-based architecture that is pre-trained using one of three language modeling objectives:

Causal Language Modeling (CLM) - models the probability of a word given the previous words in a sentence.
Masked Language Modeling (MLM) - the masked language modeling objective of BERT.
Translation Language Modeling (TLM) - a new translation language modeling objective for improving cross-lingual pre-training.
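
As a rough illustration of how the three objectives differ, the sketch below builds toy (input, target) pairs for each one. It is a simplified sketch and not the authors' implementation. The function names, the `[MASK]` and `</s>` symbols, and the always-replace-with-mask rule are assumptions made for illustration (BERT's actual scheme also sometimes keeps the original token or substitutes a random one).

```python
import random

MASK = "[MASK]"  # hypothetical mask symbol, chosen for illustration
SEP = "</s>"     # hypothetical separator between the two sentences of a pair


def clm_examples(tokens):
    """Causal LM: each target token is predicted from the tokens before it."""
    return [(tokens[:i], tokens[i]) for i in range(1, len(tokens))]


def mlm_example(tokens, mask_prob=0.15, rng=None, keep=frozenset()):
    """Masked LM: hide a random subset of tokens; targets map position -> token.

    Simplification: a selected token is always replaced by MASK.
    Tokens listed in `keep` are never masked.
    """
    if rng is None:
        rng = random.Random(0)
    inputs, targets = [], {}
    for i, tok in enumerate(tokens):
        if tok not in keep and rng.random() < mask_prob:
            inputs.append(MASK)
            targets[i] = tok
        else:
            inputs.append(tok)
    return inputs, targets


def tlm_example(src_tokens, tgt_tokens, mask_prob=0.15, rng=None):
    """Translation LM: concatenate a parallel sentence pair, then mask as in MLM.

    A masked word in one language can be recovered from context in either
    language, which is what encourages cross-lingual alignment.
    """
    pair = src_tokens + [SEP] + tgt_tokens
    return mlm_example(pair, mask_prob, rng, keep=frozenset({SEP}))
```

For example, `clm_examples(["the", "cat", "sat"])` yields one prediction step per token after the first, while `tlm_example(["the", "cat"], ["le", "chat"])` masks positions across both the English and French halves of the pair and leaves the separator intact.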

The authors find that both the CLM and MLM approaches provide strong cross-lingual features that can be used for pre-training models.

Papers Using This Method

BioBridge: Unified Bio-Embedding with Bridging Modality in Code-Switched EMR (2024-12-16)
XLM for Autonomous Driving Systems: A Comprehensive Review (2024-09-16)
Ax-to-Grind Urdu: Benchmark Dataset for Urdu Fake News Detection (2024-03-20)
Mapping Transformer Leveraged Embeddings for Cross-Lingual Document Representation (2024-01-12)
An Empirical study of Unsupervised Neural Machine Translation: analyzing NMT output, model's behavior and sentences' contribution (2023-12-19)
MedAI Dialog Corpus (MEDIC): Zero-Shot Classification of Doctor and AI Responses in Health Consultations (2023-10-19)
Benchmarking Procedural Language Understanding for Low-Resource Languages: A Case Study on Turkish (2023-09-13)
PESTS: Persian_English Cross Lingual Corpus for Semantic Textual Similarity (2023-05-13)
CLaC at SemEval-2023 Task 2: Comparing Span-Prediction and Sequence-Labeling approaches for NER (2023-05-05)
Exploring Methods for Building Dialects-Mandarin Code-Mixing Corpora: A Case Study in Taiwanese Hokkien (2023-01-21)
ALIGN-MLM: Word Embedding Alignment is Crucial for Multilingual Pre-training (2022-11-15)
BERT-Sort: A Zero-shot MLM Semantic Encoder on Ordinal Features for AutoML (2022-06-01)
GeoMLAMA: Geo-Diverse Commonsense Probing on Multilingual Pre-Trained Language Models (2022-05-24)
Persian Natural Language Inference: A Meta-learning approach (2022-05-18)
HiNER: A Large Hindi Named Entity Recognition Dataset (2022-04-28)
Team ÚFAL at CMCL 2022 Shared Task: Figuring out the correct recipe for predicting Eye-Tracking features using Pretrained Language Models (2022-04-11)
Are You Robert or RoBERTa? Deceiving Online Authorship Attribution Models Using Neural Text Generators (2022-03-18)
"A Passage to India": Pre-trained Word Embeddings for Indian Languages (2021-12-27)
Prix-LM: Pretraining for Multilingual Knowledge Base Construction (2021-10-16)
Cross-Language Learning for Entity Matching (2021-10-07)