TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Unsupervised Multilingual Alignment using Wasserstein Bary...

Unsupervised Multilingual Alignment using Wasserstein Barycenter

Xin Lian, Kshitij Jain, Jakub Truszkowski, Pascal Poupart, Yao-Liang Yu

2020-01-28Machine TranslationUnsupervised Machine TranslationTranslationAllWord Alignment
PaperPDFCode

Abstract

We study unsupervised multilingual alignment, the problem of finding word-to-word translations between multiple languages without using any parallel data. One popular strategy is to reduce multilingual alignment to the much simplified bilingual setting, by picking one of the input languages as the pivot language that we transit through. However, it is well-known that transiting through a poorly chosen pivot language (such as English) may severely degrade the translation quality, since the assumed transitive relations among all pairs of languages may not be enforced in the training process. Instead of going through a rather arbitrarily chosen pivot language, we propose to use the Wasserstein barycenter as a more informative "mean" language: it encapsulates information from all languages and minimizes all pairwise transportation costs. We evaluate our method on standard benchmarks and demonstrate state-of-the-art performances.

Results

TaskDatasetMetricValueModel
Word Alignmenten-esP@184.26Barycenter Alignment
Word Alignmentes-enP@183.5Barycenter Alignment
Word Alignmentfr-enP@183.23Barycenter Alignment
Word Alignmenten-itP@181.45Barycenter Alignment
Word AlignmentMUSE en-ptP@184.65Barycenter Alignment
Word AlignmentMUSE en-deP@174.08Barycenter Alignment
Word Alignmenten-frP@182.94Barycenter Alignment

Related Papers

A Translation of Probabilistic Event Calculus into Markov Decision Processes2025-07-17Function-to-Style Guidance of LLMs for Code Translation2025-07-15Modeling Code: Is Text All You Need?2025-07-15All Eyes, no IMU: Learning Flight Attitude from Vision Alone2025-07-15Speak2Sign3D: A Multi-modal Pipeline for English Speech to American Sign Language Animation2025-07-09Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings2025-07-09Unconditional Diffusion for Generative Sequential Recommendation2025-07-08Is Diversity All You Need for Scalable Robotic Manipulation?2025-07-08