

Sicilian Translator: A Recipe for Low-Resource NMT

Eryk Wdowiak

2021-10-05 | Machine Translation | Attribute | NMT | Low-Resource Neural Machine Translation | Translation

Abstract

With 17,000 pairs of Sicilian-English translated sentences, Arba Sicula developed the first neural machine translator for the Sicilian language. Using small subword vocabularies, we trained small Transformer models with high dropout parameters and achieved BLEU scores in the upper 20s. Then we supplemented our dataset with backtranslation and multilingual translation and pushed our scores into the mid 30s. We also attribute our success to incorporating theoretical information in our dataset. Prior to training, we biased the subword vocabulary towards the desinences one finds in a textbook. And we included textbook exercises in our dataset.
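The idea of biasing the subword vocabulary towards textbook desinences can be illustrated with a minimal sketch. This is not the author's pipeline: the suffix list, the `split_desinence` and `segment` helpers, and the `min_stem` parameter are all hypothetical, and the `@@` continuation marker is borrowed from common BPE tooling. The sketch only shows how a word might be pre-split at a known ending before a subword vocabulary is learned.

```python
# Toy list of endings, chosen for illustration only.
# It is an assumption, not the vocabulary used in the paper.
DESINENCES = ["amu", "ati", "ari", "iri", "anu"]


def split_desinence(word, desinences=DESINENCES, min_stem=2):
    """Split `word` into [stem + '@@', ending] if it ends in a listed desinence.

    Words with no listed ending, or whose stem would be shorter than
    `min_stem` characters, are returned unsplit.
    """
    # Try longer endings first so the longest match wins.
    for ending in sorted(desinences, key=len, reverse=True):
        if word.endswith(ending) and len(word) - len(ending) >= min_stem:
            return [word[: -len(ending)] + "@@", ending]
    return [word]


def segment(sentence):
    """Apply `split_desinence` to every whitespace-separated word."""
    return " ".join(
        piece for word in sentence.split() for piece in split_desinence(word)
    )


print(segment("nui parramu"))
```

Pre-splitting in this way would make the endings appear as standalone tokens, so a subword learner run afterwards is more likely to keep them as vocabulary units; whether the paper achieved the bias this way is not stated in the abstract.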

Results

Task                 Dataset      Metric         Value  Model
Machine Translation  Arba Sicula  BLEU (En-Scn)  35     Larger
Machine Translation  Arba Sicula  BLEU (Scn-En)  36.8   Larger
Machine Translation  Arba Sicula  BLEU (It-Scn)  36.5   Many-to-Many
Machine Translation  Arba Sicula  BLEU (Scn-It)  30.9   Many-to-Many
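For readers unfamiliar with the metric, the following is a minimal sketch of corpus-level BLEU, assuming one reference per hypothesis, whitespace tokenization, and no smoothing. The function names are illustrative, and this is not the scoring tool used for the results above, so its outputs are not directly comparable with the reported values.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams of order `n` in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def corpus_bleu(hypotheses, references, max_n=4):
    """Corpus-level BLEU on a 0-100 scale (single reference, unsmoothed)."""
    matches = [0] * max_n   # clipped n-gram matches per order
    totals = [0] * max_n    # hypothesis n-gram counts per order
    hyp_len = ref_len = 0
    for hyp, ref in zip(hypotheses, references):
        h, r = hyp.split(), ref.split()
        hyp_len += len(h)
        ref_len += len(r)
        for n in range(1, max_n + 1):
            h_counts, r_counts = ngrams(h, n), ngrams(r, n)
            # Clip each hypothesis n-gram count by its count in the reference.
            matches[n - 1] += sum(min(c, r_counts[g]) for g, c in h_counts.items())
            totals[n - 1] += sum(h_counts.values())
    # Without smoothing, any order with zero matches gives a score of zero.
    if hyp_len == 0 or min(matches) == 0:
        return 0.0
    # Geometric mean of the n-gram precisions, computed in log space.
    log_precision = sum(math.log(m / t) for m, t in zip(matches, totals)) / max_n
    # Brevity penalty for hypotheses shorter than the references.
    bp = 1.0 if hyp_len >= ref_len else math.exp(1 - ref_len / hyp_len)
    return 100 * bp * math.exp(log_precision)


print(corpus_bleu(["a b c d"], ["a b c d e"]))
```

A hypothesis identical to its reference scores 100, while one that matches every n-gram but is shorter than the reference is reduced by the brevity penalty alone.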
