NeurST: Neural Speech Translation Toolkit

Chengqi Zhao, Mingxuan Wang, Qianqian Dong, Rong Ye, Lei Li

2020-12-18 | ACL 2021 5 | Speech-to-Text Translation, Translation

Abstract

NeurST is an open-source toolkit for neural speech translation. The toolkit mainly focuses on end-to-end speech translation, which is easy to use, modify, and extend to advanced speech translation research and products. NeurST aims at facilitating the speech translation research for NLP researchers and building reliable benchmarks for this field. It provides step-by-step recipes for feature extraction, data preprocessing, distributed training, and evaluation. In this paper, we will introduce the framework design of NeurST and show experimental results for different benchmark datasets, which can be regarded as reliable baselines for future research. The toolkit is publicly available at https://github.com/bytedance/neurst/ and we will continuously update the performance of NeurST with other counterparts and studies at https://st-benchmark.github.io/.

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Speech-to-Text Translation | MuST-C EN->FR | Case-sensitive sacreBLEU | 33.3 | Transformer + ASR Pretrain + SpecAug |
| Speech-to-Text Translation | MuST-C EN->FR | Case-sensitive sacreBLEU | 32.3 | Transformer + ASR Pretrain |
| Speech-to-Text Translation | MuST-C EN->DE | Case-sensitive sacreBLEU | 22.8 | Transformer + ASR Pretrain |
| Speech-to-Text Translation | libri-trans | Case-insensitive sacreBLEU | 17.2 | Transformer + ASR Pretrain + SpecAug |
| Speech-to-Text Translation | libri-trans | Case-insensitive tokenized BLEU | 18.7 | Transformer + ASR Pretrain + SpecAug |
| Speech-to-Text Translation | libri-trans | Case-sensitive sacreBLEU | 16.3 | Transformer + ASR Pretrain + SpecAug |
| Speech-to-Text Translation | libri-trans | Case-sensitive tokenized BLEU | 17.8 | Transformer + ASR Pretrain + SpecAug |
| Speech-to-Text Translation | libri-trans | Case-insensitive sacreBLEU | 16.5 | Transformer + ASR Pretrain |
| Speech-to-Text Translation | libri-trans | Case-insensitive tokenized BLEU | 17.9 | Transformer + ASR Pretrain |
| Speech-to-Text Translation | libri-trans | Case-sensitive sacreBLEU | 15.5 | Transformer + ASR Pretrain |
| Speech-to-Text Translation | libri-trans | Case-sensitive tokenized BLEU | 16.9 | Transformer + ASR Pretrain |
| Speech-to-Text Translation | MuST-C EN->ES | Case-sensitive sacreBLEU | 27.4 | Transformer + ASR Pretrain + SpecAug |
| Speech-to-Text Translation | MuST-C EN->ES | Case-sensitive sacreBLEU | 26.8 | Transformer + ASR Pretrain |
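
The reported scores make it easy to read off how much SpecAug adds on top of ASR pretraining for each dataset and metric. The sketch below is not part of NeurST; it copies the values from the results listed above and subtracts the baseline score from the SpecAug score wherever both are reported. The names `RESULTS`, `specaug_gains`, and `GAINS` are hypothetical, chosen only for this illustration.

```python
# Illustrative only: scores are copied from the results listed above.
# RESULTS, specaug_gains and GAINS are hypothetical names, not NeurST APIs.
BASE = "Transformer + ASR Pretrain"
SPECAUG = "Transformer + ASR Pretrain + SpecAug"

# (dataset, metric) -> {model name: BLEU score}
RESULTS = {
    ("MuST-C EN->FR", "Case-sensitive sacreBLEU"): {SPECAUG: 33.3, BASE: 32.3},
    ("MuST-C EN->DE", "Case-sensitive sacreBLEU"): {BASE: 22.8},
    ("libri-trans", "Case-insensitive sacreBLEU"): {SPECAUG: 17.2, BASE: 16.5},
    ("libri-trans", "Case-insensitive tokenized BLEU"): {SPECAUG: 18.7, BASE: 17.9},
    ("libri-trans", "Case-sensitive sacreBLEU"): {SPECAUG: 16.3, BASE: 15.5},
    ("libri-trans", "Case-sensitive tokenized BLEU"): {SPECAUG: 17.8, BASE: 16.9},
    ("MuST-C EN->ES", "Case-sensitive sacreBLEU"): {SPECAUG: 27.4, BASE: 26.8},
}


def specaug_gains(results):
    """Return {(dataset, metric): gain} for rows that report both models."""
    gains = {}
    for key, scores in results.items():
        if BASE in scores and SPECAUG in scores:
            # Round to one decimal to match the precision of the reported scores.
            gains[key] = round(scores[SPECAUG] - scores[BASE], 1)
    return gains


GAINS = specaug_gains(RESULTS)

if __name__ == "__main__":
    for (dataset, metric), gain in GAINS.items():
        print(f"{dataset:15s} {metric:33s} {gain:+.1f} BLEU")
```

MuST-C EN->DE is skipped because only the baseline score is listed for it. Gains are only comparable within the same dataset and metric, since sacreBLEU and tokenized BLEU, and case-sensitive and case-insensitive scoring, are computed differently.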

Related Papers

A Translation of Probabilistic Event Calculus into Markov Decision Processes (2025-07-17)
Function-to-Style Guidance of LLMs for Code Translation (2025-07-15)
Speak2Sign3D: A Multi-modal Pipeline for English Speech to American Sign Language Animation (2025-07-09)
Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings (2025-07-09)
Unconditional Diffusion for Generative Sequential Recommendation (2025-07-08)
GRAFT: A Graph-based Flow-aware Agentic Framework for Document-level Machine Translation (2025-07-04)
TransLaw: Benchmarking Large Language Models in Multi-Agent Simulation of the Collaborative Translation (2025-07-01)
CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation (2025-06-29)