TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Audio/Text-To-Speech Synthesis/LJSpeech

Text-To-Speech Synthesis on LJSpeech

Metric: Audio Quality MOS (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Audio Quality MOS▼Extra DataPaperDate↕Code
1NaturalSpeech4.56YesNaturalSpeech: End-to-End Text to Speech Synthes...2022-05-09Code
2VITS4.43YesNaturalSpeech: End-to-End Text to Speech Synthes...2022-05-09Code
3Grad-TTS + HiFiGAN (1000 steps)4.37YesGrad-TTS: A Diffusion Probabilistic Model for Te...2021-05-13Code
4Glow-TTS + HiFiGAN4.34YesGlow-TTS: A Generative Flow for Text-to-Speech v...2020-05-22Code
5FastSpeech 2 + HiFiGAN4.34YesNaturalSpeech: End-to-End Text to Speech Synthes...2022-05-09Code
6FastSpeech 2 + HiFiGAN4.32YesFastSpeech 2: Fast and High-Quality End-to-End T...2020-06-08Code
7FastDiff (4 steps)4.28YesFastDiff: A Fast Conditional Diffusion Model for...2022-04-21Code
8FastDiff-TTS4.03YesFastDiff: A Fast Conditional Diffusion Model for...2022-04-21Code
9Transformer TTS (Mel + WaveGlow)3.88YesNeural Speech Synthesis with Transformer Network2018-09-19Code
10FastSpeech (Mel + WaveGlow)3.84YesFastSpeech: Fast, Robust and Controllable Text t...2019-05-22Code
11OverFlow3.37YesOverFlow: Putting flows on top of neural transdu...2022-11-13Code
12Merlin2.4YesFastSpeech: Fast, Robust and Controllable Text t...2019-05-22Code
13temp1.25Yes---