Papers With Code 2 | ML Benchmarks, SotA Results & Code

ShiftySpeech: A Large-Scale Synthetic Speech Dataset with Distribution Shifts

🔥 Key Features

3000+ hours of synthetic speech
Diverse Distribution Shifts: The dataset spans 7 key distribution shifts, including:
- 📖 Reading Style
- 🎙️ Podcast
- 🎥 YouTube
- 🗣️ Languages (Three different languages)
- 🌎 Demographics (including variations in age, accent, and gender)
Multiple Speech Generation Systems: Includes data synthesized from various TTS models and vocoders.

Dataset can be downloaded from: Hugging Face