ShiftySpeech

AudioCC BY 4.0Introduced 2025-02-08

ShiftySpeech: A Large-Scale Synthetic Speech Dataset with Distribution Shifts

🔥 Key Features

  • 3000+ hours of synthetic speech
  • Diverse Distribution Shifts: The dataset spans 7 key distribution shifts, including:
    • 📖 Reading Style
    • 🎙️ Podcast
    • 🎥 YouTube
    • 🗣️ Languages (Three different languages)
    • 🌎 Demographics (including variations in age, accent, and gender)
  • Multiple Speech Generation Systems: Includes data synthesized from various TTS models and vocoders.

Dataset can be downloaded from: Hugging Face