TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Audio/Speech Recognition/Jam-ALT

Speech Recognition on Jam-ALT

Metric: Punctuation F1 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Punctuation F1▼Extra DataPaperDate↕Code
1AudioShake v357NoLyrics Transcription for Humans: A Readability-A...2024-07-30Code
2AudioShake v150.5NoJam-ALT: A Formatting-Aware Lyrics Transcription...2023-11-23Code
3Whisper v2 +lang45NoLyrics Transcription for Humans: A Readability-A...2024-07-30Code
4Whisper v244.2NoLyrics Transcription for Humans: A Readability-A...2024-07-30Code
5Whisper v3 +lang43.7NoLyrics Transcription for Humans: A Readability-A...2024-07-30Code
6Whisper v343NoLyrics Transcription for Humans: A Readability-A...2024-07-30Code
7Whisper v241.7NoJam-ALT: A Formatting-Aware Lyrics Transcription...2023-11-23Code
8Whisper v341.6NoJam-ALT: A Formatting-Aware Lyrics Transcription...2023-11-23Code
9Whisper v2 +demucs41.6NoLyrics Transcription for Humans: A Readability-A...2024-07-30Code
10Whisper v2 +demucs +lang39.4NoLyrics Transcription for Humans: A Readability-A...2024-07-30Code
11Whisper v3 +demucs +lang33.7NoLyrics Transcription for Humans: A Readability-A...2024-07-30Code
12Whisper v3 +demucs33NoLyrics Transcription for Humans: A Readability-A...2024-07-30Code
13Whisper v3 +demucs29NoJam-ALT: A Formatting-Aware Lyrics Transcription...2023-11-23Code
14Whisper v2 +demucs28NoJam-ALT: A Formatting-Aware Lyrics Transcription...2023-11-23Code
15OWSM v3.1 +lang22.5NoLyrics Transcription for Humans: A Readability-A...2024-07-30Code
16OWSM v3.1 +demucs +lang20NoLyrics Transcription for Humans: A Readability-A...2024-07-30Code