Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Audio
/
Speech Recognition
/
Jam-ALT German
Speech Recognition on Jam-ALT German
Metric: Word Error Rate (WER) (lower is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Export CSV
Sort:
Word Error Rate (WER) (best first)
Word Error Rate (WER) (worst first)
Date (newest first)
Date (oldest first)
Model name (A→Z)
#
Model
↕
Word Error Rate (WER)
▲
Extra Data
Paper
Date
↕
Code
1
AudioShake v3
12.6
No
Lyrics Transcription for Humans: A Readability-A...
2024-07-30
Code
2
Whisper v2 +lang
19.9
No
Lyrics Transcription for Humans: A Readability-A...
2024-07-30
Code
3
Whisper v2 +demucs +lang
23.9
No
Lyrics Transcription for Humans: A Readability-A...
2024-07-30
Code
4
AudioShake v1
24.4
No
Jam-ALT: A Formatting-Aware Lyrics Transcription...
2023-11-23
Code
5
Whisper v3 +lang
35.9
No
Lyrics Transcription for Humans: A Readability-A...
2024-07-30
Code
6
Whisper v3
40.7
No
Jam-ALT: A Formatting-Aware Lyrics Transcription...
2023-11-23
Code
7
Whisper v3
40.7
No
Jam-ALT: A Formatting-Aware Lyrics Transcription...
2023-11-23
Code
8
Whisper v3 +demucs +lang
40.8
No
Lyrics Transcription for Humans: A Readability-A...
2024-07-30
Code
9
Whisper v3 +demucs
43.5
No
Jam-ALT: A Formatting-Aware Lyrics Transcription...
2023-11-23
Code
10
Whisper v3 +demucs
43.5
No
Jam-ALT: A Formatting-Aware Lyrics Transcription...
2023-11-23
Code
11
Whisper v2
45.4
No
Jam-ALT: A Formatting-Aware Lyrics Transcription...
2023-11-23
Code
12
OWSM v3.1 +demucs +lang
51.8
No
Lyrics Transcription for Humans: A Readability-A...
2024-07-30
Code
13
Whisper v2
54.5
No
Lyrics Transcription for Humans: A Readability-A...
2024-07-30
Code
14
OWSM v3.1 +lang
63.3
No
Lyrics Transcription for Humans: A Readability-A...
2024-07-30
Code
15
Whisper v2 +demucs
65.2
No
Jam-ALT: A Formatting-Aware Lyrics Transcription...
2023-11-23
Code
16
Whisper v2 +demucs
65.2
No
Jam-ALT: A Formatting-Aware Lyrics Transcription...
2023-11-23
Code
#1
AudioShake v3
SOTA
12.6
Word Error Rate (WER)
· 2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark
Code
#2
Whisper v2 +lang
SOTA
19.9
Word Error Rate (WER)
· 2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark
Code
#3
Whisper v2 +demucs +lang
SOTA
23.9
Word Error Rate (WER)
· 2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark
Code
#4
AudioShake v1
SOTA
24.4
Word Error Rate (WER)
· 2023-11-23
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark
Code
#5
Whisper v3 +lang
35.9
Word Error Rate (WER)
· 2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark
Code
#6
Whisper v3
SOTA
40.7
Word Error Rate (WER)
· 2023-11-23
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark
Code
#7
Whisper v3
40.7
Word Error Rate (WER)
· 2023-11-23
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark
Code
#8
Whisper v3 +demucs +lang
40.8
Word Error Rate (WER)
· 2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark
Code
#9
Whisper v3 +demucs
SOTA
43.5
Word Error Rate (WER)
· 2023-11-23
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark
Code
#10
Whisper v3 +demucs
43.5
Word Error Rate (WER)
· 2023-11-23
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark
Code
#11
Whisper v2
SOTA
45.4
Word Error Rate (WER)
· 2023-11-23
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark
Code
#12
OWSM v3.1 +demucs +lang
51.8
Word Error Rate (WER)
· 2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark
Code
#13
Whisper v2
54.5
Word Error Rate (WER)
· 2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark
Code
#14
OWSM v3.1 +lang
63.3
Word Error Rate (WER)
· 2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark
Code
#15
Whisper v2 +demucs
SOTA
65.2
Word Error Rate (WER)
· 2023-11-23
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark
Code
#16
Whisper v2 +demucs
65.2
Word Error Rate (WER)
· 2023-11-23
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark
Code