Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Audio
/
Speech Recognition
/
Jam-ALT
Speech Recognition on Jam-ALT
Metric: Punctuation F1 (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Export CSV
Sort:
Punctuation F1 (best first)
Punctuation F1 (worst first)
Date (newest first)
Date (oldest first)
Model name (A→Z)
#
Model
↕
Punctuation F1
▼
Extra Data
Paper
Date
↕
Code
1
AudioShake v3
57
No
Lyrics Transcription for Humans: A Readability-A...
2024-07-30
Code
2
AudioShake v1
50.5
No
Jam-ALT: A Formatting-Aware Lyrics Transcription...
2023-11-23
Code
3
Whisper v2 +lang
45
No
Lyrics Transcription for Humans: A Readability-A...
2024-07-30
Code
4
Whisper v2
44.2
No
Lyrics Transcription for Humans: A Readability-A...
2024-07-30
Code
5
Whisper v3 +lang
43.7
No
Lyrics Transcription for Humans: A Readability-A...
2024-07-30
Code
6
Whisper v3
43
No
Lyrics Transcription for Humans: A Readability-A...
2024-07-30
Code
7
Whisper v2
41.7
No
Jam-ALT: A Formatting-Aware Lyrics Transcription...
2023-11-23
Code
8
Whisper v3
41.6
No
Jam-ALT: A Formatting-Aware Lyrics Transcription...
2023-11-23
Code
9
Whisper v2 +demucs
41.6
No
Lyrics Transcription for Humans: A Readability-A...
2024-07-30
Code
10
Whisper v2 +demucs +lang
39.4
No
Lyrics Transcription for Humans: A Readability-A...
2024-07-30
Code
11
Whisper v3 +demucs +lang
33.7
No
Lyrics Transcription for Humans: A Readability-A...
2024-07-30
Code
12
Whisper v3 +demucs
33
No
Lyrics Transcription for Humans: A Readability-A...
2024-07-30
Code
13
Whisper v3 +demucs
29
No
Jam-ALT: A Formatting-Aware Lyrics Transcription...
2023-11-23
Code
14
Whisper v2 +demucs
28
No
Jam-ALT: A Formatting-Aware Lyrics Transcription...
2023-11-23
Code
15
OWSM v3.1 +lang
22.5
No
Lyrics Transcription for Humans: A Readability-A...
2024-07-30
Code
16
OWSM v3.1 +demucs +lang
20
No
Lyrics Transcription for Humans: A Readability-A...
2024-07-30
Code
#1
AudioShake v3
SOTA
57
Punctuation F1
· 2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark
Code
#2
AudioShake v1
SOTA
50.5
Punctuation F1
· 2023-11-23
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark
Code
#3
Whisper v2 +lang
45
Punctuation F1
· 2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark
Code
#4
Whisper v2
44.2
Punctuation F1
· 2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark
Code
#5
Whisper v3 +lang
43.7
Punctuation F1
· 2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark
Code
#6
Whisper v3
43
Punctuation F1
· 2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark
Code
#7
Whisper v2
41.7
Punctuation F1
· 2023-11-23
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark
Code
#8
Whisper v3
41.6
Punctuation F1
· 2023-11-23
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark
Code
#9
Whisper v2 +demucs
41.6
Punctuation F1
· 2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark
Code
#10
Whisper v2 +demucs +lang
39.4
Punctuation F1
· 2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark
Code
#11
Whisper v3 +demucs +lang
33.7
Punctuation F1
· 2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark
Code
#12
Whisper v3 +demucs
33
Punctuation F1
· 2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark
Code
#13
Whisper v3 +demucs
29
Punctuation F1
· 2023-11-23
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark
Code
#14
Whisper v2 +demucs
28
Punctuation F1
· 2023-11-23
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark
Code
#15
OWSM v3.1 +lang
22.5
Punctuation F1
· 2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark
Code
#16
OWSM v3.1 +demucs +lang
20
Punctuation F1
· 2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark
Code