Speech Recognition on Jam-ALT Spanish
Metric: Line break F-1 (higher is better)
Results
| # | Model | Line break F-1 | Extra Data | Paper | Date | Code |
|---|-------|----------------|------------|-------|------|------|
| 1 | AudioShake v1 | 82.7 | No | Jam-ALT: A Formatting-Aware Lyrics Transcription... | 2023-11-23 | Code |
| 2 | AudioShake v3 | 81.5 | No | Lyrics Transcription for Humans: A Readability-A... | 2024-07-30 | Code |
| 3 | Whisper v3 +lang | 74.5 | No | Lyrics Transcription for Humans: A Readability-A... | 2024-07-30 | Code |
| 4 | Whisper v3 | 73.7 | No | Jam-ALT: A Formatting-Aware Lyrics Transcription... | 2023-11-23 | Code |
| 5 | Whisper v3 | 73.7 | No | Jam-ALT: A Formatting-Aware Lyrics Transcription... | 2023-11-23 | Code |
| 6 | Whisper v2 | 71.7 | No | Lyrics Transcription for Humans: A Readability-A... | 2024-07-30 | Code |
| 7 | Whisper v2 | 71.7 | No | Lyrics Transcription for Humans: A Readability-A... | 2024-07-30 | Code |
| 8 | Whisper v2 +lang | 71.5 | No | Lyrics Transcription for Humans: A Readability-A... | 2024-07-30 | Code |
| 9 | Whisper v2 +demucs | 56.6 | No | Lyrics Transcription for Humans: A Readability-A... | 2024-07-30 | Code |
| 10 | Whisper v2 +demucs | 56.4 | No | Jam-ALT: A Formatting-Aware Lyrics Transcription... | 2023-11-23 | Code |
| 11 | Whisper v3 +demucs +lang | 54.7 | No | Lyrics Transcription for Humans: A Readability-A... | 2024-07-30 | Code |
| 12 | Whisper v2 +demucs +lang | 52.6 | No | Lyrics Transcription for Humans: A Readability-A... | 2024-07-30 | Code |
| 13 | Whisper v3 +demucs | 52.4 | No | Jam-ALT: A Formatting-Aware Lyrics Transcription... | 2023-11-23 | Code |
| 14 | Whisper v3 +demucs | 52.3 | No | Lyrics Transcription for Humans: A Readability-A... | 2024-07-30 | Code |
| 15 | OWSM v3.1 +demucs +lang | 33.5 | No | Lyrics Transcription for Humans: A Readability-A... | 2024-07-30 | Code |
| 16 | OWSM v3.1 +lang | 30.2 | No | Lyrics Transcription for Humans: A Readability-A... | 2024-07-30 | Code |
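
This page does not say how line breaks are matched between a predicted and a reference transcript, so the benchmark's own scoring procedure is not reproduced here. As a general reminder of what an F-1 score is, the sketch below computes the harmonic mean of precision and recall from raw counts and scales it to 0-100, matching the scale of the scores listed. The helper name `f1_percent` and the counts are hypothetical and purely illustrative.

```python
def f1_percent(true_pos: int, false_pos: int, false_neg: int) -> float:
    """Return the F-1 score on a 0-100 scale from raw counts.

    Hypothetical helper: how the counts are obtained (i.e. how predicted
    line breaks are matched to reference line breaks) is an assumption
    left outside this sketch.
    """
    if true_pos == 0:
        # No correct predictions: precision and recall are both zero.
        return 0.0
    precision = true_pos / (true_pos + false_pos)
    recall = true_pos / (true_pos + false_neg)
    # Harmonic mean of precision and recall, expressed as a percentage.
    return 100.0 * 2 * precision * recall / (precision + recall)


# Illustrative counts only, not taken from the benchmark.
score = f1_percent(true_pos=80, false_pos=20, false_neg=20)
print(round(score, 1))
```

Because F-1 is a harmonic mean, a system that inserts very few line breaks (high precision, low recall) or far too many (low precision, high recall) scores low on this metric.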