Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Audio
/
Speech Recognition
/
AISHELL-1
Speech Recognition on AISHELL-1
Metric: Word Error Rate (WER) (lower is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Hide extra data
Export CSV
#
Model
↕
Word Error Rate (WER)
▲
Extra Data
Paper
Date
↕
Code
1
FireRedASR-AED
0.55
Yes
FireRedASR: Open-Source Industrial-Grade Mandari...
2025-01-24
Code
2
Seed-ASR
0.68
Yes
Seed-ASR: Understanding Diverse Speech and Conte...
2024-07-05
-
3
Qwen-Audio
1.29
Yes
Qwen-Audio: Advancing Universal Audio Understand...
2023-11-14
Code
4
MMSpeech With LM
1.9
No
MMSpeech: Multi-modal Multi-task Encoder-Decoder...
2022-11-29
Code
5
Paraformer-large
1.95
Yes
FunASR: A Fundamental End-to-End Speech Recognit...
2023-05-18
Code
6
Zipformer+CR-CTC (no external language model)
4.02
No
CR-CTC: Consistency regularization on CTC for im...
2024-10-07
Code
7
Lightweight Transducer With LM
4.03
No
Lightweight Transducer Based on Frame-Level Crit...
2024-09-05
Code
8
SE-WSBO With LM
4.1
No
Improving Mandarin Speech Recogntion with Block-...
2022-07-24
Code
9
CIF-HKD With LM
4.1
No
Knowledge Transfer from Pre-trained Language Mod...
2023-01-30
Code
10
Lightweight Transducer
4.31
No
Lightweight Transducer Based on Frame-Level Crit...
2024-09-05
Code
11
UMA
4.7
No
Unimodal Aggregation for CTC-based Speech Recogn...
2023-09-15
Code
12
U2
4.72
No
Unified Streaming and Non-streaming Two-pass End...
2020-12-10
Code
13
Paraformer
4.95
No
FunASR: A Fundamental End-to-End Speech Recognit...
2023-05-18
Code
14
BAT
4.97
No
BAT: Boundary aware transducer for memory-effici...
2023-05-19
Code
15
CTC-CRF 4gram-LM
6.34
No
CAT: A CTC-CRF based ASR Toolkit Bridging the Hy...
2020-05-27
Code
16
BRA-E
6.63
No
Beyond Universal Transformer: block reusing with...
2023-03-23
-
17
CTC/Att
6.7
No
A Comparative Study on Transformer vs RNN in Spe...
2019-09-13
Code
18
Att
18.7
No
End-to-end Speech Recognition with Adaptive Comp...
2018-08-30
-