Metric: Gender Consistency (higher is better)
| # | Model↕ | Gender Consistency▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | pGSLM | 88.5 | Yes | Text-Free Prosody-Aware Generative Spoken Langua... | 2021-09-07 | Code |
| 2 | Spirit-LM (Expr.) | 85 | Yes | Spirit LM: Interleaved Spoken and Written Langua... | 2024-02-08 | Code |
| 3 | TASLM 1B (embedding) | 75.5 | No | - | - | - |
| 4 | LAST 350M | 70.5 | Yes | LAST: Language Model Aware Speech Tokenization | 2024-09-05 | - |
| 5 | TASLM 1B (token) | 70.5 | No | - | - | - |
| 6 | TWIST 7B | 70 | Yes | Textually Pretrained Speech Language Models | 2023-05-22 | Code |
| 7 | TWIST 1.3B | 69.5 | Yes | Textually Pretrained Speech Language Models | 2023-05-22 | Code |
| 8 | LAST 1.3B | 68.5 | Yes | LAST: Language Model Aware Speech Tokenization | 2024-09-05 | - |
| 9 | TWIST 350M | 68 | Yes | Textually Pretrained Speech Language Models | 2023-05-22 | Code |
| 10 | Spirit-LM (base) | 67 | Yes | Spirit LM: Interleaved Spoken and Written Langua... | 2024-02-08 | Code |