HMM-TDNN trained with MMI + data augmentation (speed) + iVectors + 3 regularizations + Fisher (10% / 15.1% respectively trained on SWBD only)
Reported on 2 benchmarks across 1 task
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Audio2 results
- Percentage error13.3best: 6.8 (IBM (LSTM+Conformer encoder-decoder))
- Percentage error9.2best: 4.3 (IBM (LSTM+Conformer encoder-decoder))