Pretrained Models on Huggingface
Model License
Model Zoo
Here we provided several pretrained models on different datasets. The details of models and datasets can be found on ModelScope.
Speech Recognition Models
Paraformer Models
Model Name |
Language |
Training Data |
Vocab Size |
Parameter |
Offline/Online |
Notes |
Paraformer-large |
CN & EN |
Alibaba Speech Data (60000hours) |
8404 |
220M |
Offline |
Duration of input wav <= 20s |
UniASR Models
Conformer Models
RNN-T Models
Multi-talker Speech Recognition Models
MFCCA Models
Voice Activity Detection Models
Model Name |
Training Data |
Parameters |
Sampling Rate |
Notes |
FSMN-VAD |
Alibaba Speech Data (5000hours) |
0.4M |
16000 |
|
Punctuation Restoration Models
Model Name |
Training Data |
Parameters |
Vocab Size |
Offline/Online |
Notes |
CT-Transformer |
Alibaba Text Data |
70M |
272727 |
Offline |
offline punctuation model |
Language Models
Speaker Verification Models
Speaker diarization Models
Timestamp Prediction Models