|
||
---|---|---|
.. | ||
huggingface_models.md | ||
modelscope_models.md | ||
modelscope_models_zh.md | ||
readme.md | ||
readme_zh.md |
readme.md
(简体中文|English)
Model Zoo
Model License
You are free to use, copy, modify, and share FunASR models under the conditions of this agreement. You should indicate the model source and author information when using, copying, modifying and sharing FunASR models. You should keep the relevant names of models in [FunASR software]. Full model license could see license
Model Usage
Ref to docs
Model Zoo
Here we provided several pretrained models on different datasets. The details of models and datasets can be found on ModelScope.
Speech Recognition
Paraformer
FunASR has open-sourced a large number of pre-trained models on industrial data. You are free to use, copy, modify, and share FunASR models under the Model License Agreement. Below are some representative models, for more models please refer to the Model Zoo.
(Note: 🤗 represents the Huggingface model zoo link, ⭐ represents the ModelScope model zoo link)
Model Name | Task Details | Training Data | Parameters |
---|---|---|---|
paraformer-zh (⭐ 🤗 ) |
speech recognition, with timestamps, non-streaming | 60000 hours, Mandarin | 220M |
paraformer-zh-spk ( ⭐ 🤗 ) |
speech recognition with speaker diarization, with timestamps, non-streaming | 60000 hours, Mandarin | 220M |
paraformer-zh-online ( ⭐ 🤗 ) |
speech recognition, streaming | 60000 hours, Mandarin | 220M |
paraformer-en ( ⭐ 🤗 ) |
speech recognition, with timestamps, non-streaming | 50000 hours, English | 220M |
conformer-en ( ⭐ 🤗 ) |
speech recognition, non-streaming | 50000 hours, English | 220M |
ct-punc ( ⭐ 🤗 ) |
punctuation restoration | 100M, Mandarin and English | 1.1G |
fsmn-vad ( ⭐ 🤗 ) |
voice activity detection | 5000 hours, Mandarin and English | 0.4M |
fa-zh ( ⭐ 🤗 ) |
timestamp prediction | 5000 hours, Mandarin | 38M |