IrvingGao d7a6de771c first commit for takway.ai		2024-05-18 15:50:56 +08:00

readme.md

(Simplified Chinese | English)

Model Zoo

Model License

You are free to use, copy, modify, and share FunASR models under the conditions of this agreement. When using, copying, modifying, or sharing FunASR models, you should indicate the model source and author information, and you should retain the relevant model names in [FunASR software]. For the full terms, see the license.

Model Usage

Refer to the docs.

Model Zoo

Here we provide several models pretrained on different datasets. Details of the models and datasets can be found on ModelScope.

Speech Recognition

Paraformer

FunASR has open-sourced a large number of models pre-trained on industrial data. You are free to use, copy, modify, and share FunASR models under the Model License Agreement. Below are some representative models; for more, please refer to the Model Zoo.

(Note: 🤗 represents the Huggingface model zoo link, ⭐ represents the ModelScope model zoo link)

Model Name	Task Details	Training Data	Parameters
paraformer-zh ( ⭐ 🤗 )	speech recognition, with timestamps, non-streaming	60000 hours, Mandarin	220M
paraformer-zh-spk ( ⭐ 🤗 )	speech recognition with speaker diarization, with timestamps, non-streaming	60000 hours, Mandarin	220M
paraformer-zh-online ( ⭐ 🤗 )	speech recognition, streaming	60000 hours, Mandarin	220M
paraformer-en ( ⭐ 🤗 )	speech recognition, with timestamps, non-streaming	50000 hours, English	220M
conformer-en ( ⭐ 🤗 )	speech recognition, non-streaming	50000 hours, English	220M
ct-punc ( ⭐ 🤗 )	punctuation restoration	100M, Mandarin and English	1.1G
fsmn-vad ( ⭐ 🤗 )	voice activity detection	5000 hours, Mandarin and English	0.4M
fa-zh ( ⭐ 🤗 )	timestamp prediction	5000 hours, Mandarin	38M
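
The table above can be read as a small catalog keyed by task, language, and streaming support. The sketch below encodes those rows in plain Python and filters them; the `ModelEntry` class, the `find_models` helper, the task labels, and the `"zh"`/`"en"` language codes are illustrative names, not part of FunASR. The `transcribe` function assumes that the `funasr` package is installed and that its `AutoModel` class resolves the short names `paraformer-zh`, `fsmn-vad`, and `ct-punc`; check the docs for the version you have before relying on it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelEntry:
    """One row of the model table (illustrative structure, not a FunASR type)."""
    name: str
    task: str            # "asr", "punc", "vad", or "timestamp" (labels chosen here)
    languages: tuple     # "zh" = Mandarin, "en" = English (codes chosen here)
    streaming: bool = False


# Rows transcribed from the table above.
CATALOG = (
    ModelEntry("paraformer-zh", "asr", ("zh",)),
    ModelEntry("paraformer-zh-spk", "asr", ("zh",)),
    ModelEntry("paraformer-zh-online", "asr", ("zh",), streaming=True),
    ModelEntry("paraformer-en", "asr", ("en",)),
    ModelEntry("conformer-en", "asr", ("en",)),
    ModelEntry("ct-punc", "punc", ("zh", "en")),
    ModelEntry("fsmn-vad", "vad", ("zh", "en")),
    ModelEntry("fa-zh", "timestamp", ("zh",)),
)


def find_models(task, language, streaming=False):
    """Return the names of catalog entries matching task, language, and streaming mode."""
    return [
        m.name
        for m in CATALOG
        if m.task == task and language in m.languages and m.streaming == streaming
    ]


def transcribe(wav_path):
    """Hypothetical helper: Mandarin recognition with VAD and punctuation restoration.

    Assumes `pip install funasr` and that AutoModel accepts these short names.
    The import is lazy so the catalog helpers above work without funasr installed.
    """
    from funasr import AutoModel

    asr_name = find_models("asr", "zh")[0]  # first non-streaming Mandarin model
    model = AutoModel(model=asr_name, vad_model="fsmn-vad", punc_model="ct-punc")
    return model.generate(input=wav_path)
```

For example, `find_models("asr", "zh", streaming=True)` selects the single streaming Mandarin model in the table, while `find_models("asr", "en")` returns both English recognizers.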