Leveraging the Whisper checkpoints
Whisper checkpoints come in five configurations of varying model sizes (Tiny, Base, Small, Medium, and Large). The four smallest sizes are each available as an English-only checkpoint and a multilingual checkpoint. The largest checkpoints are multilingual only. All 11 pre-trained checkpoints are available on the Hugging Face Hub (https://huggingface.co/models?search=openai/whisper). The checkpoints are summarized in the following table with links to the models on the Hub:
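As a rough illustration of how the sizes and language variants map onto Hub repositories, here is a minimal sketch. It assumes the `openai/whisper-<size>` naming used on the Hub, with an `.en` suffix for the English-only variants. The helper names `checkpoint_id` and `load_checkpoint` are hypothetical and exist only for this example, and the versioned large checkpoints (such as `openai/whisper-large-v2`) are not covered.

```python
# Sizes described in the text. Only the four smallest have English-only variants.
ENGLISH_ONLY_SIZES = ("tiny", "base", "small", "medium")
MULTILINGUAL_SIZES = ENGLISH_ONLY_SIZES + ("large",)


def checkpoint_id(size: str, english_only: bool = False) -> str:
    """Return the Hub repository id for a Whisper checkpoint (hypothetical helper)."""
    size = size.lower()
    if size not in MULTILINGUAL_SIZES:
        raise ValueError(f"unknown Whisper size: {size!r}")
    if english_only:
        if size not in ENGLISH_ONLY_SIZES:
            raise ValueError("the large checkpoints are multilingual only")
        # Assumption: English-only checkpoints carry the ".en" suffix on the Hub.
        return f"openai/whisper-{size}.en"
    return f"openai/whisper-{size}"


def load_checkpoint(size: str, english_only: bool = False):
    """Download processor and model weights (needs `transformers` and network access)."""
    # Imported lazily so that checkpoint_id can be used without transformers installed.
    from transformers import WhisperForConditionalGeneration, WhisperProcessor

    repo_id = checkpoint_id(size, english_only)
    processor = WhisperProcessor.from_pretrained(repo_id)
    model = WhisperForConditionalGeneration.from_pretrained(repo_id)
    return processor, model
```

Calling `load_checkpoint("tiny")` would fetch the multilingual Tiny checkpoint, while `load_checkpoint("tiny", english_only=True)` would fetch its English-only counterpart. The sizes accepted by the helper are the ones listed in the table that follows.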
| Size | Layers | Width | Heads | Parameters | English-Only | Multilingual |
|------|--------|-------|-------|------------|--------------|--------------|
| Tiny | 4 | 384 | 6 | 39M | ✓ | ✓ |