# Speech-to-text
Whisper Finetuned Amharic
Apache-2.0
Amharic speech recognition model fine-tuned from openai/whisper-small, achieving a word error rate of 2.0538% on the evaluation set
Speech Recognition
Transformers

seyyaw
57
1
Wav2vec2 Large Xls R 300m Ru
Apache-2.0
A Russian automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-xls-r-300m on the common_voice_17_0 dataset, with a word error rate (WER) of 0.195.
Speech Recognition
Transformers

NLPVladimir
56
1
Whisper Small Sinhala
Apache-2.0
A Sinhala speech recognition model fine-tuned from OpenAI Whisper-small
Speech Recognition
Transformers Other

Lingalingeswaran
667
2
Whisper Hindi2Hinglish Swift
Apache-2.0
A Hindi-Hinglish mixed speech recognition model built on the Whisper architecture and designed for Indian accents and noisy environments
Speech Recognition
Transformers Supports Multiple Languages

Oriserve
496
6
Whisper Large V3 Turbo Arabic
Apache-2.0
A fine-tuned version of openai/whisper-large-v3-turbo on the common_voice_11_0 dataset, built with the transformers library and optimized for Arabic speech recognition.
Speech Recognition
Transformers

mboushaba
1,696
1
Distil Whisper Large V3
Apache-2.0
This model is a conversion from the GGML format of distil-whisper/distil-large-v3-ggml to Ratchet's custom format, primarily used for speech recognition tasks.
Speech Recognition
FL33TW00D-HF
164
4
Language Detector
Apache-2.0
A language detection model fine-tuned from openai/whisper-small, achieving 96.47% accuracy on the evaluation set
Speech Recognition
Transformers

fitlemon
18
1
Whisper Large V3 Ft Cv16 Mn
Apache-2.0
A speech recognition model fine-tuned from OpenAI Whisper Large V3 on the Common Voice 16.0 dataset
Speech Recognition
Transformers

sanchit-gandhi
34
1
Whisper Large V2 Spanish
Apache-2.0
A speech recognition model fine-tuned from OpenAI Whisper-large-v2 on the Common Voice 13.0 Spanish dataset
Speech Recognition
Transformers

Sandiago21
38
3
Mms 1b L1107
An automatic speech recognition model from Facebook's Massively Multilingual Speech project, supporting 1107 languages, based on Wav2Vec2 architecture with adapter technology for multilingual transcription.
Speech Recognition
Transformers Supports Multiple Languages

facebook
267
10
Faster Whisper Tiny
MIT
This is the CTranslate2 converted version of the OpenAI Whisper-tiny model, used for efficient speech recognition tasks.
Speech Recognition Supports Multiple Languages
guillaumekln
1,547
6
Whisper Large V2 Malayalam
Apache-2.0
A fine-tuned version of the OpenAI Whisper Large V2 model for Malayalam speech recognition, trained on the Common Voice 11.0 dataset
Speech Recognition
Transformers Other

DrishtiSharma
23
4
Wav2vec2 Large Xls R 300m Bn Colab
Apache-2.0
A Bengali speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on the common_voice_9_0 dataset.
Speech Recognition
Transformers

rhr99
18
0
Wav2vec2 Large Multilang Cv Ru
Apache-2.0
This model is a fine-tuned version of facebook/wav2vec2-large-xlsr-53 on the common_voice dataset, primarily designed for Russian speech recognition tasks.
Speech Recognition
Transformers

cutten
16
0
Wav2vec2 Large Xls R 300m Ta Colab
Apache-2.0
This model is a fine-tuned version of facebook/wav2vec2-xls-r-300m on the common_voice dataset, primarily used for Tamil speech recognition tasks.
Speech Recognition
Transformers

AAkhilesh
24
0
84rry Xlsr 53 Arabic
Apache-2.0
An Arabic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 on the Common Voice dataset
Speech Recognition
Transformers

84rry
24
0
Wav2vec2 Large Xls R 300m Turkish Colab Common Voice 8 6
Apache-2.0
This is a Turkish speech recognition model based on the wav2vec2 architecture, fine-tuned on the common_voice dataset
Speech Recognition
Transformers

husnu
21
0
Dansk Wav2vec21
Apache-2.0
This model is a Danish speech recognition model fine-tuned by Siyam/SKYLy on the common_voice dataset
Speech Recognition
Transformers

Siyam
32
0
Wav2vec2 Vorarlbergerisch
Apache-2.0
A German dialect speech recognition model fine-tuned from facebook/wav2vec2-base-960h, targeting the Vorarlberg regional dialect of Austria
Speech Recognition
Transformers

bkh6722
21
0
Wav2vec2 Base MIR ST500 ASR 109
Apache-2.0
An automatic speech recognition model fine-tuned from facebook/wav2vec2-base on the MIR_ST500 dataset
Speech Recognition
Transformers

gary109
15
0
Wav2vec2 Common Voice Accents Scotland
Apache-2.0
This model is a fine-tuned version of facebook/wav2vec2-xls-r-300m on the common_voice dataset, specializing in Scottish accent speech recognition.
Speech Recognition
Transformers

willcai
19
0
Wav2vec2 Common Voice Accents
Apache-2.0
A speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on the common_voice dataset, supporting recognition of multiple accents
Speech Recognition
Transformers

willcai
24
0
Wav2vec2 Xls R 100m Common Voice Tr Ft
Apache-2.0
An automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-xls-r-100m on the COMMON_VOICE - TR Turkish dataset.
Speech Recognition
Transformers Other

patrickvonplaten
16
0
Xls R Ab Spanish
An automatic speech recognition model fine-tuned from the XLS-R dummy model on an Abkhazian-language dataset
Speech Recognition
Transformers Other

joheras
18
0
Wav2vec2 Large Xlsr 129 Turkish Colab
A Turkish speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-129 on the Common Voice dataset
Speech Recognition
Transformers

patrickvonplaten
16
0
Wav2vec2 Large Xlsr Open Brazilian Portuguese V2
Apache-2.0
This is a Wav2vec2 model optimized for Brazilian Portuguese, trained on multiple open datasets for automatic speech recognition tasks.
Speech Recognition
Transformers Other

lgris
1,825
18
Wav2vec2 Large Xls R 300m Da Colab
Apache-2.0
A Danish speech recognition model fine-tuned from Alvenir/wav2vec2-base-da, suitable for Danish speech-to-text tasks
Speech Recognition
Transformers

vachonni
16
0
Wav2vec2 Large Xls R 300m Guarani Small
Apache-2.0
This model is a fine-tuned version of facebook/wav2vec2-xls-r-300m on the Common Voice dataset, supporting Guarani speech recognition.
Speech Recognition
Transformers

jhonparra18
20
0
Wav2vec2 Xls R 300m W2V2 XLSR 300M YAKUT SMALL
Apache-2.0
A speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on a Yakut (Sakha) language dataset
Speech Recognition
Transformers Other

emre
90
0
Tamil Wav2Vec Xls R 300m Tamil Colab
Apache-2.0
A Tamil speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on the Common Voice dataset.
Speech Recognition
Transformers Other

bharat-raghunathan
29
1
Wav2vec2 Xlsr Breton
Apache-2.0
A Breton automatic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-1b.
Speech Recognition
Transformers Other

sammy786
13
0
Wav2vec2 Large Xls R 300m Turkish Colab
Apache-2.0
A speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on the Common Voice Turkish dataset
Speech Recognition
Transformers

izzy-lazerson
34
0
Wav2vec2 Large Xls R 300m Turkish Colab
Apache-2.0
A speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on the Common Voice Turkish dataset
Speech Recognition
Transformers

krirk
17
0
Wav2vec2 Large Xls R 300m Turkish Colab
Apache-2.0
A speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on the Common Voice Turkish dataset
Speech Recognition
Transformers

patrickvonplaten
18
1
Xls R Hausa 40
Apache-2.0
A Hausa automatic speech recognition model based on the wav2vec2-xls-r-300m architecture, fine-tuned on the Common Voice 8.0 Hausa dataset
Speech Recognition
Transformers Other

Mofe
22
1
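
Several entries in this listing quote a word error rate (WER). As a rough illustration of what that figure measures, the sketch below computes WER as the word-level edit distance between a reference transcript and a model's output, divided by the number of reference words. The function name `word_error_rate` and the sample sentences are hypothetical, and the listed models may have been scored with different text normalization, so this is not a reproduction of any reported number.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Hypothetical transcripts, for illustration only.
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```

Under this definition, a WER of 0.195 corresponds to roughly one word-level error for every five reference words. Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference.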