# Multi-scenario ASR
Parakeet Tdt 0.6b V2 Mlx
This is an automatic speech recognition model that has been converted to a version suitable for MLX and can perform inference quickly.
Speech Recognition
Safetensors English
P
senstella
183
6
Whisper Base Vi
MIT
A speech recognition model fine-tuned on 100 hours of Vietnamese speech data based on openai/whisper-base model, improving Vietnamese transcription accuracy
Speech Recognition
Transformers Other

W
namphungdn134
215
3
Whisper Small El
Apache-2.0
This is an automatic speech recognition (ASR) model fine-tuned from the openai/whisper-small model for Greek speech recognition tasks, trained on 3,620 Greek samples from the Mozilla Common Voice 17.0 dataset.
Speech Recognition
Transformers Other

W
mozilla-ai
94
1
Distil Whisper Large V3 Int8 Ov
MIT
This is a converted and quantized speech recognition model. It is converted from the distil-large-v3 model to the OpenVINO™ IR format, and the weights are compressed to INT8 to improve performance and compatibility.
Speech Recognition
Transformers English

D
OpenVINO
2,103
3
Whisper Small Turkish V2
Apache-2.0
A speech recognition model fine-tuned on the Turkish Common Voice dataset based on OpenAI Whisper-small
Speech Recognition
Transformers Other

W
atakanince
61
2
Whisper Large Et
Estonian speech recognition model fine-tuned from OpenAI Whisper-large-v2, developed by Tallinn University of Technology, trained on approximately 1,200 hours of data
Speech Recognition
Transformers Other

W
TalTechNLP
245
5
Whisper Large V2 Hindi 2.5k Steps
Apache-2.0
This is a Hindi automatic speech recognition (ASR) model fine-tuned based on OpenAI Whisper Large V2, trained on the Common Voice 11.0 dataset with a word error rate (WER) of 10.05%.
Speech Recognition
Transformers Other

W
DrishtiSharma
52
2
Whisper Large V2 Vietnamese
Apache-2.0
This model is an automatic speech recognition (ASR) model based on OpenAI's Whisper Small architecture, fine-tuned on the Common Voice 11.0 Vietnamese dataset
Speech Recognition
Transformers Other

W
DrishtiSharma
25
2
Whisper Small Sk Cv11
Apache-2.0
Slovak speech recognition model fine-tuned on OpenAI Whisper-small, trained on the Common Voice 11.0 Slovak dataset
Speech Recognition
Transformers Other

W
mikr
79
2
Whisper Medium Pt
Apache-2.0
Portuguese-optimized Whisper Medium speech recognition model achieving 6.579 word error rate (WER) on Common Voice 11 dataset
Speech Recognition
Transformers Other

W
jlondonobo
85
15
Exp W2v2t It Xlsr 53 S387
Apache-2.0
An Italian automatic speech recognition model fine-tuned based on the facebook/wav2vec2-large-xlsr-53 model, trained using the Common Voice 7.0 Italian dataset.
Speech Recognition
Transformers Other

E
jonatasgrosman
18
0
Exp W2v2t Th Wav2vec2 S664
Apache-2.0
A Thai speech recognition model fine-tuned based on facebook/wav2vec2-large-lv60, trained using the Common Voice 7.0 dataset
Speech Recognition
Transformers Other

E
jonatasgrosman
14
0
Wav2vec2 Large Xlsr 53 German Cv9
Apache-2.0
This is an automatic speech recognition (ASR) model fine-tuned on the German Common Voice 9.0 dataset, based on Facebook's wav2vec2-large-xlsr-53 model.
Speech Recognition
Transformers German

W
oliverguhr
98
1
Wav2vec NCKH 2022
Vietnamese automatic speech recognition model based on Wav2vec2 architecture, supporting audio-to-text conversion
Speech Recognition
Transformers Other

W
hoangbinhmta99
29
0
Xlsr Indonesia
Apache-2.0
Indonesian automatic speech recognition (ASR) model fine-tuned on the XLSR architecture, trained on the Common Voice Indonesian dataset
Speech Recognition
Transformers Other

X
acul3
23
0
Xls R 300m Et
An Estonian automatic speech recognition model fine-tuned based on facebook/wav2vec2-xls-r-300m, trained with approximately 800 hours of diverse data
Speech Recognition
Transformers Other

X
TalTechNLP
58
1
Wav2vec2 Large Xls R 300m Tr
Apache-2.0
This model is an automatic speech recognition (ASR) model fine-tuned on the Turkish Common Voice 8.0 dataset based on facebook/wav2vec2-xls-r-300m, achieving a test WER of 28.69%.
Speech Recognition
Transformers Other

W
emre
25
0
Xls R 300m Te
Apache-2.0
Automatic speech recognition model fine-tuned on OpenSLR SLR66 Telugu dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition
Transformers Other

X
chmanoj
25
0
Xls Asr Vi 40h
Apache-2.0
This model is a speech recognition model fine-tuned on the Common Voice 7.0 Vietnamese dataset and private datasets based on facebook/wav2vec2-xls-r-300m.
Speech Recognition
Transformers Other

X
geninhu
14
0
Xlsr 53 Wav2vec Hi
Apache-2.0
A Hindi speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, trained on multilingual and code-switching ASR challenge data for low-resource Indian languages
Speech Recognition
Transformers Other

X
harshit345
38
0
Newnew
Apache-2.0
An automatic speech recognition (ASR) model fine-tuned on Dutch (nl) dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition
Transformers Other

N
Iskaj
39
0
Wav2vec2 Large Xls R 300m Hindi Kaggle
Hindi speech recognition model trained on the common_voice dataset
Speech Recognition
Transformers Other

W
Saitomar
27
0
Featured Recommended AI Models