# High-precision WER
Wav2vec2 Large Xlrs Korean V5
Apache-2.0
This model is a Korean automatic speech recognition model fine-tuned on the zeroth_korean dataset based on facebook/wav2vec2-xls-r-300m, with a word error rate of 0.2433.
Speech Recognition
Transformers

W
student-47
285
1
Wav2vec2 Large Xlsr 53 Icelandic Ep30 967h
An acoustic model fine-tuned specifically for Icelandic automatic speech recognition tasks, trained on 967 hours of Icelandic data
Speech Recognition
Transformers Other

W
language-and-voice-lab
2,153
2
Stt Ru Fastconformer Hybrid Large Pc
This is a FastConformer hybrid model for Russian automatic speech recognition, combining Transducer and CTC decoders with approximately 115 million parameters.
Speech Recognition Other
S
nvidia
6,513
10
Stt De Fastconformer Hybrid Large Pc
This is a German automatic speech recognition model based on the FastConformer architecture, employing a hybrid training approach with Transformer and CTC, with a parameter size of approximately 115M.
Speech Recognition German
S
nvidia
1,017
4
Wav2vec2 Large Xlsr 53 Spanish Ep5 944h
An acoustic model for Spanish automatic speech recognition, fine-tuned for 5 epochs based on facebook/wav2vec2-large-xlsr-53 using approximately 944 hours of Spanish data.
Speech Recognition
Transformers Spanish

W
carlosdanielhernandezmena
111
3
Wav2vec2 Large Vi Vlsp2020
Vietnamese automatic speech recognition model based on wav2vec2 architecture, pre-trained with 13,000 hours of unlabeled YouTube audio and fine-tuned on 250 hours of labeled data
Speech Recognition
Transformers Other

W
nguyenvulebinh
385
4
Stt Ru Conformer Ctc Large
This is a large Conformer-CTC model for Russian automatic speech recognition, trained on approximately 1,636 hours of Russian speech data with about 120 million parameters.
Speech Recognition Other
S
nvidia
452
5
Stt Es Conformer Ctc Large
This is a large Conformer-CTC model for Spanish automatic speech recognition (ASR), trained and released by NVIDIA.
Speech Recognition Spanish
S
nvidia
59
2
Stt Fr Conformer Transducer Large
This is a large-scale Conformer-Transducer model for French automatic speech recognition, with approximately 120 million parameters, trained on over 1,500 hours of French speech data.
Speech Recognition French
S
nvidia
31
10
Stt Fr Conformer Ctc Large
This is a large French automatic speech recognition (ASR) model based on the Conformer architecture, trained using CTC loss function on over 1,500 hours of French speech data.
Speech Recognition French
S
nvidia
361
6
Ai Light Dance Stepmania Ft Wav2vec2 Large Xlsr 53
Apache-2.0
This model is an automatic speech recognition model fine-tuned on the GARY109/AI_LIGHT_DANCE - ONSET-STEPMANIA2 dataset based on facebook/wav2vec2-large-xlsr-53
Speech Recognition
Transformers

A
gary109
40
0
Wav2vec2 Large Multilang Cv Ru
Apache-2.0
This model is a fine-tuned version of facebook/wav2vec2-large-xlsr-53 on the common_voice dataset, primarily designed for Russian speech recognition tasks.
Speech Recognition
Transformers

W
cutten
16
0
Assignment1 Joane
MIT
A speech-to-text (S2T) model for automatic speech recognition (ASR)
Speech Recognition
Transformers English

A
Classroom-workshop
22
0
Assignment1 Jack
MIT
A speech-to-text (S2T) model for automatic speech recognition (ASR), based on a sequence-to-sequence transformer architecture
Speech Recognition
Transformers English

A
Classroom-workshop
24
0
Assignment1 Omar
Apache-2.0
Wav2Vec2 is a self-supervised learning-based speech recognition model, pre-trained and fine-tuned on 960 hours of LibriSpeech audio data, supporting English speech transcription.
Speech Recognition
Transformers English

A
Classroom-workshop
28
0
Wav2vec2 Large Xls R 300m Singlish Colab
Apache-2.0
A speech recognition model fine-tuned on the Singapore English (li_singlish) dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition
Transformers

W
RuiqianLi
22
1
Ai Light Dance Singing Ft Wav2vec2 Large Lv60 V2
Apache-2.0
This model is an automatic speech recognition model fine-tuned on the ONSET-SINGING dataset based on wav2vec2-large-lv60, focusing on singing voice recognition tasks.
Speech Recognition
Transformers

A
gary109
16
1
Dansk Wav2vec21
Apache-2.0
This model is a Danish speech recognition model fine-tuned by Siyam/SKYLy on the common_voice dataset
Speech Recognition
Transformers

D
Siyam
32
0
English Filipino Wav2vec2 L Xls R Test 02
Apache-2.0
This is a speech recognition model fine-tuned on Filipino speech datasets based on the wav2vec2-large-xlsr-53-english model, supporting English and Filipino speech-to-text tasks.
Speech Recognition
Transformers

E
Khalsuu
21
0
Wav2vec2 Common Voice Lithuanian
Apache-2.0
This model is a fine-tuned version of facebook/wav2vec2-large-xlsr-53 on the COMMON_VOICE - LT dataset for Lithuanian speech recognition.
Speech Recognition
Transformers Other

W
birgermoell
17
0
20220413 210552
Apache-2.0
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-xls-r-300m on the common_voice dataset
Speech Recognition
Transformers

2
lilitket
18
0
Aradia Ctc Distilhubert Ft
Apache-2.0
An automatic speech recognition (ASR) model fine-tuned on Arabic speech datasets based on distilhubert
Speech Recognition
Transformers

A
abdusah
16
0
Wav2vec2 Large Xls R 300m Irish Colab Test
Apache-2.0
This is a speech recognition model fine-tuned on the Common Voice Irish dataset based on the facebook/wav2vec2-xls-r-300m model, primarily used for automatic speech recognition tasks in Irish.
Speech Recognition
Transformers

W
jfealko
24
0
Wav2vec2 Xls R 1b Portuguese CORAA 3
Apache-2.0
Portuguese automatic speech recognition model fine-tuned on the CORAA dataset based on facebook/wav2vec2-xls-r-1b
Speech Recognition
Transformers Other

W
lgris
31
0
Wav2vec2 Large Xls R 300m Odia Cv8
Apache-2.0
An automatic speech recognition model fine-tuned on the Odia (OR) Common Voice dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition
Transformers Other

W
infinitejoy
16
0
Wav2vec2 Large Xls R 300m Ur
Apache-2.0
Urdu speech recognition model based on the wav2vec2-large-xls-r-300m architecture, fine-tuned on the Common Voice dataset
Speech Recognition
Transformers

W
anuragshas
20
0
S2t Small Librispeech Asr
MIT
A speech-to-text (S2T) model for automatic speech recognition (ASR), based on a sequence-to-sequence transformer architecture
Speech Recognition
Transformers English

S
facebook
10.92k
27
Wav2vec2 Xls R 1b German
Apache-2.0
This is a German automatic speech recognition model based on the XLS-R 1B architecture, fine-tuned on multiple German speech datasets including Common Voice 8.0
Speech Recognition
Transformers German

W
jonatasgrosman
105
3
Wav2vec2 Large Xlsr 53 Ir
Apache-2.0
An Irish Gaelic automatic speech recognition model fine-tuned on wav2vec2-large-xlsr-53, trained on the Common Voice 7.0 dataset
Speech Recognition
Transformers

W
jcmc
24
0
Wav2vec2 Xls R 1b Italian
Apache-2.0
This is an Italian automatic speech recognition model based on the XLS-R 1B architecture, fine-tuned on multiple Italian datasets
Speech Recognition
Transformers Other

W
jonatasgrosman
2,703
1
Wav2vec2 Speechdat
Apache-2.0
This model is a Swedish automatic speech recognition model fine-tuned on the COMMON_VOICE - SV-SE dataset based on facebook/wav2vec2-large-xlsr-53.
Speech Recognition
Transformers

W
birgermoell
29
0
Wav2vec2 Xlsr Basaa
Apache-2.0
This model is an automatic speech recognition model fine-tuned on the Common Voice 8 Basaa dataset based on facebook/wav2vec2-xls-r-1b.
Speech Recognition
Transformers Other

W
sammy786
20
0
Wav2vec2 Base Turkish Cv7
Apache-2.0
Turkish automatic speech recognition model based on wav2vec2 architecture, fine-tuned on the Common Voice 7.0 Turkish dataset
Speech Recognition
Transformers Other

W
cahya
21
0
Wav2vec2 Xls R 1b Hi Cv8
Apache-2.0
This is an automatic speech recognition model fine-tuned on the Common Voice 8.0 Hindi dataset based on the facebook/wav2vec2-xls-r-1b model, supporting Hindi speech-to-text tasks.
Speech Recognition
Transformers Other

W
anuragshas
16
0
Wav2vec2 Xls R 1b Russian
Apache-2.0
Russian speech recognition model fine-tuned based on XLS-R 1B architecture, trained on datasets like Common Voice 8.0
Speech Recognition
Transformers Other

W
jonatasgrosman
765
14
Wav2vec2 Large Xls R 300m Galician
Apache-2.0
This is an automatic speech recognition model fine-tuned on Galician speech datasets based on facebook/wav2vec2-xls-r-300m.
Speech Recognition
Transformers Other

W
infinitejoy
31
0
Wav2vec2 Xlsr Czech
Apache-2.0
This model is a Czech automatic speech recognition model fine-tuned on the MOZILLA-FOUNDATION/COMMON_VOICE_8_0 - cs dataset based on facebook/wav2vec2-xls-r-1b.
Speech Recognition
Transformers Other

W
sammy786
21
0
Wav2vec2 Xls R 1b Portuguese
Apache-2.0
This is a Portuguese automatic speech recognition model based on the XLS-R 1B architecture, fine-tuned on multiple Portuguese speech datasets.
Speech Recognition
Transformers Other

W
jonatasgrosman
648.50k
12
S2t Large Librispeech Asr
MIT
An end-to-end sequence-to-sequence transformer model for automatic speech recognition (ASR), trained on the LibriSpeech dataset
Speech Recognition
Transformers English

S
facebook
422
10
Wav2vec2 Xl 960h Dementiabank
Apache-2.0
This model is a speech recognition model fine-tuned on the DementiaBank dataset based on facebook/wav2vec2-large-960h, primarily used for speech-to-text tasks.
Speech Recognition
Transformers

W
shields
20
0
- 1
- 2
Featured Recommended AI Models