Model Selection

Multi-scenario ASR

# Multi-scenario ASR

Parakeet Tdt 0.6b V2 Mlx

This is an automatic speech recognition model that has been converted to a version suitable for MLX and can perform inference quickly.

Speech Recognition

Safetensors English

Whisper Base Vi

A speech recognition model fine-tuned on 100 hours of Vietnamese speech data based on openai/whisper-base model, improving Vietnamese transcription accuracy

Speech Recognition

Transformers Other

Whisper Small El

This is an automatic speech recognition (ASR) model fine-tuned from the openai/whisper-small model for Greek speech recognition tasks, trained on 3,620 Greek samples from the Mozilla Common Voice 17.0 dataset.

Speech Recognition

Transformers Other

Distil Whisper Large V3 Int8 Ov

This is a converted and quantized speech recognition model. It is converted from the distil-large-v3 model to the OpenVINO™ IR format, and the weights are compressed to INT8 to improve performance and compatibility.

Speech Recognition

Transformers English

Whisper Small Turkish V2

A speech recognition model fine-tuned on the Turkish Common Voice dataset based on OpenAI Whisper-small

Speech Recognition

Transformers Other

Whisper Large Et

Estonian speech recognition model fine-tuned from OpenAI Whisper-large-v2, developed by Tallinn University of Technology, trained on approximately 1,200 hours of data

Speech Recognition

Transformers Other

Whisper Large V2 Hindi 2.5k Steps

This is a Hindi automatic speech recognition (ASR) model fine-tuned based on OpenAI Whisper Large V2, trained on the Common Voice 11.0 dataset with a word error rate (WER) of 10.05%.

Speech Recognition

Transformers Other

Whisper Large V2 Vietnamese

This model is an automatic speech recognition (ASR) model based on OpenAI's Whisper Small architecture, fine-tuned on the Common Voice 11.0 Vietnamese dataset

Speech Recognition

Transformers Other

Whisper Small Sk Cv11

Slovak speech recognition model fine-tuned on OpenAI Whisper-small, trained on the Common Voice 11.0 Slovak dataset

Speech Recognition

Transformers Other

Whisper Medium Pt

Portuguese-optimized Whisper Medium speech recognition model achieving 6.579 word error rate (WER) on Common Voice 11 dataset

Speech Recognition

Transformers Other

Exp W2v2t It Xlsr 53 S387

An Italian automatic speech recognition model fine-tuned based on the facebook/wav2vec2-large-xlsr-53 model, trained using the Common Voice 7.0 Italian dataset.

Speech Recognition

Transformers Other

Exp W2v2t Th Wav2vec2 S664

A Thai speech recognition model fine-tuned based on facebook/wav2vec2-large-lv60, trained using the Common Voice 7.0 dataset

Speech Recognition

Transformers Other

Wav2vec2 Large Xlsr 53 German Cv9

This is an automatic speech recognition (ASR) model fine-tuned on the German Common Voice 9.0 dataset, based on Facebook's wav2vec2-large-xlsr-53 model.

Speech Recognition

Transformers German

Wav2vec NCKH 2022

Vietnamese automatic speech recognition model based on Wav2vec2 architecture, supporting audio-to-text conversion

Speech Recognition

Transformers Other

Indonesian automatic speech recognition (ASR) model fine-tuned on the XLSR architecture, trained on the Common Voice Indonesian dataset

Speech Recognition

Transformers Other

An Estonian automatic speech recognition model fine-tuned based on facebook/wav2vec2-xls-r-300m, trained with approximately 800 hours of diverse data

Speech Recognition

Transformers Other

Wav2vec2 Large Xls R 300m Tr

This model is an automatic speech recognition (ASR) model fine-tuned on the Turkish Common Voice 8.0 dataset based on facebook/wav2vec2-xls-r-300m, achieving a test WER of 28.69%.

Speech Recognition

Transformers Other

Automatic speech recognition model fine-tuned on OpenSLR SLR66 Telugu dataset based on facebook/wav2vec2-xls-r-300m

Speech Recognition

Transformers Other

This model is a speech recognition model fine-tuned on the Common Voice 7.0 Vietnamese dataset and private datasets based on facebook/wav2vec2-xls-r-300m.

Speech Recognition

Transformers Other

Xlsr 53 Wav2vec Hi

A Hindi speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, trained on multilingual and code-switching ASR challenge data for low-resource Indian languages

Speech Recognition

Transformers Other

An automatic speech recognition (ASR) model fine-tuned on Dutch (nl) dataset based on facebook/wav2vec2-xls-r-300m

Speech Recognition

Transformers Other

Wav2vec2 Large Xls R 300m Hindi Kaggle

Hindi speech recognition model trained on the common_voice dataset

Speech Recognition

Transformers Other

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase