Wav2vec2 Xls R 300m Cs 250

Developed by comodoro
This is an automatic speech recognition model for Czech, fine-tuned from facebook/wav2vec2-xls-r-300m on Czech datasets. It expects audio input sampled at 16 kHz.
Downloads 248.66k
Release Time: 3/2/2022

Model Overview

This model is designed for Czech automatic speech recognition. It was fine-tuned on datasets such as Common Voice 8.0 and can be used on its own or together with a language model.
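A minimal sketch of direct use (no language model) with the Hugging Face transformers library is shown here. The Hub identifier `comodoro/wav2vec2-xls-r-300m-cs-250` is an assumption inferred from the model name and developer above, not something this page states, and the `transcribe` helper is hypothetical. The function expects a mono waveform already loaded as a 1-D sequence of floats.

```python
# Assumed Hub id, inferred from the model name and developer; verify before use.
MODEL_ID = "comodoro/wav2vec2-xls-r-300m-cs-250"
TARGET_SAMPLE_RATE = 16_000  # the model expects 16 kHz input


def transcribe(waveform, sample_rate=TARGET_SAMPLE_RATE, model_id=MODEL_ID):
    """Transcribe a mono 1-D float waveform without a language model."""
    if sample_rate != TARGET_SAMPLE_RATE:
        raise ValueError(
            f"expected {TARGET_SAMPLE_RATE} Hz audio, got {sample_rate} Hz"
        )

    # Imported lazily so that defining this helper does not require the
    # heavy dependencies to be installed.
    import torch
    from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

    processor = Wav2Vec2Processor.from_pretrained(model_id)
    model = Wav2Vec2ForCTC.from_pretrained(model_id)
    model.eval()

    inputs = processor(waveform, sampling_rate=sample_rate, return_tensors="pt")
    with torch.no_grad():
        logits = model(inputs.input_values).logits  # (batch, frames, vocab)

    # Greedy decoding: most likely token per frame, then CTC collapse.
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]
```

The helper reloads the model on every call to stay short; in practice the processor and model would likely be loaded once and reused.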

Model Features

Multi-dataset training: Trained on multiple Czech datasets including Common Voice 8.0, OVM, PSCR, and Vystadial2016
High performance: Achieves 7.3% word error rate and 2.1% character error rate on the Common Voice 8.0 test set
Direct usage: Capable of speech recognition without requiring a language model
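Recognition without a language model typically means greedy (best-path) CTC decoding: take the highest-scoring token in each audio frame, merge consecutive repeats, and drop the blank token. The sketch below illustrates that idea on a toy example. The vocabulary, the blank id of 0, and the `ctc_greedy_decode` and `one_hot` helpers are all hypothetical and do not reflect the model's real vocabulary.

```python
def ctc_greedy_decode(frame_scores, vocab, blank_id=0):
    """Turn per-frame token scores into text with greedy CTC decoding."""
    # 1. Pick the highest-scoring token id for every frame.
    best_ids = [
        max(range(len(scores)), key=scores.__getitem__)
        for scores in frame_scores
    ]

    # 2. Skip a token if it repeats the previous frame; 3. skip blanks.
    decoded = []
    previous = None
    for token_id in best_ids:
        if token_id != previous and token_id != blank_id:
            decoded.append(vocab[token_id])
        previous = token_id
    return "".join(decoded)


def one_hot(index, size):
    """Build a score vector that puts all weight on one token (toy data)."""
    return [1.0 if i == index else 0.0 for i in range(size)]


# Toy vocabulary (hypothetical); id 0 plays the role of the CTC blank.
VOCAB = ["<blank>", "a", "h", "o", "j"]

# Frames spelling "a a <blank> h o o j".
frames = [one_hot(i, len(VOCAB)) for i in [1, 1, 0, 2, 3, 3, 4]]
text = ctc_greedy_decode(frames, VOCAB)
```

A blank between two identical tokens keeps them separate, which is how doubled letters survive the merge step.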

Model Capabilities

Czech speech recognition
Processing of audio sampled at 16 kHz
Direct inference without language model
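Because the model expects 16 kHz input, it can help to check a recording's header before running inference. The sketch below uses only the Python standard library to read the sampling rate and channel count from WAV data. The mono requirement is an assumption based on how wav2vec2-style models usually take input, and the helper names are hypothetical.

```python
import io
import wave

TARGET_SAMPLE_RATE = 16_000


def wav_info(wav_bytes):
    """Return (sample_rate, channels) read from the header of WAV data."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav_file:
        return wav_file.getframerate(), wav_file.getnchannels()


def is_16khz_mono(wav_bytes):
    """True if the audio is already in the shape the model expects."""
    sample_rate, channels = wav_info(wav_bytes)
    # Mono is an assumption here; the page only states the 16 kHz rate.
    return sample_rate == TARGET_SAMPLE_RATE and channels == 1


def make_silent_wav(sample_rate, channels=1, n_frames=160):
    """Create a short silent 16-bit WAV in memory, for demonstration only."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav_file:
        wav_file.setnchannels(channels)
        wav_file.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(b"\x00\x00" * n_frames * channels)
    return buffer.getvalue()


ok_clip = make_silent_wav(16_000)
other_clip = make_silent_wav(44_100, channels=2)
```

Audio that fails the check would need to be resampled (and downmixed) with an audio library before transcription.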

Use Cases

Speech transcription
Speech to text: Convert Czech speech content into text. Word error rate 7.3%, character error rate 2.1%.
Speech analysis
Speech content analysis: Analyze Czech speech content.
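Word error rate and character error rate are conventionally defined as the edit distance between a reference transcript and the model output, divided by the reference length in words or characters. The sketch below implements that standard definition. This page does not say how its 7.3% and 2.1% figures were computed (for example, what text normalization was applied), so the code illustrates the metric and does not reproduce the evaluation. The example sentences are made up.

```python
def edit_distance(reference, hypothesis):
    """Levenshtein distance between two sequences."""
    previous_row = list(range(len(hypothesis) + 1))
    for i, ref_item in enumerate(reference, start=1):
        current_row = [i]
        for j, hyp_item in enumerate(hypothesis, start=1):
            cost = 0 if ref_item == hyp_item else 1
            current_row.append(min(
                previous_row[j] + 1,         # deletion
                current_row[j - 1] + 1,      # insertion
                previous_row[j - 1] + cost,  # substitution or match
            ))
        previous_row = current_row
    return previous_row[-1]


def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the number of reference words."""
    ref_words = reference.split()
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def character_error_rate(reference, hypothesis):
    """Character-level edit distance divided by the reference length."""
    return edit_distance(reference, hypothesis) / len(reference)


# Made-up example: one word differs by a missing diacritic.
reference = "dobrý den jak se máte"
hypothesis = "dobrý den jak se mate"
wer = word_error_rate(reference, hypothesis)
cer = character_error_rate(reference, hypothesis)
```

Both functions assume a non-empty reference; an empty one would divide by zero.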