W

Wav2vec2 Xls R 300m Cs Cv8

Developed by comodoro
A speech recognition model fine-tuned on the Common Voice 8.0 Czech dataset based on facebook/wav2vec2-xls-r-300m
Downloads 13
Release Time : 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model optimized for Czech, based on the Wav2Vec2 architecture and fine-tuned on the Common Voice 8.0 dataset, supporting 16kHz sampled speech input.

Model Features

High-performance Czech recognition
Achieves 10.3% WER and 2.6% CER on the Common Voice 8.0 test set
Based on XLSR architecture
Uses facebook's wav2vec2-xls-r-300m as the base model, with strong cross-lingual representation capabilities
No language model required
Can be used directly without additional language model support

Model Capabilities

Czech speech recognition
16kHz audio processing
End-to-end speech-to-text

Use Cases

Speech transcription
Voice notes to text
Convert Czech voice notes into editable text
Highly accurate text output
Voice assistant
Speech recognition component for Czech voice assistant applications
Low-latency speech understanding
Speech analysis
Speech content analysis
Analyze Czech speech content and extract key information
Supports subsequent natural language processing tasks
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase