W

Wav2vec2 Large Xlsr Polish

Developed by mbien
A speech recognition model fine-tuned on the Common Voice Polish dataset based on facebook/wav2vec2-large-xlsr-53, achieving a test set word error rate of 23.01%
Downloads 40
Release Time : 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model optimized for Polish, capable of converting Polish speech into text.

Model Features

High-accuracy Polish recognition
Achieves a word error rate of 23.01% on the Common Voice Polish test set
No language model required
Can be used directly without additional language model support
Based on XLSR architecture
Uses facebook's wav2vec2-large-xlsr-53 as the base model, with powerful speech feature extraction capabilities

Model Capabilities

Polish speech recognition
Audio to text conversion
16kHz audio processing

Use Cases

Speech transcription
Polish speech transcription
Convert Polish speech content into editable text format
Word error rate 23.01%
Voice assistants
Polish voice command recognition
Used for building Polish voice assistants or voice control systems
Featured Recommended AI Models
ยฉ 2025AIbase