W

Wav2vec2 Large Xlsr 53 French

Developed by jonatasgrosman
This is a French speech recognition model fine-tuned from the XLSR-53 large model, trained on the Common Voice dataset, supporting high-accuracy French speech-to-text conversion.
Downloads 47.83k
Release Time : 3/2/2022

Model Overview

This model is an automatic speech recognition (ASR) system optimized for French, fine-tuned based on Facebook's wav2vec2-large-xlsr-53 architecture, capable of converting French speech into text.

Model Features

High-precision French recognition
Achieves a word error rate (WER) of 17.65% and a character error rate (CER) of 4.89% on the Common Voice French test set.
Language model enhancement support
When combined with a language model, WER can be reduced to 13.59% and CER to 3.91%, significantly improving recognition accuracy.
16kHz sampling rate support
Optimized for 16kHz sampled speech input, suitable for most speech application scenarios.
Open-source license
Licensed under Apache-2.0, allowing for commercial and research use.

Model Capabilities

French speech recognition
Real-time speech-to-text
Batch audio processing

Use Cases

Speech transcription
French speech-to-text
Convert French speech content into editable text format
Achieves over 83% accuracy on standard test sets.
Voice assistants
French voice command recognition
Used for voice command recognition in French voice assistants or control systems
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase