2

20220415 210530

Developed by lilitket
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-xls-r-2b on the common_voice dataset
Downloads 20
Release Time : 4/15/2022

Model Overview

This is a fine-tuned model for speech recognition tasks, based on the wav2vec2-xls-r-2b architecture and trained on the common_voice dataset

Model Features

Large-scale Pre-trained Model Fine-tuning
Fine-tuned from the 2-billion-parameter wav2vec2-xls-r-2b model
Relatively Low Word Error Rate
Achieves a word error rate of 0.3881 on the evaluation set
Efficient Training
Optimized training process using techniques like gradient accumulation

Model Capabilities

Speech-to-Text
Automatic Speech Recognition

Use Cases

Speech Transcription
Speech-to-Text Service
Convert speech content into text transcripts
Word error rate 0.3881
Assistive Technology
Real-time Caption Generation
Generate real-time captions for video or live streaming content
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase