2

20220413 210552

Developed by lilitket
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-xls-r-300m on the common_voice dataset
Downloads 18
Release Time : 4/13/2022

Model Overview

This is a fine-tuned model for speech recognition, based on the wav2vec2-xls-r-300m architecture, trained on the common_voice dataset.

Model Features

Efficient Fine-tuning
Fine-tuned based on the powerful wav2vec2-xls-r-300m base model
Low Word Error Rate
Achieved a word error rate (WER) of 1.0006 on the evaluation set
Optimized Training
Utilized linear learning rate scheduling and 2000-step warm-up training

Model Capabilities

Speech-to-Text
Automatic Speech Recognition

Use Cases

Speech Transcription
Speech to Text
Convert speech content into text transcripts
Word error rate 1.0006
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase