W

Wav2vec2 Xls R 300m Demo Colab

Developed by Mahalakshmi
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-xls-r-300m on the common_voice dataset
Downloads 16
Release Time : 3/2/2022

Model Overview

A fine-tuned model for speech recognition tasks, based on the wav2vec2-xls-r-300m architecture, trained on the common_voice dataset

Model Features

Efficient Fine-tuning
Fine-tuned based on the pre-trained wav2vec2-xls-r-300m model, achieving good results on the common_voice dataset
Excellent Performance
Achieved a word error rate of 1.0377 on the evaluation set, demonstrating outstanding performance
Fast Inference
Can process 25.239 samples per second, with relatively fast inference speed

Model Capabilities

Speech Recognition
Audio to Text

Use Cases

Speech Transcription
Speech to Text
Convert speech content into text records
Word error rate 1.0377
Voice Assistants
Voice Command Recognition
Recognize user voice commands
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase