Xlsr Wav2vec2 1

Developed by chrisvinsen
A speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, a multilingually pretrained base model, for speech-to-text tasks
Downloads 20
Release Time: 5/24/2022

Model Overview

This model is a fine-tuned version of wav2vec2-large-xlsr-53 for automatic speech recognition: it converts spoken audio into text.

Model Features

Multilingual Support
Based on the XLSR architecture, so it may support speech recognition in multiple languages
Efficient Training
Uses mixed-precision training and gradient accumulation techniques to improve training efficiency
Continuous Optimization
After 30 training epochs, the word error rate (WER) fell from 1.0 to 0.4412
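
Gradient accumulation, mentioned under Efficient Training, can be illustrated with a small sketch. It is not the author's training code: the 1-D model, the data, and the function names are all made up for illustration, and mixed precision is not shown. The sketch only checks that averaging the gradients of equal-sized micro-batches gives the same gradient as one large batch, which is why the technique lets a large effective batch fit in limited memory.

```python
def grad_mse(w, xs, ys):
    """Gradient of the mean squared error for the toy 1-D model y = w * x."""
    return sum(2.0 * (w * x - y) * x for x, y in zip(xs, ys)) / len(xs)


def accumulated_grad(w, xs, ys, micro_batch_size):
    """Average micro-batch gradients, as done before a single optimizer step."""
    starts = range(0, len(xs), micro_batch_size)
    steps = len(starts)
    total = 0.0
    for s in starts:
        # Scale each micro-batch gradient by 1/steps so the sum is an average.
        total += grad_mse(w, xs[s:s + micro_batch_size], ys[s:s + micro_batch_size]) / steps
    return total


# Hypothetical toy data (true relation: y = 2 * x).
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]

full = grad_mse(0.5, xs, ys)
accumulated = accumulated_grad(0.5, xs, ys, micro_batch_size=2)
print(full, accumulated)
```

The equality holds exactly only when all micro-batches have the same size; with a ragged final micro-batch the average would need to be weighted by batch size.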

Model Capabilities

Speech-to-text
Multilingual speech recognition
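
A minimal sketch of how speech-to-text with a model like this might look using the Hugging Face transformers pipeline. The model ID below is an assumption inferred from the author and model name on this page, not something the page states, so verify it on the Hub before use. The function name transcribe is likewise just an illustrative helper.

```python
# Assumed Hub ID, inferred from the author and model name; verify before use.
MODEL_ID = "chrisvinsen/xlsr-wav2vec2-1"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Return the transcript of one audio file."""
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    # The ASR pipeline bundles feature extraction, the model, and CTC decoding.
    asr = pipeline("automatic-speech-recognition", model=model_id)
    result = asr(audio_path)
    return result["text"]
```

Calling transcribe("meeting.wav"), where the path is a placeholder, would download the checkpoint on first use and return the recognized text. Wav2Vec2 models expect 16 kHz audio; when given a file path the pipeline decodes and resamples it, which requires ffmpeg to be installed.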

Use Cases

Speech Transcription
Meeting Minutes
Automatically convert meeting recordings into text transcripts
Word error rate: 0.4412
Voice Assistant
Serve as the speech recognition component for voice assistants
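
To put the reported word error rate of 0.4412 in context, here is a small sketch of how WER is commonly computed: the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. This is a generic implementation for illustration, not the evaluation script used for this model, and the example sentences are made up.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the processed reference
    # prefix and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# One substitution ("sat" -> "sit") and one deletion ("the"): 2 errors / 6 words.
print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

Under this definition, a WER of 0.4412 corresponds to roughly 44 word errors per 100 reference words, and an empty transcript scores exactly 1.0.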