Xlsr Wav2vec2 1
X
Xlsr Wav2vec2 1
Developed by chrisvinsen
A speech recognition model fine-tuned based on facebook/wav2vec2-large-xlsr-53, supporting multilingual speech-to-text tasks
Downloads 20
Release Time : 5/24/2022
Model Overview
This model is a fine-tuned version of wav2vec2-large-xlsr-53, focusing on speech recognition tasks, capable of converting speech to text
Model Features
Multilingual Support
Based on XLSR architecture, potentially supporting speech recognition in multiple languages
Efficient Training
Uses mixed-precision training and gradient accumulation techniques to improve training efficiency
Continuous Optimization
After 30 training epochs, word error rate decreased from 1.0 to 0.4412
Model Capabilities
Speech-to-text
Multilingual speech recognition
Use Cases
Speech Transcription
Meeting Minutes
Automatically convert meeting recordings into text transcripts
Word error rate 0.4412
Voice Assistant
Serve as the speech recognition component for voice assistants
Featured Recommended AI Models
Š 2025AIbase