20220412 203254

Developed by lilitket
This model is a version of facebook/wav2vec2-xls-r-300m fine-tuned on the common_voice dataset for automatic speech recognition.
Downloads 18
Release Time: 4/12/2022

Model Overview

The model uses the wav2vec2-xls-r-300m architecture and was fine-tuned on the common_voice dataset to convert speech audio into text.
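A minimal sketch of how a wav2vec2 checkpoint like this one might be loaded for transcription, assuming the Hugging Face transformers library (with a PyTorch backend and ffmpeg for audio decoding) is installed. The function name transcribe and both of its parameters are illustrative, and the repository id is deliberately left as an argument because this page does not state it.

```python
def transcribe(audio_path: str, model_id: str) -> str:
    """Return the text transcript of an audio file.

    audio_path: path to a local audio file (hypothetical input).
    model_id:   Hub repository id or local directory of the fine-tuned
                checkpoint; not given on this page, so the caller supplies it.
    """
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    # The ASR pipeline bundles the feature extractor, the CTC model,
    # and the tokenizer that decodes predicted ids into text.
    asr = pipeline("automatic-speech-recognition", model=model_id)

    # When given a file path, the pipeline decodes the audio with ffmpeg and
    # resamples it to the rate the feature extractor expects.
    result = asr(audio_path)
    return result["text"]


# Example call (not executed here, both arguments are placeholders):
# text = transcribe("meeting.wav", "<user>/<model-name>")
```

The first call downloads the checkpoint, so it needs network access unless model_id points to a local directory.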

Model Features

Efficient Fine-tuning
Fine-tuned from the pre-trained wav2vec2-xls-r-300m checkpoint, so training starts from representations learned during large-scale pre-training rather than from scratch.
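Word Error Rate
Reports a word error rate (WER) of 1.0019 on the evaluation set. WER counts substitutions, deletions, and insertions relative to the number of reference words, so a value above 1.0 means more errors than reference words, which indicates poor transcription accuracy on this evaluation set.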
Mixed Precision Training
Trained with native AMP (automatic mixed precision) to improve training efficiency.
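To make the WER figure above easier to interpret, here is a minimal sketch of the standard word-level edit-distance calculation. It is not the evaluation script used for this model, and the function name word_error_rate is illustrative. It shows how WER can exceed 1.0 when the hypothesis contains more errors than the reference has words.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


print(word_error_rate("the cat sat", "the cat sat"))  # 0.0
print(word_error_rate("hello world", "a b c"))        # 1.5
```

In the second call, two substitutions plus one insertion against a two-word reference give 3 / 2 = 1.5, which is why a reported value such as 1.0019 is possible.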

Model Capabilities

Speech to Text
Automatic Speech Recognition

Use Cases

Speech Transcription
Automatic Meeting Transcription
Automatically converts meeting recordings into text transcripts
Reported evaluation word error rate: 1.0019 (above 1.0, so transcripts should be checked before use)
Voice Assistant
Used in the speech recognition module of voice assistant systems