W

Wav2vec2 Xls R Tf Left Right Shuru

Developed by hrdipto
A speech recognition model fine-tuned based on facebook/wav2vec2-xls-r-300m, achieving a word error rate (WER) of 1.2628 on the evaluation set.
Downloads 29
Release Time : 3/2/2022

Model Overview

This is a speech recognition model fine-tuned based on the wav2vec2-xls-r-300m architecture, suitable for speech-to-text tasks.

Model Features

Low Word Error Rate
Achieved a word error rate (WER) of 1.2628 on the evaluation set, demonstrating excellent performance.
Based on wav2vec2-xls-r Architecture
Utilizes facebook's wav2vec2-xls-r-300m as the base model, featuring powerful speech feature extraction capabilities.
Mixed Precision Training
Employs native AMP for mixed precision training, improving training efficiency.

Model Capabilities

Speech Recognition
Speech-to-Text

Use Cases

Speech Transcription
Meeting Minutes
Automatically convert meeting recordings into text transcripts
Word error rate 1.2628
Voice Notes
Convert voice notes into editable text
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase