W

Wav2vec2 Large Xls R 300m Ia

Developed by ayameRushia
An automatic speech recognition model fine-tuned on the Common Voice 8.0 international language dataset based on facebook/wav2vec2-xls-r-300m
Downloads 23
Release Time : 3/2/2022

Model Overview

This model is an automatic speech recognition (ASR) model optimized for international languages, fine-tuned on the Common Voice 8.0 dataset, supporting speech-to-text conversion.

Model Features

High-Performance Speech Recognition
Achieved a word error rate (WER) of 8.6074% and a character error rate (CER) of 2.4147% on the Common Voice 8.0 international language test set
Language Model Support
Supports decoding with a language model, significantly improving recognition accuracy
Based on Large-Scale Pretrained Model
Fine-tuned on the facebook/wav2vec2-xls-r-300m model, inheriting its powerful speech feature extraction capabilities

Model Capabilities

Speech-to-Text
International Speech Recognition
Supports Language Model Decoding

Use Cases

Speech Transcription
International Language Speech Transcription
Convert international language speech content into text
Achieved a word error rate of 8.6074% on the test set
Voice Assistants
International Language Voice Command Recognition
Recognize international language voice commands
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase