Wav2vec2 Xlsr Interlingua

Developed by sammy786
This model is a fine-tuned version of facebook/wav2vec2-xls-r-1b, trained on the Interlingua (ia) subset of the mozilla-foundation/common_voice_8_0 dataset for automatic speech recognition.
Downloads 183
Release Time: 3/2/2022

Model Overview

An automatic speech recognition model for Interlingua, fine-tuned from the facebook/wav2vec2-xls-r-1b checkpoint on the Interlingua subset of Common Voice 8. It converts Interlingua speech to text.
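As a rough illustration of the speech-to-text use described above, the following sketch wraps the Hugging Face transformers ASR pipeline. The Hub id in MODEL_ID is an assumption inferred from the developer and model name on this page, and the function name transcribe is hypothetical. Decoding an audio file by path also assumes ffmpeg is available on the system.

```python
# Assumed Hub id, inferred from the developer and model name; verify before use.
MODEL_ID = "sammy786/wav2vec2-xlsr-interlingua"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Return the transcript of one audio file (hypothetical helper)."""
    # Imported lazily so that this module loads even without transformers installed.
    from transformers import pipeline

    # The pipeline loads the model and its processor, decodes the audio file,
    # and resamples it to the rate the model expects.
    asr = pipeline("automatic-speech-recognition", model=model_id)
    result = asr(audio_path)
    return result["text"]
```

A call such as transcribe("sample_ia.wav"), with a file name chosen here only for illustration, would download the checkpoint on first use. For many files, building the pipeline once and reusing it avoids reloading the model on every call.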

Model Features

High-Performance Interlingua Recognition
Achieves a 16.81% Word Error Rate (WER) and 4.76% Character Error Rate (CER) on the Common Voice 8 Interlingua test set.
Based on Large-Scale Pretrained Model
Fine-tuned from facebook/wav2vec2-xls-r-1b, so it starts from that checkpoint's pretrained speech representations.
Optimized Training Process
Trained with a cosine_with_restarts learning rate schedule and mixed-precision training.
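The cosine_with_restarts schedule named above can be sketched in plain Python. The version below follows the common hard-restart formulation: linear warmup, then a cosine decay that jumps back to the peak at the start of each cycle. The source names only the scheduler, so the warmup length, number of cycles, and total step count used here are assumptions for illustration.

```python
import math


def cosine_with_restarts_factor(step, total_steps, warmup_steps=0, num_cycles=1):
    """Multiplier applied to the peak learning rate at a given step."""
    # Linear warmup from 0 to 1.
    if step < warmup_steps:
        return step / max(1, warmup_steps)
    # Fraction of the post-warmup training that has elapsed.
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    if progress >= 1.0:
        return 0.0
    # Position inside the current cycle, in [0, 1); wrapping to 0 is the restart.
    cycle_position = (num_cycles * progress) % 1.0
    return max(0.0, 0.5 * (1.0 + math.cos(math.pi * cycle_position)))


# Assumed values for illustration only: 10 warmup steps, 110 total steps, 2 cycles.
factors = [cosine_with_restarts_factor(s, 110, 10, 2) for s in range(111)]
```

With these assumed settings the factor rises to 1.0 at step 10, decays toward 0 by the end of the first cycle, restarts at 1.0 at step 60, and reaches 0.0 at step 110.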

Model Capabilities

Interlingua Speech Recognition
Speech-to-Text
Robust Speech Event Processing

Use Cases

Speech Transcription
Interlingua Speech Transcription: Convert Interlingua speech content into text (16.81% WER on the Common Voice 8 Interlingua test set).
Dialogue Systems
Interlingua Dialogue Input: Transcribe spoken input for Interlingua dialogue systems.
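The 16.81% WER and 4.76% CER figures reported for this model are edit-distance metrics: the number of word (or character) substitutions, insertions, and deletions needed to turn the model output into the reference transcript, divided by the reference length. The sketch below shows that calculation for a single utterance. It is not the evaluation script used for the reported numbers, the sample sentences are made up, and it assumes a non-empty reference.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (words or characters)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate for one utterance; assumes a non-empty reference."""
    ref_words = reference.split()
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate for one utterance; assumes a non-empty reference."""
    return edit_distance(list(reference), list(hypothesis)) / len(reference)


# Made-up example: one word of a six-word reference is missing from the output.
example_wer = wer("le can es in le jardin", "le can es in jardin")
```

Corpus-level scores are normally computed by summing the errors and reference lengths over all test utterances before dividing, which this per-utterance sketch leaves out.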