wav2vec2-xls-r-1b-21-to-en

Developed by facebook
Facebook's Wav2Vec2 XLS-R model for multilingual speech-to-English translation tasks
Downloads: 511
Release date: 3/2/2022

Model Overview

This is a SpeechEncoderDecoder model that translates speech in 21 languages into English text. The encoder is initialized from facebook/wav2vec2-xls-r-1b and the decoder from facebook/mbart-large-50; the combined model is then fine-tuned on the CoVoST 2 dataset.
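
A minimal usage sketch, assuming the Hugging Face transformers library is installed and that the checkpoint is published under the ID facebook/wav2vec2-xls-r-1b-21-to-en (inferred from the model name, not stated on this page). The helper names build_translator and translate_file are hypothetical. The "automatic-speech-recognition" pipeline task is used because it also handles speech-to-text translation checkpoints; decoding from a file path additionally assumes ffmpeg is available on the system.

```python
MODEL_ID = "facebook/wav2vec2-xls-r-1b-21-to-en"  # assumed Hub ID


def build_translator(model_id: str = MODEL_ID):
    """Load the checkpoint as a transformers pipeline (downloads the weights)."""
    # Imported lazily so that merely defining these helpers does not pull in
    # the heavy dependency.
    from transformers import pipeline

    return pipeline(
        "automatic-speech-recognition",
        model=model_id,
        feature_extractor=model_id,
    )


def translate_file(translator, audio_path: str) -> str:
    """Run the translator on one audio file and return the English text."""
    result = translator(audio_path)  # the pipeline returns a dict with a "text" key
    return result["text"].strip()


if __name__ == "__main__":
    translator = build_translator()
    # "speech.wav" is a placeholder path; replace it with a real recording.
    print(translate_file(translator, "speech.wav"))
```

The two helpers are kept separate so that the model is loaded once and reused across many files.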

Model Features

Multilingual support
Supports speech translation from 21 languages to English
Large-scale pretraining
Built on the 1-billion-parameter XLS-R speech encoder (facebook/wav2vec2-xls-r-1b), which was pretrained on multilingual speech
End-to-end translation
Translates speech directly into English text, with no separate transcription step

Model Capabilities

Speech recognition
Multilingual translation
Speech-to-text conversion

Use Cases

Speech translation
Real-time speech translation
Translates live speech from meetings, lectures, and similar settings into English
Reported to perform strongly on the CoVoST 2 dataset, on which it was fine-tuned
Multilingual voice assistant
Provides multilingual input support for voice assistants
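
For long recordings such as meetings or lectures, one common approach is to split the waveform into overlapping fixed-length windows and translate each window in turn. This is an assumption for illustration; the page does not describe how long audio is handled. The sketch below only computes the window boundaries. The function name chunk_spans and the 20 s window with 2 s overlap are hypothetical choices; 16 kHz is the sample rate that Wav2Vec2-family encoders expect.

```python
from typing import List, Tuple


def chunk_spans(
    num_samples: int,
    sample_rate: int = 16_000,
    chunk_s: float = 20.0,    # illustrative window length, not from the source
    overlap_s: float = 2.0,   # illustrative overlap, not from the source
) -> List[Tuple[int, int]]:
    """Return (start, end) sample indices of overlapping windows over the audio."""
    if chunk_s <= overlap_s:
        raise ValueError("chunk_s must be larger than overlap_s")
    size = int(chunk_s * sample_rate)
    step = size - int(overlap_s * sample_rate)

    spans = []
    start = 0
    while start < num_samples:
        end = min(start + size, num_samples)
        spans.append((start, end))
        if end == num_samples:
            break  # the last window reached the end of the recording
        start += step
    return spans


if __name__ == "__main__":
    # A 50-second recording at 16 kHz.
    for start, end in chunk_spans(50 * 16_000):
        print(start, end)
```

Each (start, end) pair indexes into the sample array. Overlapping regions can yield duplicated text across neighbouring windows, so the per-window translations still need to be merged downstream.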