W

Wav2vec2 Xls R 2b 22 To 16

Developed by facebook
Facebook's Wav2Vec2 XLS-R model fine-tuned for multilingual speech translation tasks, supporting mutual translation between 22 input languages and 16 output languages.
Downloads 38
Release Time : 3/2/2022

Model Overview

This is a speech translation model based on the SpeechEncoderDecoder architecture, capable of translating multiple spoken languages into written languages. The encoder is based on wav2vec2-xls-r-2b, and the decoder is based on mbart-large-50, fine-tuned on the Covost2 dataset.

Model Features

Multilingual Support
Supports mutual translation between 22 input languages and 16 output languages, covering a wide range of language needs.
Large-scale Pretraining
Based on the 2-billion-parameter Wav2Vec2-XLS-R model, with powerful speech feature extraction capabilities.
End-to-end Translation
Direct translation from speech to target language text, without intermediate transcription steps.

Model Capabilities

Speech Recognition
Multilingual Translation
Speech-to-Text Conversion

Use Cases

International Communication
Real-time Speech Translation
Translates speech in meetings or conversations into other languages in real-time.
Supports accurate translation for multiple language combinations.
Media Processing
Video Subtitle Generation
Automatically generates translated subtitles for foreign-language videos.
Supports subtitle generation for multiple language pairs.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase