S2t Wav2vec2 Large En Ca

Developed by facebook
A Transformer-based end-to-end speech translation model designed for English-to-Catalan speech translation.
Downloads 35
Release Time: 3/2/2022

Model Overview

The model pairs a pre-trained Wav2Vec2 encoder with a Transformer decoder and translates English speech directly into Catalan text.

Model Features

End-to-end speech translation
Directly generates target language text from speech input without intermediate transcription steps
Wav2Vec2 pre-training
Uses a Wav2Vec2 model pre-trained with large-scale self-supervision as the speech encoder
Transformer architecture
Employs a standard Transformer decoder for sequence generation
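
The encoder-decoder pairing described above can be exercised roughly as follows. This is a minimal sketch, not an official usage recipe. It assumes the Hugging Face transformers library (with PyTorch) is installed, that the checkpoint is published under the Hub ID facebook/s2t-wav2vec2-large-en-ca (inferred from the model name and developer above), and that the model expects 16 kHz mono audio, as Wav2Vec2 checkpoints usually do. The helper names pcm16_to_float and translate are hypothetical, and the Speech2Text2 classes may be unavailable in some transformers versions.

```python
import struct

# Assumed Hub ID, inferred from the model name; verify before use.
MODEL_ID = "facebook/s2t-wav2vec2-large-en-ca"
# Assumed input rate; Wav2Vec2 checkpoints are typically trained on 16 kHz audio.
TARGET_RATE = 16_000


def pcm16_to_float(pcm_bytes):
    """Convert little-endian 16-bit mono PCM bytes to floats in [-1.0, 1.0)."""
    count = len(pcm_bytes) // 2
    samples = struct.unpack("<%dh" % count, pcm_bytes[: count * 2])
    return [s / 32768.0 for s in samples]


def translate(waveform, sampling_rate=TARGET_RATE):
    """Translate one English utterance (a sequence of floats) into Catalan text."""
    if sampling_rate != TARGET_RATE:
        raise ValueError("expected %d Hz audio, got %d Hz" % (TARGET_RATE, sampling_rate))

    # Imported lazily so the helper above works without transformers installed.
    from transformers import Speech2Text2Processor, SpeechEncoderDecoderModel

    processor = Speech2Text2Processor.from_pretrained(MODEL_ID)
    model = SpeechEncoderDecoderModel.from_pretrained(MODEL_ID)

    # The processor normalizes the raw waveform for the Wav2Vec2 encoder.
    inputs = processor(waveform, sampling_rate=sampling_rate, return_tensors="pt")

    # The Transformer decoder generates Catalan tokens directly from the speech
    # representation; there is no intermediate English transcript.
    generated_ids = model.generate(
        inputs["input_values"], attention_mask=inputs["attention_mask"]
    )
    return processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
```

Calling translate(pcm16_to_float(raw_bytes)) on a 16 kHz mono recording should return a single Catalan string. Audio at other sampling rates would need resampling first, which this sketch leaves out.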

Model Capabilities

English speech recognition
English-to-Catalan translation
End-to-end speech translation

Use Cases

Speech translation
Real-time speech translation
Translates English speech into Catalan text in real time
Achieves a BLEU score of 34.1 on the CoVoST-V2 test set
Speech transcription and translation
Transcribes and translates English speech content into Catalan
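
For readers unfamiliar with the BLEU metric quoted above, the following is a simplified sketch of corpus-level BLEU on a 0 to 100 scale. It assumes whitespace tokenization, one reference per hypothesis, and no smoothing. It is not the evaluation script behind the reported score, and published results are normally computed with a standardized tool, so this function will not reproduce that figure exactly. The function names are illustrative.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def corpus_bleu(hypotheses, references, max_n=4):
    """Simplified corpus BLEU: clipped n-gram precision with a brevity penalty."""
    matches = [0] * max_n
    totals = [0] * max_n
    hyp_len = ref_len = 0

    for hyp, ref in zip(hypotheses, references):
        h, r = hyp.split(), ref.split()
        hyp_len += len(h)
        ref_len += len(r)
        for n in range(1, max_n + 1):
            h_counts, r_counts = ngrams(h, n), ngrams(r, n)
            # Clip each hypothesis n-gram count by its count in the reference.
            matches[n - 1] += sum(min(c, r_counts[g]) for g, c in h_counts.items())
            totals[n - 1] += sum(h_counts.values())

    # Without smoothing, any n-gram order with zero matches gives a score of 0.
    if hyp_len == 0 or min(matches) == 0:
        return 0.0

    # Geometric mean of the n-gram precisions, computed in log space.
    log_precision = sum(math.log(m / t) for m, t in zip(matches, totals)) / max_n
    # Penalize output that is shorter than the reference.
    brevity = 1.0 if hyp_len > ref_len else math.exp(1 - ref_len / hyp_len)
    return 100.0 * brevity * math.exp(log_precision)
```

With the reference "el gat beu llet freda", an identical hypothesis scores 100, while "el gat beu llet calenta" scores about 66.9, because a single wrong word lowers the precision at every n-gram order.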