S

S2t Wav2vec2 Large En De

Developed by facebook
Transformer-based end-to-end speech translation model, specifically designed for English-to-German speech translation
Downloads 817
Release Time : 3/2/2022

Model Overview

This model is a Transformer-based sequence-to-sequence model, combined with a pre-trained Wav2Vec2 encoder, for end-to-end translation from English speech to German text.

Model Features

End-to-end speech translation
Directly generates German text output from English speech input without intermediate transcription steps
Based on Wav2Vec2 pre-training
Utilizes large-scale self-supervised pre-trained Wav2Vec2 as the speech encoder to improve model performance
Transformer architecture
Adopts a Transformer decoder for high-quality sequence generation

Model Capabilities

English speech recognition
English-to-German speech translation
End-to-end speech processing

Use Cases

Speech translation services
Real-time speech translation
Translates English speech to German text in real-time
Achieves 26.5 BLEU score on the CoVoST-V2 test set
Meeting minutes translation
Automatically translates English meeting recordings into German meeting minutes
Speech-assisted technologies
Multilingual voice assistant
Supports voice assistant functionality with English input and German output
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase