S2T Medium MuST-C Multilingual ST

Developed by facebook
Transformer-based end-to-end multilingual speech translation model that translates English speech into text in multiple target languages
Downloads 7,322
Release Time: 3/2/2022

Model Overview

This model adopts the Transformer architecture, specifically designed for end-to-end automatic speech recognition and speech translation. It processes speech input through convolutional downsampling and generates translation results in an autoregressive manner.
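The flow described above (speech features in, autoregressively generated text out) can be sketched with the Hugging Face transformers Speech2Text classes. This is a minimal sketch, not an official recipe: the Hub identifier is inferred from the model name and developer shown on this page, the helper name translate_english_speech is hypothetical, and the transformers, torch, torchaudio and sentencepiece packages are assumed to be installed.

```python
# Assumed Hub id, inferred from the model name and developer listed on this page.
MODEL_ID = "facebook/s2t-medium-mustc-multilingual-st"


def translate_english_speech(waveform, target_lang="fr", sampling_rate=16_000):
    """Translate one English utterance into text in `target_lang`.

    `waveform` is a 1-D float array of audio samples. The function name and
    defaults are illustrative only.
    """
    # Imported lazily so that defining this helper needs no heavy dependencies.
    from transformers import (
        Speech2TextForConditionalGeneration,
        Speech2TextProcessor,
    )

    processor = Speech2TextProcessor.from_pretrained(MODEL_ID)
    model = Speech2TextForConditionalGeneration.from_pretrained(MODEL_ID)

    # The processor extracts filter-bank features from the raw waveform.
    inputs = processor(waveform, sampling_rate=sampling_rate, return_tensors="pt")

    # Forcing the first generated token to the target-language id selects
    # the output language of the multilingual decoder.
    generated_ids = model.generate(
        inputs["input_features"],
        attention_mask=inputs["attention_mask"],
        forced_bos_token_id=processor.tokenizer.lang_code_to_id[target_lang],
    )
    return processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
```

Calling translate_english_speech(samples, target_lang="de") on 16 kHz mono audio would download the checkpoint on first use and return a single translated string.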

Model Features

Multilingual support
Supports speech translation from English to 8 languages, including French, German, Spanish, etc.
End-to-end architecture
Generates target-language text directly from speech features, replacing the traditional pipeline of separate speech recognition and machine translation stages.
Efficient speech processing
Reduces speech input length by 3/4 through convolutional downsampling, improving processing efficiency.
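The 3/4 reduction can be checked with a little arithmetic. The sketch below assumes the downsampler is two 1-D convolutions, each with kernel size 5, padding 2 and stride 2; the page states only the overall reduction, so the layer count and sizes are assumptions. The function name downsampled_length is hypothetical.

```python
def downsampled_length(num_frames, num_conv_layers=2, stride=2):
    """Sequence length after the assumed convolutional downsampler.

    With kernel size 5 and padding 2, a stride-2 convolution maps a length L
    to floor((L + 2*2 - 5) / 2) + 1, which simplifies to (L - 1) // 2 + 1.
    """
    length = num_frames
    for _ in range(num_conv_layers):
        length = (length - 1) // stride + 1
    return length


frames = 1000                       # e.g. 10 s of audio at one frame per 10 ms
reduced = downsampled_length(frames)
reduction = 1 - reduced / frames    # fraction of the input length removed
```

Under these assumptions 1000 input frames become 250 encoder positions, a reduction of 0.75, which matches the 3/4 figure quoted above.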

Model Capabilities

English speech recognition
Multilingual speech translation
Automatic speech-to-text

Use Cases

Speech translation services
Real-time speech translation
Translates English speeches or conversations into target-language text in real time
Achieves BLEU scores of 24.5 to 34.9 on the MuST-C test set
Multimedia subtitle generation
Generates multilingual subtitles for English video content
Language learning assistance
Language learning tool
Helps language learners understand English speech content