S

S2t Small Covost2 En Fa St

Developed by facebook
A Transformer-based end-to-end speech translation model for English-to-Persian speech translation tasks
Downloads 49
Release Time : 3/2/2022

Model Overview

This model is a sequence-to-sequence speech-to-text (S2T) converter specifically designed for English speech to Persian text translation tasks. It uses a convolutional downsampler to process speech input and employs a Transformer architecture for translation.

Model Features

End-to-end speech translation
Directly generates Persian text output from English speech input without intermediate transcription steps
Convolutional downsampler
Uses convolutional layers to reduce the length of speech input before feeding it to the encoder, improving processing efficiency
Transformer-based architecture
Adopts standard Transformer encoder-decoder structure with excellent sequence modeling capabilities
Multilingual support
Supports English-to-Persian translation tasks

Model Capabilities

Speech translation
English speech recognition
Persian text generation

Use Cases

Speech translation applications
Real-time speech translation
Translates English speech into Persian text in real time
Achieves 11.43 BLEU score on CoVOST2 test set
Meeting transcript translation
Automatically translates English meeting recordings into Persian text transcripts
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase