Test Audio

Developed by joaogante
A Transformer-based end-to-end speech translation model specifically designed for French-to-English speech translation tasks.
Downloads 19
Release Time: 5/16/2022

Model Overview

This is a sequence-to-sequence speech-to-text model that translates French speech into English text. A convolutional downsampler shortens the speech input, and a Transformer architecture generates the translated text.

Model Features

End-to-end speech translation
Directly generates translated text from speech input without intermediate transcription steps.
Transformer-based architecture
Uses a Transformer architecture to capture long-range dependencies between the speech input and the generated text.
Convolutional downsampling
Employs a convolutional downsampler to reduce the length of speech features before they enter the encoder, improving processing efficiency.
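
To make the efficiency gain from convolutional downsampling concrete, here is a minimal sketch of how the sequence length shrinks before the encoder. The layer count, kernel size, stride, and padding below are illustrative assumptions (this page does not state the model's configuration), and the function names are hypothetical.

```python
def conv_output_length(length: int, kernel_size: int = 5,
                       stride: int = 2, padding: int = 2) -> int:
    """Output length of a 1-D convolution, using the standard formula.

    The kernel size, stride, and padding defaults are assumed values,
    not taken from the model card.
    """
    return (length + 2 * padding - kernel_size) // stride + 1


def downsampled_length(num_frames: int, num_layers: int = 2) -> int:
    """Apply the assumed convolution num_layers times in sequence."""
    for _ in range(num_layers):
        num_frames = conv_output_length(num_frames)
    return num_frames


if __name__ == "__main__":
    # Under these assumptions, two stride-2 layers shorten the input roughly 4x.
    for frames in (100, 1000, 3000):
        print(frames, "->", downsampled_length(frames))
```

Because self-attention cost grows quadratically with sequence length, shortening the input by about 4x under these assumptions cuts the encoder's attention cost by roughly 16x.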

Model Capabilities

French speech recognition
French-to-English speech translation
End-to-end speech processing

Use Cases

Speech translation services
Real-time speech translation
Translates French speech to English text in real time, suitable for meetings, lectures, and similar scenarios.
The model achieved a BLEU score of 26.25 on the CoVoST2 test set.
Speech content transcription and translation
Transcribes and translates French speech content into English text for content localization.