X

Xm Transformer S2ut En Hk

Developed by facebook
Fairseq-developed English-Hokkien (Taiwanese) speech-to-speech translation model, featuring a single-channel decoder architecture that supports direct speech conversion without intermediate text
Downloads 31
Release Time : 10/7/2022

Model Overview

This model facilitates direct speech-to-speech translation between English and Hokkien (Taiwanese), utilizing a Transformer architecture combined with speech synthesis technology for end-to-end conversion

Model Features

Direct Speech Conversion
Achieves end-to-end speech-to-speech translation without intermediate text representation
Multi-Data Source Training
Combines supervised data from the TED domain with weakly supervised data from TED and audiobook domains for training
High-Quality Speech Synthesis
Employs the unit_hifigan_HK_layer12 vocoder to generate natural and fluent speech output

Model Capabilities

English-to-Hokkien Speech Translation
Hokkien-to-English Speech Translation
Cross-Language Speech Conversion

Use Cases

Language Communication
Real-Time Speech Translation
Used for real-time conversation translation between English and Hokkien speakers
Enables natural and fluent cross-language communication
Media Content Processing
TED Talk Translation
Automatically translates English TED Talks into Hokkien versions
Expands content audience reach
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase