Xm Transformer S2ut En Hk
Fairseq-developed English-Hokkien (Taiwanese) speech-to-speech translation model, featuring a single-channel decoder architecture that supports direct speech conversion without intermediate text
Downloads 31
Release Time : 10/7/2022
Model Overview
This model facilitates direct speech-to-speech translation between English and Hokkien (Taiwanese), utilizing a Transformer architecture combined with speech synthesis technology for end-to-end conversion
Model Features
Direct Speech Conversion
Achieves end-to-end speech-to-speech translation without intermediate text representation
Multi-Data Source Training
Combines supervised data from the TED domain with weakly supervised data from TED and audiobook domains for training
High-Quality Speech Synthesis
Employs the unit_hifigan_HK_layer12 vocoder to generate natural and fluent speech output
Model Capabilities
English-to-Hokkien Speech Translation
Hokkien-to-English Speech Translation
Cross-Language Speech Conversion
Use Cases
Language Communication
Real-Time Speech Translation
Used for real-time conversation translation between English and Hokkien speakers
Enables natural and fluent cross-language communication
Media Content Processing
TED Talk Translation
Automatically translates English TED Talks into Hokkien versions
Expands content audience reach
Featured Recommended AI Models