O

Opus Mt En Zh

Developed by Helsinki-NLP
A Transformer-based English-to-Multidialectal Chinese translation model supporting translation tasks from English to 13 Chinese variants
Downloads 442.08k
Release Time : 3/2/2022

Model Overview

This machine translation system developed by Helsinki-NLP team specializes in English-to-multidialectal Chinese translation tasks, employing SentencePiece tokenization for preprocessing.

Model Features

Multidialectal Support
Supports translation output for 13 Chinese dialects and variants including Mandarin, Cantonese, Classical Chinese etc.
Standardized Preprocessing
Employs standardized+SentencePiece tokenization (spm32k) for text preprocessing
Target Language Identification
Specifies output dialect variants by adding >>id<< identifiers at sentence beginnings

Model Capabilities

English-to-Chinese translation
Multidialectal text generation
Cross-language conversion

Use Cases

Language Services
Multidialectal Content Localization
Translating English content into different Chinese dialect versions
Supports output in 13 dialect variants
Classical Text Translation Assistance
Translating English into Classical Chinese format
Supports simplified/traditional Classical Chinese output
Educational Applications
Dialect Learning Assistance
Generating different dialect versions of the same content for language learning
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase