ElanMT-BT-ja-en

Developed by Mitsua
ElanMT-BT-ja-en is a Japanese-to-English translation model developed by the ELAN MITSUA Project/Abstract Engine, trained exclusively using open-license data and back-translated Wikipedia data.
Downloads 502
Release Time: 5/20/2024

Model Overview

This is a Japanese-to-English translation model based on the Marian MT architecture. It is trained on openly licensed data and avoids both web-scraped corpora and corpora produced by other machine translation systems.

Model Features

Open Data Training
Trained exclusively on corpora released under open licenses such as CC0, CC BY, and CC BY-SA, which reduces copyright concerns.
Back-Translation Enhancement
The training data is augmented with output from back-translation models, which improves translation quality.
High-Quality Vocabulary Performance
A newly constructed 1.5 million-line Wikipedia parallel corpus significantly improves vocabulary-level performance.
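
Back-translation, as the term is commonly used, starts from monolingual text in the target language (here English), translates it into the source language (Japanese) with a reverse-direction model, and pairs the synthetic source with the original target. The sketch below illustrates only that general idea; it is not the project's actual pipeline, and `build_back_translated_pairs` and the `en_to_ja` callable are hypothetical names.

```python
from typing import Callable, List, Tuple


def build_back_translated_pairs(
    english_sentences: List[str],
    en_to_ja: Callable[[str], str],
) -> List[Tuple[str, str]]:
    """Return (synthetic Japanese source, original English target) pairs.

    `en_to_ja` stands in for any English-to-Japanese model; it is an
    assumption for this sketch, not a documented component.
    """
    pairs: List[Tuple[str, str]] = []
    for sentence in english_sentences:
        english = sentence.strip()
        if not english:
            continue  # skip empty lines in the monolingual data
        japanese = en_to_ja(english).strip()
        if japanese:  # drop pairs where the reverse model produced nothing
            pairs.append((japanese, english))
    return pairs
```

The resulting pairs would then be mixed with genuine parallel data when training the Japanese-to-English model.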

Model Capabilities

Japanese-to-English text translation
Multi-sentence text processing
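
One simple way to handle multi-sentence input is to split the text into sentences, translate each one, and join the results. The sketch below assumes a naive punctuation-based splitter, which is not necessarily how the model authors handle it; a dedicated sentence segmenter would be more robust. `split_japanese_sentences` and `translate_document` are hypothetical names.

```python
import re
from typing import Callable, List

# A run of non-terminator characters followed by any sentence-ending marks.
# This punctuation set is a simplifying assumption.
_SENTENCE = re.compile(r"[^。！？!?]+[。！？!?]*")


def split_japanese_sentences(text: str) -> List[str]:
    """Split text into sentences on Japanese and ASCII end punctuation."""
    return [m.strip() for m in _SENTENCE.findall(text) if m.strip()]


def translate_document(text: str, translate_sentence: Callable[[str], str]) -> str:
    """Translate sentence by sentence and join the results with spaces."""
    sentences = split_japanese_sentences(text)
    return " ".join(translate_sentence(s) for s in sentences)
```

Any single-sentence translation function can be passed as `translate_sentence`, for example a thin wrapper around a translation pipeline.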

Use Cases

Text Translation
Japanese-to-English Document Translation
Translates Japanese documents into English, for example when translating openly licensed content.
Achieves BLEU scores of 24.87 on FLORES+ and 22.57 on NTREX.