🚀 ALMA (Advanced Language Model-based Translator)
ALMA is an LLM-based translation model that employs a novel translation paradigm: it is first fine-tuned on monolingual data and then further optimized with high-quality parallel data. This two-step fine-tuning process ensures strong translation performance. For more details, please refer to our paper:
```bibtex
@misc{xu2023paradigm,
      title={A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models},
      author={Haoran Xu and Young Jin Kim and Amr Sharaf and Hany Hassan Awadalla},
      year={2023},
      eprint={2309.11674},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}
```
ALMA-R (NEW!) is now released! ALMA-R builds upon the ALMA models. Instead of the supervised fine-tuning used in ALMA, it undergoes further LoRA fine-tuning with our proposed Contrastive Preference Optimization (CPO). CPO fine-tuning requires our [triplet preference data](https://huggingface.co/datasets/haoranxu/ALMA-R-Preference) for preference learning. ALMA-R can now match or even outperform GPT-4 or WMT winners!
```bibtex
@misc{xu2024contrastive,
      title={Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation},
      author={Haoran Xu and Amr Sharaf and Yunmo Chen and Weiting Tan and Lingfeng Shen and Benjamin Van Durme and Kenton Murray and Young Jin Kim},
      year={2024},
      eprint={2401.08417},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}
```
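CPO learns from preference triplets: a source sentence paired with a preferred and a dispreferred translation. As a minimal sketch of how the released preference data could be inspected with the Hugging Face `datasets` library, the snippet below loads the dataset and prints one record; whether a language-pair configuration is required and what the column names are is not specified here, so treat those details as assumptions and check the dataset card for the exact schema.

```python
from datasets import load_dataset

# Minimal sketch for inspecting the ALMA-R triplet preference data used by CPO.
# NOTE: the dataset may require a language-pair configuration (e.g. "zh-en");
# this call and the field layout are assumptions -- see the dataset card for the exact schema.
preference_data = load_dataset("haoranxu/ALMA-R-Preference", split="train")

example = preference_data[0]
print(example.keys())  # Inspect the real column names before relying on them.
print(example)         # Each record should hold a source sentence plus preferred and dispreferred translations.
```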
✨ Features
- Novel Paradigm: ALMA uses a two-step fine-tuning process (monolingual data first, then parallel data) for better translation performance.
- ALMA-R Upgrade: ALMA-R further enhances performance with CPO fine-tuning.
- Multiple Model Variants: We offer six translation models with different fine-tuning strategies.
📦 Installation
This README does not include dedicated installation steps. The usage example below only requires PyTorch plus the Hugging Face `transformers` and `peft` libraries (installable with pip), and a CUDA-capable GPU for the `.cuda()` call. Please see our GitHub repository for the full training and evaluation environment setup.
💻 Usage Examples
Basic Usage
A quick start for translating with our best system, ALMA-13B-LoRA. The example below translates "我爱机器翻译。" ("I love machine translation.") into English:
```python
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, LlamaTokenizer

# Load the stage-1 (monolingual fine-tuned) base model and attach the ALMA-13B LoRA weights.
model = AutoModelForCausalLM.from_pretrained("haoranxu/ALMA-13B-Pretrain", torch_dtype=torch.float16, device_map="auto")
model = PeftModel.from_pretrained(model, "haoranxu/ALMA-13B-Pretrain-LoRA")
tokenizer = LlamaTokenizer.from_pretrained("haoranxu/ALMA-13B-Pretrain", padding_side='left')

# Add the source sentence into the prompt template.
prompt = "Translate this from Chinese to English:\nChinese: 我爱机器翻译。\nEnglish:"
input_ids = tokenizer(prompt, return_tensors="pt", padding=True, max_length=40, truncation=True).input_ids.cuda()

# Translation
with torch.no_grad():
    generated_ids = model.generate(input_ids=input_ids, num_beams=5, max_new_tokens=20, do_sample=True, temperature=0.6, top_p=0.9)
outputs = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)
print(outputs)
```
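The ALMA-R checkpoints listed in the table below are released with the LoRA weights already merged into the base model, so they should load without `peft`. The following is a minimal sketch under that assumption (including loading the tokenizer from the same repository), reusing the prompt format and decoding settings from the example above; check the model card for any recommended settings.

```python
import torch
from transformers import AutoModelForCausalLM, LlamaTokenizer

# Minimal sketch for ALMA-13B-R: its LoRA weights are already merged (see the checkpoint table),
# so no PeftModel wrapper should be needed. Decoding settings simply mirror the example above.
model = AutoModelForCausalLM.from_pretrained("haoranxu/ALMA-13B-R", torch_dtype=torch.float16, device_map="auto")
tokenizer = LlamaTokenizer.from_pretrained("haoranxu/ALMA-13B-R", padding_side='left')

prompt = "Translate this from Chinese to English:\nChinese: 我爱机器翻译。\nEnglish:"
input_ids = tokenizer(prompt, return_tensors="pt", padding=True, max_length=40, truncation=True).input_ids.cuda()

with torch.no_grad():
    generated_ids = model.generate(input_ids=input_ids, num_beams=5, max_new_tokens=20, do_sample=True, temperature=0.6, top_p=0.9)
print(tokenizer.batch_decode(generated_ids, skip_special_tokens=True))
```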
📚 Documentation
Model Variants
We release six translation models presented in the paper:
- ALMA-7B: Full-weight fine-tune LLaMA-2-7B on 20B monolingual tokens, then full-weight fine-tune on human-written parallel data.
- ALMA-7B-LoRA: Full-weight fine-tune LLaMA-2-7B on 20B monolingual tokens, then LoRA fine-tune on human-written parallel data.
- ALMA-7B-R (NEW!): Further LoRA fine-tuning upon ALMA-7B-LoRA with Contrastive Preference Optimization.
- ALMA-13B: Full-weight fine-tune LLaMA-2-13B on 12B monolingual tokens, then full-weight fine-tune on human-written parallel data.
- ALMA-13B-LoRA (Our best system): Full-weight fine-tune LLaMA-2-13B on 12B monolingual tokens, then LoRA fine-tune on human-written parallel data.
- ALMA-13B-R (NEW!): Further LoRA fine-tuning upon ALMA-13B-LoRA with Contrastive Preference Optimization.
Model Checkpoints
Model checkpoints are released on Hugging Face:
| Models | Base Model Link | LoRA Link |
|:------:|:---------------:|:---------:|
| ALMA-7B | [haoranxu/ALMA-7B](https://huggingface.co/haoranxu/ALMA-7B) | - |
| ALMA-7B-LoRA | [haoranxu/ALMA-7B-Pretrain](https://huggingface.co/haoranxu/ALMA-7B-Pretrain) | [haoranxu/ALMA-7B-Pretrain-LoRA](https://huggingface.co/haoranxu/ALMA-7B-Pretrain-LoRA) |
| ALMA-7B-R (NEW!) | [haoranxu/ALMA-7B-R (LoRA merged)](https://huggingface.co/haoranxu/ALMA-7B-R) | - |
| ALMA-13B | [haoranxu/ALMA-13B](https://huggingface.co/haoranxu/ALMA-13B) | - |
| ALMA-13B-LoRA | [haoranxu/ALMA-13B-Pretrain](https://huggingface.co/haoranxu/ALMA-13B-Pretrain) | [haoranxu/ALMA-13B-Pretrain-LoRA](https://huggingface.co/haoranxu/ALMA-13B-Pretrain-LoRA) |
| ALMA-13B-R (NEW!) | [haoranxu/ALMA-13B-R (LoRA merged)](https://huggingface.co/haoranxu/ALMA-13B-R) | - |
Datasets
Datasets used by ALMA and ALMA-R are also released on Hugging Face now (NEW!):
| Datasets | Train / Validation | Test |
|:--------:|:------------------:|:----:|
| Human-Written Parallel Data (ALMA) | [train and validation](https://huggingface.co/datasets/haoranxu/ALMA-Human-Parallel) | [WMT'22](https://huggingface.co/datasets/haoranxu/WMT22-Test) |
| Triplet Preference Data | [train](https://huggingface.co/datasets/haoranxu/ALMA-R-Preference) | [WMT'22](https://huggingface.co/datasets/haoranxu/WMT22-Test) and [WMT'23](https://huggingface.co/datasets/haoranxu/WMT23-Test) |
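Both datasets can be pulled directly with the Hugging Face `datasets` library. The short sketch below loads the human-written parallel data and the WMT'22 test set and prints their split/column overview; whether a language-pair configuration is required and what the columns are named is not stated in this README, so those details are assumptions to verify against each dataset card.

```python
from datasets import load_dataset

# Minimal sketch, not an official data-loading recipe. These datasets may expect a
# language-pair configuration (e.g. "zh-en"); configurations and column names are assumed here.
parallel_data = load_dataset("haoranxu/ALMA-Human-Parallel")  # stage-2 train/validation data
wmt22_test = load_dataset("haoranxu/WMT22-Test")              # evaluation test set

print(parallel_data)  # Show the available splits and columns.
print(wmt22_test)
```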
⚠️ Important Note
Note that ALMA-7B-Pretrain and ALMA-13B-Pretrain are NOT translation models. They have only undergone the stage-1 monolingual fine-tuning (20B tokens for the 7B model and 12B tokens for the 13B model) and must be used together with their LoRA adapters, as shown in the usage example above.
Please find more details in our GitHub repository.
📄 License
This project is released under the MIT license.