🚀 TowerInstruct-7B-v0.2 Model Card
TowerInstruct-7B-v0.2 is a language model fine-tuned for a variety of translation-related tasks. It offers reliable performance in multiple languages and has been trained on diverse datasets.
🚀 Quick Start
Here's how you can run the model using the pipeline() function from 🤗 Transformers:
import torch
from transformers import pipeline

# Load the model in bfloat16 and shard it across available devices.
pipe = pipeline("text-generation", model="Unbabel/TowerInstruct-7B-v0.2", torch_dtype=torch.bfloat16, device_map="auto")

# A single-turn translation request in the chat format the model was trained on.
messages = [
    {"role": "user", "content": "Translate the following text from Portuguese into English.\nPortuguese: Um grupo de investigadores lançou um novo modelo para tarefas relacionadas com tradução.\nEnglish:"},
]

# Render the messages with the model's ChatML template, appending the assistant turn marker.
prompt = pipe.tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
outputs = pipe(prompt, max_new_tokens=256, do_sample=False)  # greedy decoding
print(outputs[0]["generated_text"])
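Note that when given a string prompt, the text-generation pipeline returns the prompt followed by the model's completion in generated_text; pass return_full_text=False to pipe() if you only want the translation itself.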
✨ Features
- Multiple Translation Tasks: Capable of handling general machine translation (sentence- and paragraph/document-level), automatic post-edition, named-entity recognition, grammatical error correction, and paraphrase generation.
- Multilingual Support: Supports English, Portuguese, Spanish, French, German, Dutch, Italian, Korean, Chinese, and Russian.
- Improved Version: TowerInstruct-7B-v0.2 has more reliable document-level translation capabilities than TowerInstruct-7B-v0.1.
📦 Installation
The installation steps are included in the quick-start code example above. For transformers versions <= v4.34 you may need to install it from source; you will also need accelerate for device_map="auto".
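For example, assuming a standard pip environment, both dependencies can typically be installed with:

pip install -U transformers accelerate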
📚 Documentation
Model Details
Model Description
TowerInstruct-7B is a language model resulting from fine-tuning TowerBase on the TowerBlocks supervised fine-tuning dataset; TowerInstruct-7B is the first model in the series, and v0.2 is its updated release. The model is trained to handle several translation-related tasks. More details will be released in the upcoming technical report. You can check the model results here.
| Property | Details |
|----------|---------|
| Developed by | Unbabel, Instituto Superior Técnico, CentraleSupélec University of Paris-Saclay |
| Model Type | A 7B parameter model fine-tuned on a mix of publicly available, synthetic datasets on translation-related tasks, as well as conversational datasets and code instructions. |
| Language(s) (NLP) | English, Portuguese, Spanish, French, German, Dutch, Italian, Korean, Chinese, Russian |
| License | CC-BY-NC-4.0; Llama 2 is licensed under the LLAMA 2 Community License, Copyright © Meta Platforms, Inc. All Rights Reserved. |
| Finetuned from model | TowerBase |
Update: TowerInstruct-7B-v0.2 has more reliable document-level translation capabilities compared to TowerInstruct-7B-v0.1. The new version of TowerBlocks used to train v0.2 is also available in the Tower collection.
Intended uses & limitations
The model was initially fine-tuned on a filtered and preprocessed supervised fine-tuning dataset (TowerBlocks), which contains diverse data sources:
- Translation (sentence and paragraph-level)
- Automatic Post Edition
- Machine Translation Evaluation
- Context-aware Translation
- Terminology-aware Translation
- Multi-reference Translation
- Named-entity Recognition
- Paraphrase Generation
- Synthetic Chat data
- Code instructions
You can find the dataset and all data sources of TowerBlocks here.
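As an illustration of using the model for one of these supervised tasks at inference time, the sketch below reuses the pipe object from the Quick Start with a hypothetical paraphrase-generation prompt; the exact templates used during training are published in TowerBlocks and may be worded differently.

# Hypothetical paraphrase-generation prompt; `pipe` comes from the Quick Start above.
messages = [
    {"role": "user", "content": "Paraphrase the following sentence.\nSentence: A group of researchers released a new model for translation-related tasks.\nParaphrase:"},
]
prompt = pipe.tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
print(pipe(prompt, max_new_tokens=128, do_sample=False)[0]["generated_text"])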
Out-of-Scope Use
The model is not guaranteed to perform well for languages other than the 10 languages it supports. Even though it was trained on conversational data and code instructions, it is not intended to be used as a conversational chatbot or code assistant. The model is also not intended to be used as a document-level translator, as the team is currently working on improving quality and consistency on document-level translation.
Bias, Risks, and Limitations
TowerInstruct-v0.2 has not been aligned to human preferences, so the model may generate problematic outputs (e.g., hallucinations, harmful content, or false statements).
Prompt Format
TowerInstruct-v0.2 was trained using the ChatML prompt templates without any system prompts. An example is as follows:
<|im_start|>user
{USER PROMPT}<|im_end|>
<|im_start|>assistant
{MODEL RESPONSE}<|im_end|>
<|im_start|>user
[...]
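You normally do not need to build this string by hand: the tokenizer ships this template, and apply_chat_template reproduces it, as in the Quick Start. A minimal sketch:

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("Unbabel/TowerInstruct-7B-v0.2")
messages = [{"role": "user", "content": "Translate 'olá' into English."}]

# With add_generation_prompt=True, this prints the user turn wrapped in
# <|im_start|>/<|im_end|> markers, followed by an opening assistant turn.
print(tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True))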
Supervised tasks
The prompts for all supervised tasks can be found in TowerBlocks. Multiple prompt templates were used for each task. While different prompts may yield different outputs, the difference in downstream performance should be minimal.
Training Details
Training Data
Link to TowerBlocks.
Training Hyperparameters
The following hyperparameters were used during training:
- total_train_batch_size: 256
- learning_rate: 7e-06
- lr_scheduler_type: cosine
- lr_scheduler_warmup_steps: 500
- weight_decay: 0.01
- optimizer: Adam with betas=(0.9, 0.999) and epsilon=1e-08
- num_epochs: 4
- max_seq_length: 2048
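For readers mapping these values onto the Hugging Face Trainer, here is a minimal, hypothetical sketch using TrainingArguments. The per-device batch size / gradient-accumulation split, the bf16 flag, and the output path are assumptions (only the values listed above come from the card), and max_seq_length is applied when tokenizing/packing the dataset rather than here.

from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="towerinstruct-sft",   # hypothetical output path
    per_device_train_batch_size=8,    # assumed split: 8 per device x 4 accumulation x 8 devices = 256
    gradient_accumulation_steps=4,
    learning_rate=7e-06,
    lr_scheduler_type="cosine",
    warmup_steps=500,
    weight_decay=0.01,
    adam_beta1=0.9,                   # Adam betas=(0.9, 0.999)
    adam_beta2=0.999,
    adam_epsilon=1e-08,
    num_train_epochs=4,
    bf16=True,                        # assumption, matching the bfloat16 inference example
)
# max_seq_length (2048) is enforced at tokenization/packing time, not in TrainingArguments.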
📄 License
The model is licensed under CC-BY-NC-4.0. Llama 2 is licensed under the LLAMA 2 Community License, Copyright © Meta Platforms, Inc. All Rights Reserved.
📖 Citation
@misc{tower_llm_2024,
  title={Tower: An Open Multilingual Large Language Model for Translation-Related Tasks},
  author={Duarte M. Alves and José Pombal and Nuno M. Guerreiro and Pedro H. Martins and João Alves and Amin Farajian and Ben Peters and Ricardo Rei and Patrick Fernandes and Sweta Agrawal and Pierre Colombo and José G. C. de Souza and André F. T. Martins},
  year={2024},
  eprint={2402.17733},
  archivePrefix={arXiv},
  primaryClass={cs.CL}
}
