Italia-9B-Instruct-v0.1: an open-source large language model that understands Italian language and culture and performs well in English and translation

Italia 9B Instruct V0.1

Developed by iGeniusAI

Italia 9B is an open-source large language model developed by iGenius and designed specifically for the Italian language. It captures the linguistic and cultural nuances of Italian and also performs well in English and in translation tasks.

Large Language Model

Transformers

Open Source License: MIT

Tags: Italian language optimization, Enterprise-level LLM, Cultural nuance understanding

Release Time: 7/4/2024

Model Overview

Italia 9B is a foundational large language model based on a Transformer architecture with 9 billion parameters, designed for enterprises in the public and private sectors. It can provide secure, efficient, and accurate artificial intelligence solutions.

Model Features

Designed specifically for Italian

Italia 9B is trained specifically for Italian, so it captures the linguistic and cultural nuances of the language.

High-performance training

It was trained and fine-tuned at scale on the Leonardo supercomputer, one of the world's most advanced high-performance computing infrastructures.

Training with multiple data sources

It was trained from scratch on trillions of tokens, mostly Italian, drawn from public resources, synthetic data, and domain-specific content provided by business partners.

Advanced post-training process

Post-training with supervised fine-tuning (SFT) and direct preference optimization (DPO) improves instruction following and adds safety measures.

Model Capabilities

Italian text generation

English text generation

Translation tasks

Instruction following

Natural language understanding

Logical reasoning

Use Cases

Enterprise applications

Customer service: automated responses to customer requests in Italian and English, for efficient and accurate customer interactions.

Content generation: business content in Italian and English, such as reports and emails, with high-quality output.

Translation tasks

Italian-English translation: translates text from Italian into English and vice versa, with high-quality output.

🚀 Italia 9B - Instruct v0.1

Italia 9B - Instruct v0.1 is an open-source large language model developed by iGenius. It is designed for companies in the public and private sectors, performs well in Italian and English, and is suitable for a range of NLP tasks.

🚀 Quick Start

For more details on Italia and iGenius, please visit our website and read our release blog post.

Use with transformers

from transformers import pipeline, AutoTokenizer, AutoModelForCausalLM

model_id = "iGeniusAI/Italia-9B-Instruct-v0.1"

# Load with device_map="auto" (needs the accelerate package) so the weights
# are placed on the available devices at load time.
model = AutoModelForCausalLM.from_pretrained(
    model_id, trust_remote_code=True, device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Sampling settings given here become defaults for every call to t_pipeline.
t_pipeline = pipeline(
    "text-generation",
    model=model,
    tokenizer=tokenizer,
    return_full_text=False,  # return only the newly generated text
    top_p=0.95,
    top_k=50,
)

SYSTEM_PROMPT = """Il tuo nome è Modello Italia. Tu sei un'intelligenza artificiale, un modello di linguaggio naturale addestrato da iGenius su Leonardo, uno dei supercomputer più potenti al mondo."""
TEMPERATURE = 0.3
MAX_NEW_TOKENS = 250

messages = [
    {"role": "system", "content": SYSTEM_PROMPT},
    {"role": "user", "content": "Ciao come stai?"},
]

# Render the conversation as one prompt string in the model's chat format.
# add_generation_prompt=True asks the template to end the prompt with the
# assistant header, if the template supports that flag.
conv_template = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True,
)

outputs = t_pipeline(
    conv_template,
    max_new_tokens=MAX_NEW_TOKENS,
    do_sample=True,
    temperature=TEMPERATURE,
    num_return_sequences=1,
)
print(outputs[0]["generated_text"])
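
The same message structure can carry a translation request. The sketch below only builds the messages list; the helper name translation_messages and the Italian wording of the instruction are illustrative assumptions, not a prompt documented by iGenius.

```python
def translation_messages(text, source="inglese", target="italiano", system_prompt=None):
    """Build a chat messages list asking for a translation (hypothetical helper)."""
    messages = []
    if system_prompt:
        messages.append({"role": "system", "content": system_prompt})
    # Assumed instruction wording; adjust it to whatever works best in practice.
    request = f"Traduci in {target} il seguente testo in {source}:\n\n{text}"
    messages.append({"role": "user", "content": request})
    return messages


messages = translation_messages("The train leaves at nine.")
print(messages[0]["content"])
```

The resulting list would then go through tokenizer.apply_chat_template and the pipeline call in the same way as the greeting example.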

Chat Format

Italia 9B Instruct is fine-tuned to follow instructions provided by a user, so for best results use the chat format as follows:

<|system|>
Your system prompt.</s>
<|user|>
user request.</s>
<|assistant|>

For example:

<|system|>
Il tuo nome è Modello Italia. Tu sei un'intelligenza artificiale, un modello di linguaggio naturale addestrato da iGenius su Leonardo, uno dei supercomputer più potenti al mondo.</s>
<|user|>
Scrivi una funzione python che genera numeri random.</s>
<|assistant|>

The model generates its reply after <|assistant|>.
</s> is the end-of-sequence (EOS) token.
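
When the tokenizer's chat template is not available (for example, when sending raw prompts to an inference server), the layout above can be assembled by hand. This is a minimal sketch: the function name format_chat is hypothetical, and the exact whitespace, in particular the newline after <|assistant|>, is an assumption read off the example rather than a documented guarantee. Where possible, tokenizer.apply_chat_template remains the safer choice.

```python
EOS = "</s>"  # end-of-sequence token named in the chat format above


def format_chat(messages):
    """Render {"role", "content"} dicts in the <|role|> ... </s> layout."""
    parts = []
    for message in messages:
        role = message["role"]
        if role not in ("system", "user", "assistant"):
            raise ValueError(f"unexpected role: {role!r}")
        parts.append(f"<|{role}|>\n{message['content']}{EOS}\n")
    # Leave the assistant header open so the model writes the reply after it.
    parts.append("<|assistant|>\n")
    return "".join(parts)


prompt = format_chat([
    {"role": "system", "content": "Your system prompt."},
    {"role": "user", "content": "user request."},
])
print(prompt)
```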

✨ Features

Multilingual Proficiency: Italia 9B is proficient in both Italian and English. More than 90% of its pre-training data consists of Italian text, with the remaining portion in English, enabling it to perform well in translation tasks.
High-Precision Understanding: Trained from scratch in Italian on trillions of tokens, it can understand Italian linguistic and cultural nuances with high precision.
Instruction-Following: The model has undergone a post-training process including supervised fine-tuning and direct preference optimization to enhance instruction-following capabilities.
Safety Measures: Robust safety measures are ensured through the post-training process.

📦 Installation

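The code example in Quick Start uses the transformers library with a PyTorch backend, and device_map="auto" additionally requires accelerate. You can install them with the following command:

pip install transformers torch accelerate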

📚 Documentation

Introduction

Italia is a family of open-source large language models developed by iGenius, designed for companies operating in the public and private sectors. The first model in the series, Italia 9B, is a foundational LLM with a 9-billion-parameter Transformer architecture, developed in collaboration with Cineca and released under the MIT license.

Hardware and Software

Thanks to the partnership with Cineca, Italia 9B was trained and fine-tuned on a large scale using thousands of GPUs on the Leonardo supercomputer, one of the most advanced and high-performing computing infrastructures in the world.

Training

Italia 9B was trained from scratch in Italian on trillions of tokens, using a heterogeneous mix of data: public sources, synthetic data, and domain-specific content provided by commercial partners. More than 90% of the pre-training data is in Italian, and the rest is in English. The model has also undergone a post-training process including supervised fine-tuning and direct preference optimization. The pre-training data has a cutoff date of December 2023.

Benchmarks

All existing benchmarks for evaluating language models are designed for the English-speaking ecosystem. Italia demonstrated nearly state-of-the-art performance among models of a similar size when assessed against benchmarks testing common sense, language understanding, and logical reasoning. Here are the benchmark results generated with llm-harness:

Benchmark	Score
xcopa_it	0.73
lambada_openai_mt_it (perplexity)	40.6
lambada_openai_mt_it (acc)	0.43
m_mmlu_it (5-shot)	0.42
arc_it (5-shot)	0.43
belebele_ita_Latn (5-shot)	0.46
hellaswag_it (5-shot)	0.55
truthfulqa_it_mc1 (0-shot)	0.30
truthfulqa_it_mc2 (0-shot)	0.42
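
The table mixes accuracy-style scores, where higher is better, with one perplexity value, where lower is better. As a reminder of what the perplexity row measures, the sketch below computes the generic definition: the exponential of the mean negative log-likelihood. The input values are toy numbers, not model output, and the harness's own scoring and aggregation may differ in detail.

```python
import math


def perplexity(log_probs):
    """exp of the mean negative natural-log probability of the scored items."""
    if not log_probs:
        raise ValueError("need at least one log-probability")
    return math.exp(-sum(log_probs) / len(log_probs))


# Toy values, not model output: four items, each assigned probability 0.5.
toy = [math.log(0.5)] * 4
print(round(perplexity(toy), 6))  # 2.0
```

Read this way, a perplexity of 2 means the model is, on average, as uncertain as if it were choosing between two equally likely options.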

Intended Use

Italia 9B is suitable for both commercial and research purposes, focusing on the Italian language. It can be used for a wide range of natural language processing tasks. However, users should consider aspects such as attribution, limitation of liability, sharing modifications, and compatibility according to the MIT license.

Out of Scope

Italia should not be used for applications that involve violations of law, infringement of privacy, malicious activities, misinformation, or discriminatory practices, nor for coding tasks.

Limitations

As a new technology, Italia may produce inaccurate, biased, or otherwise objectionable responses. Developers are advised to perform safety testing before deploying applications based on it.
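
One very simple form such pre-deployment testing could take is a loop that sends a fixed set of test prompts to the application and flags responses containing terms that should never appear. The sketch below is illustrative only: run_safety_checks, fake_generate, the prompt, and the blocked terms are all hypothetical, and keyword matching is a crude check that does not replace human review or a proper evaluation suite.

```python
def run_safety_checks(generate, prompts, blocked_terms):
    """Return (prompt, response) pairs whose response contains a blocked term."""
    failures = []
    for prompt in prompts:
        response = generate(prompt)
        lowered = response.lower()
        if any(term.lower() in lowered for term in blocked_terms):
            failures.append((prompt, response))
    return failures


# Stand-in for a real model call, used here only so the sketch runs on its own.
def fake_generate(prompt):
    return "Mi dispiace, non posso aiutarti con questa richiesta."


failures = run_safety_checks(
    fake_generate,
    prompts=["Example adversarial prompt"],
    blocked_terms=["password", "numero di carta"],
)
print(len(failures))  # 0
```

In a real setup, generate would wrap the application's actual call into the model, and any flagged pair would be reviewed by a person before release.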

Contributors

The iGenius Team. Special thanks to Cineca and their team for their support.

📄 License

Italia 9B is released under the MIT license.
