maestrale-chat-v0.4-beta開源意大利語聊天模型 - 經大規模語料訓練高效對話

首頁

Maestrale Chat V0.4 Beta

由mii-llm開發

基於 Mistral-7b 的意大利語聊天模型，經過大規模意大利語語料庫預訓練和微調

大型語言模型

Transformers

其他#意大利語對話 #多輪SFT微調 #DPO對齊

下載量 6,555

發布時間 : 6/6/2024

模型概述

這是一個意大利語聊天模型，基於 Mistral-7b 架構，經過專門的意大利語預訓練和指令微調，具備對話、推理和多種專業任務處理能力。

模型特點

意大利語優化

專門針對意大利語進行了大規模預訓練和微調

多輪對話能力

在170萬對話/指令數據上進行了2輪SFT微調

DPO對齊

使用多個數據集通過DPO進行對齊訓練

多功能支持

支持思維導圖生成、SQL轉換、文章撰寫等多種專業任務

模型能力

意大利語對話

文本生成

指令跟隨

思維導圖生成

SQL查詢轉換

文章撰寫

數學推理

拉丁語翻譯

詩歌創作

使用案例

教育

語言學習輔助

幫助學習意大利語的學生進行對話練習

拉丁語翻譯

提供拉丁語到意大利語的翻譯服務

商業

數據庫查詢

將自然語言問題轉換為SQL查詢

創意寫作

詩歌創作

根據主題生成意大利語詩歌

文章撰寫

根據目錄結構自動生成完整文章

🚀 Maestrale chat beta ༄

Maestrale chat beta 是一款專為意大利語設計的語言模型，通過在精心策劃的大規模高質量語料庫上進行持續預訓練，並結合了SFT和DPO微調技術，在多個任務上展現出了良好的性能。它能處理多種複雜的任務，如生成Mermaid思維導圖、SQL查詢語句和文章等。

✨ 主要特性

語言模型：基於Mistral - 7b的意大利語模型，在精心挑選的大規模高質量意大利語語料庫上進行持續預訓練，並與 occiglot 合併。
微調：在170萬個對話/指令上進行了2個輪次的SFT微調。
DPO對齊：在多個數據集上使用DPO進行對齊。
v0.4版本更新：
- 新增Agent功能。
- 提高了回答的真實性。
- 增強了數學和推理能力。
- 支持Mermaid思維導圖。
- 提供更多拉丁語翻譯、詩歌等內容。

📦 安裝指南

文檔未提供具體安裝步驟，暫不展示安裝指南相關內容。

💻 使用示例

基礎用法

from transformers import (
    AutoTokenizer, 
    AutoModelForCausalLM, 
    GenerationConfig,
    TextStreamer
)
import torch

tokenizer = AutoTokenizer.from_pretrained("mii-llm/maestrale-chat-v0.4-beta")
model = AutoModelForCausalLM.from_pretrained("mii-llm/maestrale-chat-v0.4-beta", load_in_8bit=True, device_map="auto")

gen = GenerationConfig(
    do_sample=True,
    temperature=0.7,
    repetition_penalty=1.2,
    top_k=50,
    top_p=0.95,
    max_new_tokens=500,
    pad_token_id=tokenizer.eos_token_id,
    eos_token_id=tokenizer.convert_tokens_to_ids("<|im_end|>")
)

streamer = TextStreamer(tokenizer, skip_prompt=True)

messages = [
    {"role": "system", "content": "Sei un assistente utile."},
    {"role": "user", "content": "{prompt}"}
]

with torch.no_grad():
    temp = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
    inputs = tokenizer(temp, return_tensors="pt").to("cuda")

    _ = model.generate(
        **inputs,
        streamer=streamer,
        generation_config=gen
    )

高級用法

生成Mermaid思維導圖

messages = [
  {"role": "system", "content": "Fornisci una mindmap Mermaid sull'argomento in input."},
  {"role": "user", "content": "Argomento: [argomento]"}
]

生成SQL查詢語句

schema = "[db schema]"
messages = [
  {"role": "system", "content": f"Sei un assistente SQL e il tuo compito è convertire la domanda dell'utente in codice SQL valido rispetto allo schema del database fornito.\n\nSchema:\n```sql\n{schema}\n```"},
  {"role": "user", "content": "Conta il numero di X prodotti dall'azienda Y"}
]

根據標題和索引生成文章

messages = [
  {"role": "system", "content": "Sei un assistente utile."},
  {"role": "user", "content": (
    "Scrivi un articolo a partire dal titolo e dall'indice dei contenuti.\n\n"
    "Titolo: [titolo]\n\n"
    "Indice:\n\n"
    "1. Introduzione\n"
    "2. [heading]\n"
    "..."
  )}
]

📚 詳細文檔

模型得分

任務	版本	過濾器	n-shot	指標	值		標準誤差
hellaswag_it	1	none	0	acc	0.5270	±	0.0052
		none	0	acc_norm	0.7037	±	0.0048
arc_it	1	none	0	acc	0.1771	±	0.0112
		none	0	acc_norm	0.5218	±	0.0146
m_mmlu_it	0	none	5	acc	0.5623	±	0.0043