🚀 RakutenAI-7B-chat
RakutenAI-7B-chat is a systematic initiative that brings the latest techniques to the Japanese large language model field. The model achieves the best scores on Japanese language understanding benchmarks while maintaining competitive performance on English test sets compared to similar models such as OpenCalm, Elyza, Youri, Nekomata, and Swallow.
🚀 Quick Start
RakutenAI-7B-chat performs strongly on both Japanese and English language processing. If you are looking for a foundation model, see RakutenAI-7B; if you need an instruction-tuned model, see RakutenAI-7B-instruct.
✨ Key Features
- Excellent performance: RakutenAI-7B achieves the best scores on Japanese language understanding benchmarks while remaining competitive on English test sets.
- Advanced architecture: built on the Mistral model architecture, starting from the Mistral-7B-v0.1 pre-trained checkpoint.
- Extended vocabulary: extends Mistral's vocabulary from 32k to 48k tokens, providing a better character-per-token ratio for Japanese.
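The character-per-token ratio mentioned above can be measured directly. The sketch below is an illustration only (the helper and the example segmentations are hypothetical, not taken from the RakutenAI tokenizer): a larger vocabulary tends to cover the same Japanese text with fewer tokens, which raises this ratio and lowers per-request token counts.

```python
def chars_per_token(text: str, tokens: list[str]) -> float:
    """Average number of source characters covered by each token.

    A higher ratio means the tokenizer packs more text into each token,
    which reduces sequence length (and cost) for that language.
    """
    if not tokens:
        raise ValueError("token list must not be empty")
    return len(text) / len(tokens)


# Hypothetical segmentations for illustration: a smaller vocabulary
# might split a Japanese word into per-character pieces, while a
# larger one can keep it as a single token.
text = "走った"  # 3 characters
print(chars_per_token(text, ["走", "っ", "た"]))  # 3 chars / 3 tokens = 1.0
print(chars_per_token(text, ["走った"]))          # 3 chars / 1 token  = 3.0
```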
💻 Usage Examples
Basic Usage
```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_path = "Rakuten/RakutenAI-7B-chat"
tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(model_path, torch_dtype="auto", device_map="auto")
model.eval()

chat = [
    {"role": "system", "content": "A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the user's questions."},
    {"role": "user", "content": "How to make an authentic Spanish Omelette?"},
]

input_ids = tokenizer.apply_chat_template(chat, tokenize=True, add_generation_prompt=True, return_tensors="pt").to(device=model.device)
tokens = model.generate(
    input_ids,
    max_length=4096,
    do_sample=False,
    num_beams=1,
    pad_token_id=tokenizer.eos_token_id,
)
# Decode only the newly generated tokens, skipping the prompt.
out = tokenizer.decode(tokens[0][len(input_ids[0]):], skip_special_tokens=True)
print("ASSISTANT:\n" + out)
print()
```
```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_path = "Rakuten/RakutenAI-7B-chat"
tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(model_path, torch_dtype="auto", device_map="auto")
model.eval()

requests = [
    "「馬が合う」はどう言う意味ですか",
    "How to make an authentic Spanish Omelette?",
]

system_message = "A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the user's questions. USER: {user_input} ASSISTANT:"

for req in requests:
    input_req = system_message.format(user_input=req)
    input_ids = tokenizer.encode(input_req, return_tensors="pt").to(device=model.device)
    tokens = model.generate(
        input_ids,
        max_new_tokens=1024,
        do_sample=True,
        pad_token_id=tokenizer.eos_token_id,
    )
    # Decode only the newly generated tokens, skipping the prompt.
    out = tokenizer.decode(tokens[0][len(input_ids[0]):], skip_special_tokens=True)
    print("USER:\n" + req)
    print("ASSISTANT:\n" + out)
    print()
```
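The two examples above differ in decoding strategy: the first is greedy (`do_sample=False`, `num_beams=1`), while the second samples from the model's output distribution (`do_sample=True`). As a minimal, self-contained illustration of what sampling does (pure Python, not transformers code; the function names are our own), temperature-scaled sampling converts logits into a probability distribution and draws from it:

```python
import math
import random


def temperature_softmax(logits, temperature=1.0):
    """Convert raw logits into a probability distribution.

    Lower temperature sharpens the distribution (closer to greedy
    decoding); higher temperature flattens it (more diverse samples).
    """
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample_token(logits, temperature=1.0, rng=random):
    """Draw one token index from the temperature-scaled distribution."""
    probs = temperature_softmax(logits, temperature)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


logits = [2.0, 1.0, 0.1]
print(temperature_softmax(logits, temperature=0.5))  # sharper: top token dominates
print(temperature_softmax(logits, temperature=2.0))  # flatter: more spread out
```

With `do_sample=False`, generation instead always picks the argmax token, which is why the first example's output is deterministic.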
📚 Documentation
Model Details
Limitations and Bias
The RakutenAI-7B suite of models is capable of generating human-like text on a wide range of topics. However, like all large language models, they have limitations and may produce biased, inaccurate, or unsafe outputs. Please exercise caution and judgment when interacting with them.
📄 License
This model is licensed under the Apache License, Version 2.0.
🔧 Technical Details
The technical report is available on arXiv.
📚 Citation
To cite our work on the RakutenAI-7B suite of models, please use the following format:
```bibtex
@misc{rakutengroup2024rakutenai7b,
      title={RakutenAI-7B: Extending Large Language Models for Japanese},
      author={{Rakuten Group, Inc.} and Aaron Levine and Connie Huang and Chenguang Wang and Eduardo Batista and Ewa Szymanska and Hongyi Ding and Hou Wei Chou and Jean-François Pessiot and Johanes Effendi and Justin Chiu and Kai Torben Ohlhus and Karan Chopra and Keiji Shinzato and Koji Murakami and Lee Xiong and Lei Chen and Maki Kubota and Maksim Tkachenko and Miroku Lee and Naoki Takahashi and Prathyusha Jwalapuram and Ryutaro Tatsushima and Saurabh Jain and Sunil Kumar Yadav and Ting Cai and Wei-Te Chen and Yandi Xia and Yuki Nakayama and Yutaka Higashiyama},
      year={2024},
      eprint={2403.15484},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}
```
⚠️ Important Note
This model may produce biased, inaccurate, or unsafe outputs. Please use it with caution.
💡 Usage Tips
Choose the variant that fits your needs: the foundation model (RakutenAI-7B), the instruction-tuned model (RakutenAI-7B-instruct), or this chat model.