オープンソースのMistral-7B-Instruct-v0.2モデル - 枝刈り圧縮され、再トレーニング不要で高パフォーマンスを維持

ホーム

Mistral 7B Instruct V0.2 Sparsity 20 V0.1

wang7776によって開発

Mistral-7B-Instruct-v0.2はMistral-7B-Instruct-v0.1を改良した命令微調整大規模言語モデルで、Wanda枝刈り手法を用いて2%の疎性に圧縮され、再訓練不要で競争力のある性能を維持します。

大規模言語モデル

Transformers

オープンソースライセンス:Apache-2.0 #命令微調整最適化 #再訓練不要の枝刈り #マルチターン対話サポート

ダウンロード数 80

リリース時間 : 1/17/2024

モデル概要

このモデルは命令微調整された大規模言語モデルで、テキスト生成タスクに主に使用され、特に命令追従能力が最適化されています。

モデル特徴

Wanda枝刈り技術

Wanda枝刈り手法を用いて疎性を2%に圧縮、再訓練や重み更新不要で競争力のある性能を維持

改良された命令微調整

v0.1版と比較して命令微調整が改良され、より優れた命令追従能力を提供

効率的な注意機構

グループ化クエリ注意とスライディングウィンドウ注意機構を採用し、推論効率を向上

モデル能力

テキスト生成

命令追従

対話システム

使用事例

対話システム

料理アシスタント

調味料の選択やレシピに関する質問に回答する料理アシスタントとして利用可能

詳細な調味料の好みやレシピ提案を提供可能

汎用QA

知識QA

様々な知識質問に回答するために使用可能

🚀 Mistral-7B-Instruct-v0.2の概要

このモデルは、Wanda pruning methodを使用して2%の疎度に剪定されています。この方法では再トレーニングや重みの更新を必要とせず、依然として競争力のある性能を達成します。ベースモデルへのリンクはこちらです。

🚀 クイックスタート

Mistral-7B-Instruct-v0.2 Large Language Model (LLM) は、Mistral-7B-Instruct-v0.1 を改良した命令に基づくファインチューニング版です。このモデルの詳細については、論文とリリースブログ記事をご覧ください。

✨ 主な機能

命令形式

命令に基づくファインチューニングを活用するために、プロンプトは [INST] と [/INST] トークンで囲む必要があります。最初の命令は文頭識別子で始める必要があります。次の命令はその必要はありません。アシスタントの生成は文末トークン識別子で終了します。

例:

text = "<s>[INST] What is your favourite condiment? [/INST]"
"Well, I'm quite partial to a good squeeze of fresh lemon juice. It adds just the right amount of zesty flavour to whatever I'm cooking up in the kitchen!</s> "
"[INST] Do you have mayonnaise recipes? [/INST]"

この形式は、apply_chat_template() メソッドを介してチャットテンプレートとして利用できます。

💻 使用例

基本的な使用法

from transformers import AutoModelForCausalLM, AutoTokenizer

device = "cuda" # the device to load the model onto

model = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-Instruct-v0.2")
tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-Instruct-v0.2")

messages = [
    {"role": "user", "content": "What is your favourite condiment?"},
    {"role": "assistant", "content": "Well, I'm quite partial to a good squeeze of fresh lemon juice. It adds just the right amount of zesty flavour to whatever I'm cooking up in the kitchen!"},
    {"role": "user", "content": "Do you have mayonnaise recipes?"}
]

encodeds = tokenizer.apply_chat_template(messages, return_tensors="pt")

model_inputs = encodeds.to(device)
model.to(device)

generated_ids = model.generate(model_inputs, max_new_tokens=1000, do_sample=True)
decoded = tokenizer.batch_decode(generated_ids)
print(decoded[0])

🔧 技術詳細

モデルアーキテクチャ

この命令モデルは、Mistral-7B-v0.1に基づいており、以下のアーキテクチャが選択されています。

Grouped-Query Attention
Sliding-Window Attention
Byte-fallback BPE tokenizer

トラブルシューティング

次のエラーが表示された場合:

Traceback (most recent call last):
File "", line 1, in
File "/transformers/models/auto/auto_factory.py", line 482, in from_pretrained
config, kwargs = AutoConfig.from_pretrained(
File "/transformers/models/auto/configuration_auto.py", line 1022, in from_pretrained
config_class = CONFIG_MAPPING[config_dict["model_type"]]
File "/transformers/models/auto/configuration_auto.py", line 723, in getitem
raise KeyError(key)
KeyError: 'mistral'

ソースからtransformersをインストールすると問題が解決するはずです。

pip install git+https://github.com/huggingface/transformers

これはtransformers-v4.33.4以降では必要ないはずです。

制限事項

Mistral 7B Instructモデルは、ベースモデルを簡単にファインチューニングして魅力的な性能を達成できることをすばやく実証するものです。このモデルにはモデレーションメカニズムがありません。モデルがガードレールをきめ細かく尊重し、モデレートされた出力が必要な環境でのデプロイを可能にする方法について、コミュニティと協力することを楽しみにしています。

Mistral AIチーム

Albert Jiang, Alexandre Sablayrolles, Arthur Mensch, Blanche Savary, Chris Bamford, Devendra Singh Chaplot, Diego de las Casas, Emma Bou Hanna, Florian Bressand, Gianna Lengyel, Guillaume Bour, Guillaume Lample, Lélio Renard Lavaud, Louis Ternon, Lucile Saulnier, Marie-Anne Lachaux, Pierre Stock, Teven Le Scao, Théophile Gervet, Thibaut Lavril, Thomas Wang, Timothée Lacroix, William El Sayed.