Mistral-7B-Instruct-Ukrainianオープンソース大規模言語モデル - ウクライナ語のコミュニケーションアプリケーションを無料でデプロイしサポート

ホーム

Mistral 7B Instruct Ukrainian

SherlockAssistantによって開発

ウクライナ語に最適化されたオープンソース大規模言語モデルで、ファインチューニング、モデル融合、直接選好最適化の3段階トレーニングプロセスで構築

大規模言語モデル

Transformers

オープンソースライセンス:Apache-2.0 #ウクライナ語最適化 #命令ファインチューニング #DPO強化

ダウンロード数 1,443

リリース時間 : 2/26/2024

モデル概要

Mistral-7B-v0.2を基に改良したウクライナ語大規模言語モデルで、ウクライナ語の理解と生成能力を特別に最適化し、質問応答や命令追従などのタスクに適している

モデル特徴

ウクライナ語最適化

ウクライナ語向けに特別に設計された3段階トレーニング最適化（ファインチューニング、モデル融合、直接選好最適化）を実施

効率的な注意機構

グループ化クエリ注意とスライディングウィンドウ注意機構を採用し、推論効率を向上

命令追従能力

[INST]命令フォーマットをサポートし、ユーザーの命令を正確に理解して実行可能

モデル能力

ウクライナ語テキスト生成

質問応答システム

命令理解と実行

知識検索

使用事例

教育

ウクライナ語試験問題解答

ウクライナZNO試験関連の質問に回答

UNLP 2024評価データセットで良好な性能を発揮

技術サポート

技術質問応答

ウクライナStackExchangeの技術質問に回答

🚀 Mistral-7B-Instruct-Ukrainianのモデルカード

Mistral-7B-UKは、ウクライナ語向けにファインチューニングされた大規模言語モデルです。

Mistral-7B-UKは、以下の手順で学習されています。

Mistral-7B-v0.2を構造化および非構造化データセットを用いて初期ファインチューニング。
ファインチューニングされたモデルと、OpenLLMベンチマークでMistral-7B-v0.2よりも優れた性能を発揮するモデルNeuralTrix-7BをSLERPマージ。
最終モデルのDPO。

🚀 クイックスタート

指示書の形式

指示のファインチューニングを活用するには、プロンプトを[INST]と[/INST]トークンで囲む必要があります。

例:

text = "[INST]Відповідайте лише буквою правильної відповіді: Елементи експресіонізму наявні у творі: A. «Камінний хрест», B. «Інститутка», C. «Маруся», D. «Людина»[/INST]"

この形式は、apply_chat_template()メソッドを介してチャットテンプレートとして利用できます。

モデルアーキテクチャ

この指示モデルは、Mistral-7B-v0.2をベースにしたトランスフォーマーモデルで、以下のアーキテクチャが採用されています。

グループ化クエリアテンション
スライディングウィンドウアテンション
バイトフォールバックBPEトークナイザー

データセット - 構造化

データセット - 非構造化

ウクライナ語ウィキ

データセット - DPO

distilabel-indel-orca-dpo-pairsのウクライナ語翻訳

💻 使用例

基本的な使用法

!pip install -qU transformers accelerate

from transformers import AutoTokenizer
import transformers
import torch

model = "SherlockAssistant/Mistral-7B-Instruct-Ukrainian"
messages = [{"role": "user", "content": "What is a large language model?"}]

tokenizer = AutoTokenizer.from_pretrained(model)
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
pipeline = transformers.pipeline(
    "text-generation",
    model=model,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
print(outputs[0]["generated_text"])

📚 ドキュメント

引用

このモデルを研究で使用して論文を公開する場合は、以下のように引用していただけると助かります。

BIB

@inproceedings{boros-chivereanu-dumitrescu-purcaru-2024-llm-uk,
    title = "Fine-tuning and Retrieval Augmented Generation for Question Answering using affordable Large Language Models",
    author = "Boros, Tiberiu and Chivereanu, Radu and Dumitrescu, Stefan Daniel and Purcaru, Octavian",
    booktitle = "Proceedings of the Third Ukrainian Natural Language Processing Workshop, LREC-COLING",
    month = may,
    year = "2024",
    address = "Torino, Italy",
    publisher = "European Language Resources Association",
}

APA Boros, T., Chivereanu, R., Dumitrescu, S., & Purcaru, O. (2024). Fine-tuning and Retrieval Augmented Generation for Question Answering using affordable Large Language Models. In Proceedings of the Third Ukrainian Natural Language Processing Workshop, LREC-COLING. European Language Resources Association.

MLA Boros, Tiberiu, Radu, Chivereanu, Stefan Daniel, Dumitrescu, Octavian, Purcaru. "Fine-tuning and Retrieval Augmented Generation for Question Answering using affordable Large Language Models." Proceedings of the Third Ukrainian Natural Language Processing Workshop, LREC-COLING. European Language Resources Association, 2024.

Chicago Boros, Tiberiu, Radu, Chivereanu, Stefan Daniel, Dumitrescu, and Octavian, Purcaru. "Fine-tuning and Retrieval Augmented Generation for Question Answering using affordable Large Language Models." . In Proceedings of the Third Ukrainian Natural Language Processing Workshop, LREC-COLING. European Language Resources Association, 2024.