オープンソースのコード生成ツールStarcoder2 - 3b - Instruct、Pythonコードの生産を効率的に支援する

ホーム

Starcoder2 3b Instruct

TechxGenusによって開発

starcoder2-3bモデルを微調整した大規模言語モデルで、コード生成タスクに特化しており、HumanEval-Pythonテストで65.9 pass@1の成績を達成

大規模言語モデル

Transformers

その他オープンソースライセンス:Openrail #コード生成 #命令微調整 #HumanEval高スコア

ダウンロード数 44

リリース時間 : 3/5/2024

モデル概要

これは命令微調整されたコード生成モデルで、Alpaca命令形式を採用し、プログラミング関連タスクの処理に優れています

モデル特徴

高品質なコード生成

追加で7億の高品質なコード関連トークンを使用して微調整

効率的なトレーニング

DeepSpeed ZeRO 3とFlash Attention 2を採用してトレーニングを加速

命令追従

Alpaca命令形式に従い、プログラミング関連の命令を理解して実行

モデル能力

コード生成

コード補完

プログラミング問題解答

使用事例

ソフトウェア開発

Pythonコード生成

自然言語の記述に基づいてPythonコードを生成

HumanEval-Pythonテストで65.9 pass@1を達成

プログラミング教育支援

学生がプログラミング概念を理解し、サンプルコードを生成するのを支援

🚀 starcoder2-instruct

このモデルは、starcoder2-3bを追加の07億個の高品質なコード関連トークンで3エポック微調整したものです。DeepSpeed ZeRO 3とFlash Attention 2を使用してトレーニングプロセスを高速化しました。HumanEval-Pythonで65.9 pass@1を達成しています。このモデルはAlpaca命令形式（システムプロンプトを除く）で動作します。

starcoder2-instruct

🚀 クイックスタート

このモデルは、追加の0.7億個の高品質なコード関連トークンで微調整されたstarcoder2-3bベースのテキスト生成モデルです。DeepSpeed ZeRO 3とFlash Attention 2を用いてトレーニングが高速化されています。

✨ 主な機能

starcoder2-3bを追加の0.7億個の高品質なコード関連トークンで3エポック微調整。
DeepSpeed ZeRO 3とFlash Attention 2を使用した高速トレーニング。
HumanEval-Pythonで65.9 pass@1を達成。
Alpaca命令形式（システムプロンプトを除く）で動作。

💻 使用例

基本的な使用法

from transformers import AutoTokenizer, AutoModelForCausalLM
import torch
PROMPT = """### Instruction
{instruction}
### Response
"""
instruction = <Your code instruction here>
prompt = PROMPT.format(instruction=instruction)
tokenizer = AutoTokenizer.from_pretrained("TechxGenus/starcoder2-3b-instruct")
model = AutoModelForCausalLM.from_pretrained(
    "TechxGenus/starcoder2-3b-instruct",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
inputs = tokenizer.encode(prompt, return_tensors="pt")
outputs = model.generate(input_ids=inputs.to(model.device), max_new_tokens=2048)
print(tokenizer.decode(outputs[0]))

高度な使用法

from transformers import pipeline
import torch
PROMPT = """### Instruction
{instruction}
### Response
"""
instruction = <Your code instruction here>
prompt = PROMPT.format(instruction=instruction)
generator = pipeline(
    model="TechxGenus/starcoder2-3b-instruct",
    task="text-generation",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
result = generator(prompt, max_length=2048)
print(result[0]["generated_text"])