Qra-1b-dolly-instruction-0.1: an open-source question-answering model for Polish

Qra 1b Dolly Instruction 0.1

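Developed by nie3e

A question-answering model fine-tuned from Qra-1b on a Polish instruction dataset. Its main purpose is to answer questions posed by users.

Large language model

Transformers

Other: #Polish question answering #instruction fine-tuning #1B parameters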

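Released: 4/3/2024

Model overview

This model is a version of OPI-PG/Qra-1b fine-tuned on a Polish instruction dataset and optimized for question answering. It can be used for chat, but because the training data contains no conversations, chat quality may be poor.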

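Model features

Optimized for Polish question answering

Fine-tuned specifically for question-answering tasks in Polish

Instruction following

Understands and carries out instructions given by the user

Efficient training

Fine-tuned with LoRA; training took about 1 hour on 2x RTX 4060 Ti 16GB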

Model capabilities

Polish question answering

Instruction understanding and execution

Text generation

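Use cases

Education

Polish language learning aid

Helps students understand questions and concepts in Polish

Customer support

Polish FAQ system

Answers frequently asked questions in Polish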

🚀 Qra-1b-dolly-instruction-0.1

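This model is a fine-tuned version of OPI-PG/Qra-1b, trained on the s3nh/alpaca-dolly-instruction-only-polish dataset. It can be used for text generation.

🚀 Quick start

Environment setup

Make sure the required libraries, such as torch and transformers, are installed.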
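As a quick sanity check before loading the model, the sketch below reports which of the required packages cannot be imported. The helper name missing_packages is hypothetical and not part of the model card; it relies only on the standard library.

```python
import importlib.util


def missing_packages(names):
    """Return the top-level package names that are not importable here."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# The two libraries named above; extend the list if your setup needs more.
missing = missing_packages(["torch", "transformers"])
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All required packages are importable.")
```

If anything is reported missing, install it with your usual package manager before running the example.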

Code example

import torch
from transformers import AutoTokenizer, AutoModelForCausalLM, pipeline

model_id = "nie3e/Qra-1b-dolly-instruction-0.1"
device = "cuda" if torch.cuda.is_available() else "cpu"

# Load the weights in bfloat16 to reduce memory use compared with float32.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
)
tokenizer = AutoTokenizer.from_pretrained(model_id)
pipe = pipeline(
    "text-generation", model=model, tokenizer=tokenizer, device=device
)

def get_answer(system_prompt: str, user_prompt: str) -> str:
    input_msg = [
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": user_prompt}
    ]
    # Render the messages with the model's chat template and append the
    # generation prompt so that decoding starts at the assistant's answer.
    prompt = pipe.tokenizer.apply_chat_template(
        input_msg, tokenize=False,
        add_generation_prompt=True
    )
    # Greedy decoding (do_sample=False). temperature, top_k and top_p only
    # apply when sampling, so they are omitted here.
    outputs = pipe(
        prompt, max_new_tokens=512, do_sample=False,
        eos_token_id=pipe.tokenizer.eos_token_id,
        pad_token_id=pipe.tokenizer.pad_token_id
    )
    # The pipeline returns prompt + completion; keep only the completion.
    return outputs[0]['generated_text'][len(prompt):].strip()

print(
    get_answer(
        system_prompt="Jesteś przyjaznym chatbotem",
        user_prompt="Napisz czym jest dokument architectural decision record."
    )
)

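✨ Key features

Fine-tuned from the OPI-PG/Qra-1b model and suited to question-answering tasks.
Can be used for chat, but because the dataset contains no conversations, chat quality may be poor.

📚 Documentation

Model description

This model was fine-tuned from OPI-PG/Qra-1b.

Intended uses and limitations

This model was fine-tuned for question answering. It can be used for chat, but because the dataset contains no conversations, the results may be less than ideal.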

Training and evaluation data

Dataset: s3nh/alpaca-dolly-instruction-only-polish
Data transformation: each row is converted to conversation format with the following function:

system_message = """Jesteś przyjaznym chatbotem"""

def create_conversation(sample) -> dict:
    strip_characters = "\"'"
    return {
        "messages": [
            {"role": "system", "content": system_message},
            {"role": "user",
             "content": f"{sample['instruction'].strip(strip_characters)} "
                        f"{sample['input'].strip(strip_characters)}"},
            {"role": "assistant",
             "content": f"{sample['output'].strip(strip_characters)}"}
        ]
    }

Train/test split: 90%/10%
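The conversion and split described above can be sketched end to end on a few made-up rows. The column names (instruction, input, output) come from the function in the card. The sample rows, the helper split_train_test, and the shuffle-based split are assumptions for illustration, since the card does not say how the 90%/10% split was produced.

```python
import random

SYSTEM_MESSAGE = "Jesteś przyjaznym chatbotem"


def create_conversation(sample: dict) -> dict:
    """Same conversion as in the card: strip wrapping quotes, build messages."""
    strip_characters = "\"'"
    return {
        "messages": [
            {"role": "system", "content": SYSTEM_MESSAGE},
            {"role": "user",
             "content": f"{sample['instruction'].strip(strip_characters)} "
                        f"{sample['input'].strip(strip_characters)}"},
            {"role": "assistant",
             "content": sample["output"].strip(strip_characters)},
        ]
    }


def split_train_test(rows, test_fraction=0.1, seed=0):
    """Hypothetical helper: shuffle a copy, take the first part as test set."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


# Made-up rows standing in for the real dataset.
rows = [
    {"instruction": f'"Pytanie {i}"', "input": "", "output": f"'Odpowiedź {i}'"}
    for i in range(20)
]
conversations = [create_conversation(row) for row in rows]
train, test = split_train_test(conversations, test_fraction=0.1)

print(len(train), len(test))
print(conversations[0]["messages"][1])
```

Note that when the input column is empty, the user message ends with a trailing space, because the instruction and input are always joined by a single space.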

Training procedure

GPU: 2x RTX 4060Ti 16GB
Training time: about 1 hour
Tools: accelerate + deepspeed, with the following configuration:

compute_environment: LOCAL_MACHINE
debug: false
deepspeed_config:
  gradient_accumulation_steps: 2
  zero3_init_flag: false
  zero_stage: 1
distributed_type: DEEPSPEED
downcast_bf16: 'no'
machine_rank: 0
main_training_function: main
mixed_precision: bf16
num_machines: 1
num_processes: 2
rdzv_backend: static
same_network: true
tpu_env: []
tpu_use_cluster: false
tpu_use_sudo: false
use_cpu: false

Training hyperparameters

LoRA configuration:

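from peft import LoraConfig

peft_config = LoraConfig(
    lora_alpha=128,               # scaling numerator; updates are scaled by lora_alpha / r
    lora_dropout=0.05,
    r=256,                        # rank of the low-rank update matrices
    bias="none",                  # bias terms are not trained
    target_modules="all-linear",  # adapt every linear layer except the output head
    task_type="CAUSAL_LM"
)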
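To make the two main LoRA numbers above concrete, the sketch below computes the default LoRA scaling factor (lora_alpha / r) and the number of parameters an adapter adds to one square linear layer. The helper names and the hidden size of 2048 are assumptions for illustration, not values stated in the card.

```python
def lora_scaling(lora_alpha: int, r: int) -> float:
    """Default LoRA scaling applied to the low-rank update B @ A."""
    return lora_alpha / r


def lora_params_per_layer(in_features: int, out_features: int, r: int) -> int:
    """Parameters added by one adapter: A is (r x in), B is (out x r)."""
    return r * (in_features + out_features)


scaling = lora_scaling(lora_alpha=128, r=256)

hidden = 2048  # hypothetical layer width, chosen only for illustration
added = lora_params_per_layer(hidden, hidden, r=256)
frozen = hidden * hidden  # weights of the frozen base layer

print(f"scaling factor: {scaling}")
print(f"adapter params: {added:,} vs frozen params: {frozen:,}")
```

With r=256 the adapter is fairly large relative to such a layer, so this configuration trades some of LoRA's memory savings for more capacity.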

Training arguments:

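from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="Qra-1b-dolly-instruction-0.1",
    num_train_epochs=3,
    per_device_train_batch_size=3,
    gradient_accumulation_steps=2,   # matches the deepspeed config above
    gradient_checkpointing=True,     # trade compute for lower memory use
    optim="adamw_torch_fused",
    logging_steps=10,
    save_strategy="epoch",
    learning_rate=2e-4,
    bf16=True,
    tf32=True,                       # requires an Ampere or newer NVIDIA GPU
    max_grad_norm=0.3,
    warmup_ratio=0.03,               # the "constant" scheduler applies no warmup, so this likely has no effect
    lr_scheduler_type="constant",
    push_to_hub=False,
    report_to=["tensorboard"],
)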
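Combining the training arguments with the accelerate configuration (num_processes: 2) gives the effective batch size per optimizer step. The sketch below shows the arithmetic. The helper name and the training-set size of 10,000 examples are hypothetical, and the step count is only a rough estimate.

```python
import math


def effective_batch_size(per_device: int, grad_accum: int, num_processes: int) -> int:
    """Examples that contribute to one optimizer step across all GPUs."""
    return per_device * grad_accum * num_processes


batch = effective_batch_size(per_device=3, grad_accum=2, num_processes=2)

# Hypothetical training-set size, used only to illustrate the step estimate.
n_train_examples = 10_000
approx_steps_per_epoch = math.ceil(n_train_examples / batch)

print(f"effective batch size: {batch}")
print(f"approx. optimizer steps per epoch: {approx_steps_per_epoch}")
```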