Qra-1b-dolly-instruction-0.1 Open-source Q&A Model - Free Deployment, Accurately Answer Polish Questions

Qra 1b Dolly Instruction 0.1

Developed by nie3e

This is a Q&A model fine-tuned on Polish instruction datasets based on the Qra-1b model, primarily used to answer user questions.

Large Language Model

Transformers

Other#Polish Q&A #Instruction Fine-tuning #1B Parameter Scale

Downloads 16

Release Time : 4/3/2024

Model Overview

This model is a fine-tuned version of OPI-PG/Qra-1b on Polish instruction datasets, optimized for Q&A tasks. While it can be used for chatting, the performance may not be ideal due to the lack of dialogue content in the training data.

Model Features

Polish Q&A Optimization

Specially fine-tuned for Polish Q&A tasks

Instruction Following

Capable of understanding and executing user-provided instructions

Efficient Training

Uses LoRA technology for efficient fine-tuning, requiring only about 1 hour of training time

Model Capabilities

Polish Q&A

Instruction Understanding & Execution

Text Generation

Use Cases

Education

Polish Learning Aid

Helps students understand Polish questions and concepts

Customer Service

Polish FAQ System

Used to answer common Polish questions

🚀 Qra-1b-dolly-instruction-0.1

This is a fine - tuned model based on OPI - PG/Qra-1b, designed to handle question - answering tasks.

🚀 Quick Start

The Qra-1b-dolly-instruction-0.1 model is a fine - tuned version of OPI - PG/Qra-1b on the s3nh/alpaca-dolly-instruction-only-polish dataset.

✨ Features

Question - Answering: This model has been fine - tuned for question - answering tasks.
Based on Fine - Tuning: It is a fine - tuned version of OPI - PG/Qra-1b.

💻 Usage Examples

Basic Usage

import torch
from transformers import AutoTokenizer, AutoModelForCausalLM, pipeline

model_id = "nie3e/Qra-1b-dolly-instruction-0.1"
device = "cuda" if torch.cuda.is_available() else "cpu"

model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
)
tokenizer = AutoTokenizer.from_pretrained(model_id)
pipe = pipeline(
    "text-generation", model=model, tokenizer=tokenizer, device=device
)

def get_answer(system_prompt: str, user_prompt: str) -> str:
    input_msg = [
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": user_prompt}
    ]
    prompt = pipe.tokenizer.apply_chat_template(
        input_msg, tokenize=False,
        add_generation_prompt=True
    )
    outputs = pipe(
        prompt, max_new_tokens=512, do_sample=False, temperature=0.1, top_k=50,
        top_p=0.1, eos_token_id=pipe.tokenizer.eos_token_id,
        pad_token_id=pipe.tokenizer.pad_token_id
    )
    return outputs[0]['generated_text'][len(prompt):].strip()

print(
        get_answer(
        system_prompt="Jesteś przyjaznym chatbotem",
        user_prompt="Napisz czym jest dokument architectural decision record."
    )
)

📚 Documentation

Model Description

The model is trained from OPI - PG/Qra-1b.

Intended uses & limitations

This model has been fine - tuned for question - answering tasks. It can be used as a chat model, but its performance is not ideal because the dataset does not contain conversations.

Training and evaluation data

Dataset: s3nh/alpaca-dolly-instruction-only-polish
Data Conversion: Each row has been converted into a conversation using the following function:

system_message = """Jesteś przyjaznym chatbotem"""

def create_conversation(sample) -> dict:
    strip_characters = "\"'"
    return {
        "messages": [
            {"role": "system", "content": system_message},
            {"role": "user",
             "content": f"{sample['instruction'].strip(strip_characters)} "
                        f"{sample['input'].strip(strip_characters)}"},
            {"role": "assistant",
             "content": f"{sample['output'].strip(strip_characters)}"}
        ]
    }

Train/test split: 90%/10%

Training procedure

Hardware: 2x RTX 4060Ti 16GB
Training time: ~1 hour
Configuration: Using accelerate + deepspeed with the following config:

compute_environment: LOCAL_MACHINE
debug: false
deepspeed_config:
  gradient_accumulation_steps: 2
  zero3_init_flag: false
  zero_stage: 1
distributed_type: DEEPSPEED
downcast_bf16: 'no'
machine_rank: 0
main_training_function: main
mixed_precision: bf16
num_machines: 1
num_processes: 2
rdzv_backend: static
same_network: true
tpu_env: []
tpu_use_cluster: false
tpu_use_sudo: false
use_cpu: false

Training hyperparameters

Lora config:

peft_config = LoraConfig(
    lora_alpha=128,
    lora_dropout=0.05,
    r=256,
    bias="none",
    target_modules="all-linear",
    task_type="CAUSAL_LM"
)

Training arguments:

args = TrainingArguments(
    output_dir="Qra-1b-dolly-instruction-0.1",
    num_train_epochs=3,
    per_device_train_batch_size=3,
    gradient_accumulation_steps=2,
    gradient_checkpointing=True,
    optim="adamw_torch_fused",
    logging_steps=10,
    save_strategy="epoch",
    learning_rate=2e-4,
    bf16=True,
    tf32=True,
    max_grad_norm=0.3,
    warmup_ratio=0.03,
    lr_scheduler_type="constant",
    push_to_hub=False,
    report_to=["tensorboard"],
)

Framework versions

PEFT 0.10.0
Transformers 4.39.2
Pytorch 2.2.2+cu121
Datasets 2.18.0
Tokenizers 0.15.2

📄 License

The model is under the llama2 license.

📊 Model Information

Property	Details
Model Type	Fine - tuned version of OPI - PG/Qra-1b
Training Data	s3nh/alpaca-dolly-instruction-only-polish

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご