Zurich-7B-GCv2-100k Open-source Dialogue Model - Achieving Intelligent Communication and Interaction through Gamma Corpus Training

Zurich 7B GCv2 100k

Developed by rubenroy

A dialogue model fine-tuned based on Qwen 2.5 7B Instruct model, trained with Gamma Corpus v2-100k

EnglishOpen Source License:Apache-2.0 #Multi-turn Dialogue Optimization #Unsloth Efficient Fine-tuning #Chinese QA Enhancement

Downloads 45

Release Time : 1/28/2025

Model Overview

This model is a fine-tuned version of Alibaba's Qwen 2.5 7B Instruct model, designed to surpass other models of the same size through training on the Gamma Corpus, delivering high-quality dialogue experiences.

Model Features

Efficient Fine-tuning

Completed fine-tuning training in just 70 minutes on a single T4 GPU using the Unsloth framework

Advanced Architecture

Incorporates cutting-edge technologies like Rotary Position Embedding and SwiGLU activation function

High-quality Training Data

Trained on the meticulously curated Gamma Corpus v2-100k version

Model Capabilities

Multi-turn Dialogue Generation

QA Systems

Text Generation

Chat Functionality

Use Cases

Dialogue Systems

Smart Assistant

Can serve as an intelligent dialogue assistant for information queries and daily Q&A services

Customer Service System

Suitable for building automated customer service dialogue systems

Education

Learning Assistant

Helps students answer various knowledge-based questions

🚀 Zurich 7B GammaCorpus v2-100k

A Qwen 2.5 model fine-tuned on the GammaCorpus dataset

Zurich 7B GammaCorpus v2-100k is a fine - tuned version of Alibaba's Qwen 2.5 7B Instruct model. It aims to outperform similar - sized models and showcases the GammaCorpus v2-100k dataset.

🚀 Quick Start

Here is a quick guide on how to load and use the Zurich 7B GammaCorpus v2-100k model.

Requirements

We strongly recommend you use the latest version of the transformers package. You may install it via pip as follows:

pip install transformers

Basic Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "rubenroy/Zurich-7B-GCv2-100k"

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

prompt = "How tall is the Eiffel tower?"
messages = [
    {"role": "system", "content": "You are Zurich, an AI assistant built on the Qwen 2.5 7B model developed by Alibaba Cloud, and fine-tuned by Ruben Roy. You are a helpful assistant."},
    {"role": "user", "content": prompt}
]
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=512
)
generated_ids = [
    output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]

response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]

✨ Features

Zurich 7B GammaCorpus v2-100k is designed to outperform other models of similar size and showcases the GammaCorpus v2-100k dataset.

📦 Installation

To use this model, you need to install the transformers library. You can install it using pip:

pip install transformers

📚 Documentation

Model Details

Property	Details
Base Model	Qwen/Qwen2.5-7B-Instruct
Model Type	Causal Language Models
Architecture	Transformers with RoPE, SwiGLU, RMSNorm, and Attention QKV bias
Number of Parameters	7.61B
Number of Paramaters (Non - Embedding)	6.53B
Number of Layers	28
Number of Attention Heads (GQA)	28 for Q and 4 for KV

Training Details

Zurich-7B-GCv2-100k was fine - tuned using 1 T4 GPU for approximately 70 minutes with the Unsloth framework. It was trained for 60 Epochs.

About GammaCorpus

This model and all Zurich models are trained with GammaCorpus, a dataset on HuggingFace filled with structured and filtered multi - turn conversations. GammaCorpus has 4 versions with different sizes:

GammaCorpus v1

10k UNFILTERED
50k UNFILTERED
70k UNFILTERED

Link to the GCv1 dataset collection: GammaCorpus v1

GammaCorpus v2

10k
50k
100k <-- This is the version of GammaCorpus v2 that the Zurich model you are using was trained on.
500k
1m
5m

Link to the GCv2 dataset collection: GammaCorpus v2

GammaCorpus CoT

Math 170k

Link to the GC - CoT dataset collection: GammaCorpus CoT

GammaCorpus QA

Fact 450k

Link to the GC - QA dataset collection: GammaCorpus QA

The link to the full GammaCorpus dataset collection can be found here.

🔧 Technical Details

Zurich 7B GammaCorpus v2-100k is a fine - tuned version of the Qwen 2.5 7B Instruct model. It uses a Transformer architecture with specific components like RoPE, SwiGLU, RMSNorm, and Attention QKV bias.

📄 License

The model is released under the Apache 2.0 License. Please refer to the license for usage rights and restrictions.

⚠️ Important Note

We have tried our best to mitigate as much bias as possible, but please be aware that the model might generate some biased answers.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご