YamshadowExperiment28-7B Open-Source Model - Topped the Performance Chart of Open LLM 7B Parameters in April 2024

Yamshadowexperiment28 7B

Developed by automerger

As of April 8, 2024, Yamshadow Experiment 28-7B is the top-performing 7B parameter model on the Open LLM Leaderboard. Use with caution, as this may indicate overfitting to benchmarks.

Large Language Model

Transformers

Open Source License:Apache-2.0 #7B Parameter Champion #8k Long Context #Alpaca Template Compatible

Downloads 101

Release Time : 3/18/2024

Model Overview

This model is an automatic fusion of YamShadow-7B and Experiment28-7B by Maxime Labonne, supporting an 8k context window and recommended for use with the Alpaca chat template.

Model Features

High-Performance 7B Model

Ranked first among 7B parameter models on the Open LLM Leaderboard

Long Context Support

Supports an 8k-length context window

Alpaca Compatibility

Recommended for use with the Alpaca chat template, fully compatible with LM Studio

Model Capabilities

Text Generation

Dialogue Systems

Instruction Following

Use Cases

Dialogue Systems

Intelligent Assistant

Build high-performance conversational assistants

Capable of generating fluent and natural dialogue responses

Text Generation

Content Creation

Used for generating various types of textual content

Can produce coherent and creative text

🚀 🧪 YamshadowExperiment28-7B

YamshadowExperiment28-7B is an automated merged model that currently ranks top among 7B models on the Open LLM Leaderboard. However, it may overfit the benchmarks.

🚀 Quick Start

YamshadowExperiment28-7B is an automated merge created by Maxime Labonne using the following base models:

🎉 YamshadowExperiment28-7B is currently the best-performing 7B model on the Open LLM Leaderboard (08 Apr 24). Use it with caution, as it is likely a sign of overfitting the benchmarks.

image/jpeg

✨ Features

🔍 Applications

This model uses a context window of 8k. It is recommended to use it with the Alpaca chat template (works perfectly with LM Studio).

However, the model can sometimes break and output a lot of "INST". Based on experience, its excellent results on the Open LLM Leaderboard are probably a sign of overfitting.

⚡ Quantized models

GGUF: https://huggingface.co/automerger/YamshadowExperiment28-7B-GGUF

🏆 Evaluation

Open LLM Leaderboard

YamshadowExperiment28-7B is currently the best-performing 7B model on the Open LLM Leaderboard (08 Apr 24).

image/png

EQ-bench

Thanks to Samuel J. Paech, who kindly ran the evaluation.

image/png

Nous

Evaluation performed using LLM AutoEval. See the entire leaderboard here.

image/png

🌳 Model Family Tree

image/png

📦 Installation

!pip install -qU transformers accelerate

💻 Usage Examples

Basic Usage

from transformers import AutoTokenizer
import transformers
import torch

model = "automerger/YamshadowExperiment28-7B"
messages = [{"role": "user", "content": "What is a large language model?"}]

tokenizer = AutoTokenizer.from_pretrained(model)
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
pipeline = transformers.pipeline(
    "text-generation",
    model=model,
    torch_dtype=torch.float16,
    device_map="auto",
)

outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
print(outputs[0]["generated_text"])

📚 Documentation

🧩 Configuration

slices:
  - sources:
      - model: automerger/YamShadow-7B
        layer_range: [0, 32]
      - model: yam-peleg/Experiment28-7B
        layer_range: [0, 32]
merge_method: slerp
base_model: automerger/YamShadow-7B
parameters:
  t:
    - filter: self_attn
      value: [0, 0.5, 0.3, 0.7, 1]
    - filter: mlp
      value: [1, 0.5, 0.7, 0.3, 0]
    - value: 0.5
dtype: bfloat16
random_seed: 0

📄 License

This model is licensed under the Apache-2.0 license.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご