DareBeagle-7B

Developed by shadowml

DareBeagle-7B is a 7B-parameter large language model created by merging mlabonne/NeuralBeagle14-7B and mlabonne/NeuralDaredevil-7B with LazyMergekit. It averages 74.58 across the six Open LLM Leaderboard benchmarks.

Large Language Model

Transformers

Open Source License: Apache-2.0

Tags: Efficient Text Generation, Multi-task Reasoning, Few-shot Learning

Downloads 77

Release Date: 1/16/2024

Model Overview

DareBeagle-7B is a merged model that combines NeuralBeagle14-7B and NeuralDaredevil-7B and targets text generation tasks. It scores an average of 74.58 on the Open LLM Leaderboard.

Model Features

Model Merging Technique

Uses the SLERP (spherical linear interpolation) method to merge the two source models, combining their respective strengths.

High Performance

Averages 74.58 across the six Open LLM Leaderboard benchmarks.

Flexible Layer Configuration

Applies different interpolation parameters to the self_attn and mlp layers, with a single fallback value for all remaining tensors.
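
The SLERP technique named in the features above can be illustrated with a minimal sketch. This is not mergekit's actual implementation: the function name `slerp`, the `eps` guard, and the 0.9995 colinearity threshold are assumptions chosen for illustration. The idea is to interpolate two weight vectors along the arc between them rather than along a straight line.

```python
import numpy as np

def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two weight vectors.

    Hypothetical helper for illustration; t=0 returns v0 and t=1 returns v1.
    """
    v0 = np.asarray(v0, dtype=np.float64)
    v1 = np.asarray(v1, dtype=np.float64)
    # Normalize copies only to measure the angle between the two vectors.
    u0 = v0 / (np.linalg.norm(v0) + eps)
    u1 = v1 / (np.linalg.norm(v1) + eps)
    dot = np.clip(np.sum(u0 * u1), -1.0, 1.0)
    # Nearly colinear vectors: fall back to plain linear interpolation
    # (the 0.9995 threshold is an assumption, not taken from the source).
    if abs(dot) > 0.9995:
        return (1.0 - t) * v0 + t * v1
    theta = np.arccos(dot)
    sin_theta = np.sin(theta)
    w0 = np.sin((1.0 - t) * theta) / sin_theta
    w1 = np.sin(t * theta) / sin_theta
    return w0 * v0 + w1 * v1
```

In a real merge this operation would be applied tensor by tensor, with `t` chosen per layer and per tensor type as the configuration specifies.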

Model Capabilities

Text Generation

Question Answering

Reasoning Tasks

Knowledge QA

Use Cases

Education

Knowledge QA

Answers questions across various academic disciplines

65.03% accuracy on MMLU (5-shot)

Research

Reasoning Tasks

Solves complex reasoning problems

71.67% normalized accuracy on AI2 Reasoning Challenge (25-shot)

Business Applications

Math Problem Solving

Solves mathematical calculations and reasoning problems

71.49% accuracy on GSM8k (5-shot)

license: apache-2.0
tags:
- merge
- mergekit
- lazymergekit
- mlabonne/NeuralBeagle14-7B
- mlabonne/NeuralDaredevil-7B
model-index:
- name: DareBeagle-7B
  results:
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: AI2 Reasoning Challenge (25-Shot)
      type: ai2_arc
      config: ARC-Challenge
      split: test
      args:
        num_few_shot: 25
    metrics:
    - type: acc_norm
      value: 71.67
      name: normalized accuracy
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=shadowml/DareBeagle-7B
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: HellaSwag (10-Shot)
      type: hellaswag
      split: validation
      args:
        num_few_shot: 10
    metrics:
    - type: acc_norm
      value: 88.01
      name: normalized accuracy
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=shadowml/DareBeagle-7B
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MMLU (5-Shot)
      type: cais/mmlu
      config: all
      split: test
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 65.03
      name: accuracy
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=shadowml/DareBeagle-7B
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: TruthfulQA (0-shot)
      type: truthful_qa
      config: multiple_choice
      split: validation
      args:
        num_few_shot: 0
    metrics:
    - type: mc2
      value: 68.98
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=shadowml/DareBeagle-7B
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: Winogrande (5-shot)
      type: winogrande
      config: winogrande_xl
      split: validation
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 82.32
      name: accuracy
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=shadowml/DareBeagle-7B
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: GSM8k (5-shot)
      type: gsm8k
      config: main
      split: test
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 71.49
      name: accuracy
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=shadowml/DareBeagle-7B
      name: Open LLM Leaderboard

DareBeagle-7B

DareBeagle-7B is a merge of the following models using LazyMergekit:

mlabonne/NeuralBeagle14-7B
mlabonne/NeuralDaredevil-7B

🧩 Configuration

slices:
  - sources:
      - model: mlabonne/NeuralBeagle14-7B
        layer_range: [0, 32]
      - model: mlabonne/NeuralDaredevil-7B
        layer_range: [0, 32]
merge_method: slerp
base_model: mlabonne/NeuralDaredevil-7B
parameters:
  t:
    - filter: self_attn
      value: [0, 0.5, 0.3, 0.7, 1]
    - filter: mlp
      value: [1, 0.5, 0.7, 0.3, 0]
    - value: 0.45 # fallback for rest of tensors
dtype: float16
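
To make the `t` lists in the configuration more concrete, here is a rough sketch of how five anchor values might be spread over the 32 layers. It assumes the anchors are placed evenly across the layer range and linearly interpolated, and that `t = 0` keeps the base model's weights while `t = 1` takes the other model's; mergekit's exact mapping may differ. The helper name `layer_schedule` is hypothetical.

```python
import numpy as np

NUM_LAYERS = 32  # from layer_range: [0, 32] in the configuration

def layer_schedule(anchors, num_layers=NUM_LAYERS):
    """Expand a short list of anchor values into one t value per layer.

    Hypothetical helper: assumes evenly spaced anchors and linear
    interpolation between them, which may not match mergekit exactly.
    """
    anchors = np.asarray(anchors, dtype=np.float64)
    anchor_pos = np.linspace(0.0, 1.0, len(anchors))  # where each anchor sits
    layer_pos = np.linspace(0.0, 1.0, num_layers)     # where each layer sits
    return np.interp(layer_pos, anchor_pos, anchors)

self_attn_t = layer_schedule([0, 0.5, 0.3, 0.7, 1])
mlp_t = layer_schedule([1, 0.5, 0.7, 0.3, 0])
fallback_t = 0.45  # used for tensors matching neither filter

for layer in (0, 8, 16, 24, 31):
    print(f"layer {layer:2d}: self_attn t={self_attn_t[layer]:.3f}, mlp t={mlp_t[layer]:.3f}")
```

Under these assumptions the two schedules mirror each other: the mlp anchors are one minus the self_attn anchors, so wherever attention weights lean toward one source model, the MLP weights lean toward the other.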

💻 Usage

# Notebook cell syntax; in a shell, run the command without the leading "!".
!pip install -qU transformers accelerate

from transformers import AutoTokenizer
import transformers
import torch

model = "shadowml/DareBeagle-7B"
messages = [{"role": "user", "content": "What is a large language model?"}]

tokenizer = AutoTokenizer.from_pretrained(model)
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
pipeline = transformers.pipeline(
    "text-generation",
    model=model,
    torch_dtype=torch.float16,
    device_map="auto",
)

outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
print(outputs[0]["generated_text"])

Open LLM Leaderboard Evaluation Results

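Detailed results can be found on the Open LLM Leaderboard: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=shadowml/DareBeagle-7B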

Metric	Value
Avg.	74.58
AI2 Reasoning Challenge (25-Shot)	71.67
HellaSwag (10-Shot)	88.01
MMLU (5-Shot)	65.03
TruthfulQA (0-shot)	68.98
Winogrande (5-shot)	82.32
GSM8k (5-shot)	71.49
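
As a quick check on the table, the reported average appears to be the unweighted mean of the six benchmark scores. The snippet below is only an arithmetic sketch; the dictionary name `scores` is illustrative.

```python
# Scores copied from the table above.
scores = {
    "AI2 Reasoning Challenge (25-Shot)": 71.67,
    "HellaSwag (10-Shot)": 88.01,
    "MMLU (5-Shot)": 65.03,
    "TruthfulQA (0-shot)": 68.98,
    "Winogrande (5-shot)": 82.32,
    "GSM8k (5-shot)": 71.49,
}

# Unweighted mean over the six benchmarks.
average = sum(scores.values()) / len(scores)
print(f"{average:.2f}")  # 74.58
```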
