Aurora - Borealis - LLaMa - 70B Open - Source Multi - Model Fusion Project Brings a New Experience of Multi

Aurora Borealis LLaMa 70B

Developed by Tarek07

This is an experimental multi-model fusion project based on the LLaMa-70B architecture, utilizing the DARE TIES fusion method and combining six different versions of the MO-MODEL.

Large Language Model

Transformers

#Multi-model fusion #DARE TIES technique #70B parameters

Downloads 112

Release Time : 5/1/2025

Model Overview

A result of professional model fusion experiments, attempting to use gradient techniques to finely control the influence of each model on the final fusion outcome, suitable for advanced natural language processing tasks.

Model Features

Multi-model fusion

Combines six different versions of 70B parameter models, achieving fine control through the DARE TIES method.

Gradient technique application

Attempts to use gradient techniques during the fusion process to optimize each model's contribution to the final result.

High-precision requirements

It is recommended not to run on configurations below Q5 quantization to ensure model performance.

Model Capabilities

Text generation

Language understanding

Complex reasoning

Use Cases

Research and Development

Model fusion technology research

Used for studying multi-model fusion methods and effect evaluation

Provides comparisons of fusion effects under different weight configurations

Natural Language Processing

Advanced text generation

Generates high-quality, coherent long-form text content

🚀 MERGE2

This is a merge of pre - trained language models aiming to leverage the strengths of multiple models through a specialized merging process, providing more powerful and flexible language processing capabilities.

image/png

Formerly known as MO - MODEL - Fused - V0.6 - LLaMa - 70B, this model is part of my ongoing experiments with merging specialized curated models. For this one, I started experimenting with gradients, to give myself more finetuned control of how the models influence the final merge.

📦 Installation

Since it's a pre - trained model merge, there's no specific installation command here. You can use it with the transformers library as specified.

💻 Usage Examples

Recommended sampler settings

Temp 1.0
Min P 0.02

⚠️ Important Note

Because of the nature of this sort of 'Hyper Multi Model Merge', my recommendation is not to run this on anything lower than a Q5 quant.

💡 Usage Tip

If you enjoy my work, please consider supporting me. It helps me make more models like this! Support on KO - FI <3

📚 Documentation

Merge Details

Merge Method

This model was merged using the DARE TIES merge method using [TareksLab/MO - MODEL6 - V0.1 - LLaMa - 70B](https://huggingface.co/TareksLab/MO - MODEL6 - V0.1 - LLaMa - 70B) as a base.

Models Merged

The following models were included in the merge:

[TareksLab/MO - MODEL3 - V0.2 - LLaMa - 70B](https://huggingface.co/TareksLab/MO - MODEL3 - V0.2 - LLaMa - 70B)
[TareksLab/MO - MODEL5 - V0.3 - LLaMa - 70B](https://huggingface.co/TareksLab/MO - MODEL5 - V0.3 - LLaMa - 70B)
[TareksLab/MO - MODEL2 - V0.2 - LLaMa - 70B](https://huggingface.co/TareksLab/MO - MODEL2 - V0.2 - LLaMa - 70B)
[TareksLab/MO - MODEL1 - V1 - LLaMa - 70B](https://huggingface.co/TareksLab/MO - MODEL1 - V1 - LLaMa - 70B)
[TareksLab/MO - MODEL4 - V0.1 - LLaMa - 70B](https://huggingface.co/TareksLab/MO - MODEL4 - V0.1 - LLaMa - 70B)

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: TareksLab/MO-MODEL6-V0.1-LLaMa-70B
    parameters:
      weight: [0.1, 0.1, 0.1, 0.2, 0.5]
      density: 0.5
  - model: TareksLab/MO-MODEL4-V0.1-LLaMa-70B
    parameters:
      weight: [0.1, 0.1, 0.2, 0.4, 0.2]
      density: 0.5
  - model: TareksLab/MO-MODEL5-V0.3-LLaMa-70B
    parameters:
      weight: [0.1, 0.2, 0.4, 0.2, 0.1]
      density: 0.5
  - model: TareksLab/MO-MODEL3-V0.2-LLaMa-70B
    parameters:
      weight: [0.2, 0.4, 0.2, 0.1, 0.1]
      density: 0.5
  - model: TareksLab/MO-MODEL2-V0.2-LLaMa-70B
    parameters:
      weight: [0.5, 0.2, 0.1, 0.1, 0.1]
      density: 0.5
  - model: TareksLab/MO-MODEL1-V1-LLaMa-70B
    parameters:
      weight: 0.10
      density: 0.5
merge_method: dare_ties
base_model: TareksLab/MO-MODEL6-V0.1-LLaMa-70B
parameters:
  normalize: false
  int8_mask: true
dtype: float32
out_dtype: bfloat16
chat_template: llama3
tokenizer:
 source: base

📄 License

This model is under the llama3.3 license.

📋 Information Table

Property	Details
Base Models	TareksLab/MO - MODEL3 - V0.2 - LLaMa - 70B, TareksLab/MO - MODEL5 - V0.3 - LLaMa - 70B, TareksLab/MO - MODEL2 - V0.2 - LLaMa - 70B, TareksLab/MO - MODEL1 - V1 - LLaMa - 70B, TareksLab/MO - MODEL6 - V0.1 - LLaMa - 70B, TareksLab/MO - MODEL4 - V0.1 - LLaMa - 70B
Library Name	transformers
Tags	mergekit, merge
License	llama3.3

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご