đ supermario-slerp-v2
supermario-slerp-v2
is a merged model that uses the Slerp
method. It can be run on Jan Desktop, offering an open - source, offline, and OpenAI - compatible alternative for text generation tasks.
đ Quick Start
You can run this model using Jan Desktop on Mac, Windows, or Linux.
Jan is an open source, ChatGPT alternative with the following features:
- đģ 100% offline on your machine: Your conversations remain confidential, and visible only to you.
- đī¸ An Open File Format: Conversations and model settings stay on your computer and can be exported or deleted at any time.
- đ OpenAI Compatible: Local server on port
1337
with OpenAI compatible endpoints
- đ Open Source & Free: We build in public; check out our Github

⨠Features
- Model Merging: This model uses the
Slerp
merge method from 2 models:
- [v1olet_marcoroni - go - bruins - merge - 7B](https://huggingface.co/v1olet/v1olet_marcoroni - go - bruins - merge - 7B)
- [juanako - 7b - UNA](https://huggingface.co/fblgit/juanako - 7b - UNA)
- Base Model: [v1olet_marcoroni - go - bruins - merge - 7B](https://huggingface.co/v1olet/v1olet_marcoroni - go - bruins - merge - 7B)
đ Documentation
Model Description
This model uses the Slerp
merge method from 2 models:
- [v1olet_marcoroni - go - bruins - merge - 7B](https://huggingface.co/v1olet/v1olet_marcoroni - go - bruins - merge - 7B)
- [juanako - 7b - UNA](https://huggingface.co/fblgit/juanako - 7b - UNA)
- base model: [v1olet_marcoroni - go - bruins - merge - 7B](https://huggingface.co/v1olet/v1olet_marcoroni - go - bruins - merge - 7B)
The yaml config file for this model is here:
slices:
- sources:
- model: v1olet/v1olet_marcoroni - go - bruins - merge - 7B
layer_range: [0, 32]
- model: fblgit/juanako - 7b - UNA
layer_range: [0, 32]
merge_method: slerp
base_model: v1olet/v1olet_marcoroni - go - bruins - merge - 7B
parameters:
t:
- filter: self_attn
value: [0, 0.5, 0.3, 0.7, 1]
- filter: mlp
value: [1, 0.5, 0.7, 0.3, 0]
- value: 0.5
dtype: bfloat16
About Jan
Jan believes in the need for an open - source AI ecosystem and is building the infra and tooling to allow open - source AIs to compete on a level playing field with proprietary ones.
Jan's long - term vision is to build a cognitive framework for future robots, who are practical, useful assistants for humans and businesses in everyday life.
Jan Model Merger
This is a test project for merging models.
Detailed results can be found [here](https://huggingface.co/datasets/open - llm - leaderboard/details_janhq__supermario - slerp - v2)
Metric |
Value |
Avg. |
71.35 |
AI2 Reasoning Challenge (25 - Shot) |
69.37 |
HellaSwag (10 - Shot) |
86.60 |
MMLU (5 - Shot) |
64.91 |
TruthfulQA (0 - shot) |
62.96 |
Winogrande (5 - shot) |
80.82 |
GSM8k (5 - shot) |
63.46 |
đ License
This model is licensed under the apache - 2.0
license.
Acknowlegement
- mergekit
- [DARE](https://github.com/yule - BUAA/MergeLM/blob/main/README.md)
- [SLERP](https://github.com/Digitous/LLM - SLERP - Merge)
- [lm - evaluation - harness](https://github.com/EleutherAI/lm - evaluation - harness)