Llama-3.1-8B-Instruct-abliterated_via_adapter Open Source Model - Eliminate Rejected Responses for Uninterrupted Conversations

Home

Llama 3.1 8B Instruct Abliterated Via Adapter

Developed by grimjim

Eliminate the rejection response problem of the Llama-3.1-8B-Instruct model through LoRA technology

Large Language Model

Transformers

#Rejection response elimination #LoRA adaptation #Instruction fine-tuning

Downloads 3,173

Release Time : 7/25/2024

Model Overview

This model is merged based on Llama-3.1-8B-Instruct using LoRA technology, mainly solving the rejection response problem of the original model and is suitable for text generation tasks.

Model Features

Rejection response elimination

By applying LoRA technology, the rejection response problem of the original model is eliminated

Feature commonality

Although LoRA is derived from Llama 3, it is still effective on Llama 3.1, indicating significant feature commonality between the two

Model Capabilities

Text generation

Use Cases

Text generation

Instruction following

The model can better follow user instructions for text generation

Reduce the situation of rejection responses

🚀 Llama-3.1-8B-Instruct-abliterated_via_adapter

This model is a merged pre - trained language model, which effectively addresses refusal issues in text generation. It showcases significant commonalities between Llama 3 and Llama 3.1 models, demonstrating high compatibility and feature consistency.

🚀 Quick Start

This model is a merge of pre - trained language models created using mergekit.

A LoRA was applied to "abliterate" refusals in [meta - llama/Meta - Llama - 3.1 - 8B - Instruct](https://huggingface.co/meta - llama/Meta - Llama - 3.1 - 8B - Instruct). The result appears to work despite the LoRA having been derived from Llama 3 instead of Llama 3.1, which implies that there is significant feature commonality between the 3 and 3.1 models.

The LoRA was extracted from [failspy/Meta - Llama - 3 - 8B - Instruct - abliterated - v3](https://huggingface.co/failspy/Meta - Llama - 3 - 8B - Instruct - abliterated - v3) using [meta - llama/Meta - Llama - 3 - 8B - Instruct](https://huggingface.co/meta - llama/Meta - Llama - 3 - 8B - Instruct) as a base.

Built with Llama.

✨ Features

Merge Details

Merge Method

This model was merged using the task arithmetic merge method using [meta - llama/Meta - Llama - 3.1 - 8B - Instruct](https://huggingface.co/meta - llama/Meta - Llama - 3.1 - 8B - Instruct) + [grimjim/Llama - 3 - Instruct - abliteration - LoRA - 8B](https://huggingface.co/grimjim/Llama - 3 - Instruct - abliteration - LoRA - 8B) as a base.

Configuration

The following YAML configuration was used to produce this model:

base_model: meta-llama/Meta-Llama-3.1-8B-Instruct+grimjim/Llama-3-Instruct-abliteration-LoRA-8B
dtype: bfloat16
merge_method: task_arithmetic
parameters:
  normalize: false
slices:
- sources:
  - layer_range: [0, 32]
    model: meta-llama/Meta-Llama-3.1-8B-Instruct+grimjim/Llama-3-Instruct-abliteration-LoRA-8B
    parameters:
      weight: 1.0

📚 Documentation

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	22.95
IFEval (0 - Shot)	48.70
BBH (3 - Shot)	29.42
MATH Lvl 5 (4 - Shot)	12.39
GPQA (0 - shot)	8.50
MuSR (0 - shot)	9.26
MMLU - PRO (5 - shot)	29.46

📄 License

The model uses the llama3.1 license.

Property	Details
Model Type	Llama - 3.1 - 8B - Instruct - abliterated_via_adapter
Training Data	Not specified in the original document

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご