Hexagon Purple V2
Hexagon Purple V2 is a merged pre-trained language model. It keeps the SmarTracks base used in V1 and makes several improvements over it, aiming for better performance and lower censorship.
Quick Start
This card does not include dedicated quick-start steps. The model is a merge produced with mergekit; refer to the mergekit documentation for how such merges are built and deployed (a sketch is given in the Configuration section below), and to the Usage Examples section for inference.
Features
Model Evolution
- The base of Hexagon Purple V2, SmarTracks, remains unchanged. It is a "3-level" stock merge that brings in DeepSeek R1 Distill (3 flavors), Nemotron, and Tulu capabilities.
- Compared to V1, it makes the following improvements:
- Replaced Black-Ink-Guild's Pernicious Prophecy with Steelskull's Electra R1, which performs better.
- Replaced the Hostess stock merge with a Priestess one, bringing in 70Blivision and removing the Lumitron merge on top of Tess R1 and Llama Creative Writer.
- Replaced the standalone models Dobby, Wayfarer, and Drummer's Fallen Llama R1 with a stock merge of the three, DoppelGanger R1.
- Added Nbeerbower's Gutenberg Doppel as a 3.1 instruct (and novel writing) stabilizer, working in tandem with the following model.
- Added migtissera's Tess 3.0 (Llama 3.1 70B) as a perplexity dropper.
Low Censorship
When available, abliterated and lorablated versions (thanks to Huihui-ai, Maxime Labonne, and of course Failspy) are used systematically. Otherwise, components with very low censorship are favored.
Benchmark Performance
- PPL Wikitext Eng 512: 3.43 (good)
- ARC-C: 60.55 (good)
- ARC-E: 81.05 (also good)
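The card does not state which harness produced these figures. As a point of reference only, a 512-token WikiText perplexity in this spirit can be approximated with the standard Hugging Face stack; the snippet below is a minimal sketch, where the dataset split, windowing scheme, and `your_model_path` are assumptions, and the result will not exactly match the number above.

```python
import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "your_model_path"  # assumption: local path or HF repo id of the merged model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16, device_map="auto")
model.eval()

# WikiText-2 test split, tokenized as one long stream (the card does not say which split was used)
text = "\n\n".join(load_dataset("wikitext", "wikitext-2-raw-v1", split="test")["text"])
ids = tokenizer(text, return_tensors="pt").input_ids

window = 512  # matches the 512-token context of the reported PPL figure
nlls, n_tokens = [], 0
for start in range(0, ids.size(1) - window + 1, window):
    chunk = ids[:, start:start + window].to(model.device)
    with torch.no_grad():
        # labels == inputs -> transformers returns the mean causal-LM cross-entropy for the window
        loss = model(chunk, labels=chunk).loss
    nlls.append(loss.float() * window)
    n_tokens += window

ppl = torch.exp(torch.stack(nlls).sum() / n_tokens)
print(f"Perplexity over {n_tokens} tokens (512-token windows): {ppl.item():.2f}")
```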
Installation
This card does not include dedicated installation steps. For inference, the standard Hugging Face stack (transformers, accelerate, and torch) is sufficient; to reproduce the merge itself, install mergekit and refer to the mergekit repository for details.
Usage Examples
This card does not include official code examples, but the model can be used like any other pre-trained causal language model, for instance with the transformers library in Python:
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Local path or Hugging Face repo id of the merged model
model_name = "your_model_path"
tokenizer = AutoTokenizer.from_pretrained(model_name)
# bfloat16 matches the merge's out_dtype; device_map="auto" spreads the 70B
# weights across available GPUs (requires the accelerate package)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16, device_map="auto")
input_text = "Your input text here"
input_ids = tokenizer(input_text, return_tensors="pt").input_ids.to(model.device)
output = model.generate(input_ids, max_new_tokens=256)
generated_text = tokenizer.decode(output[0], skip_special_tokens=True)
print(generated_text)
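Because the merge config sets `chat_template: auto` with a union tokenizer, the model should also carry a Llama 3.x style chat template, so conversational prompts can be formatted with `apply_chat_template`. A minimal sketch, assuming `model` and `tokenizer` are loaded as above; the messages and sampling settings are purely illustrative:

```python
# Reuses `model` and `tokenizer` from the snippet above.
messages = [
    {"role": "system", "content": "You are a creative writing assistant."},
    {"role": "user", "content": "Write the opening paragraph of a mystery novel."},
]
# Let the tokenizer's chat template turn the conversation into model-ready ids
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
output = model.generate(input_ids, max_new_tokens=300, do_sample=True, temperature=0.8)
# Decode only the newly generated tokens, not the prompt
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```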
Documentation
Merge Details
Merge Method
This model was merged using the Model Stock merge method, with Nexesenex/Llama_3.x_70b_SmarTracks_V1.01 as the base.
Models Merged
The following models were included in the merge:
- [Steelskull/L3.3-Electra-R1-70b](https://huggingface.co/Steelskull/L3.3-Electra-R1-70b)
- NexesMess/Llama_3.3_70b_DoppelGanger_R1
- [nbeerbower/Llama3.1-Gutenberg-Doppel-70B](https://huggingface.co/nbeerbower/Llama3.1-Gutenberg-Doppel-70B)
- NexesMess/Llama_3.1_70b_Priestess_V1
- [migtissera/Tess-3-Llama-3.1-70B](https://huggingface.co/migtissera/Tess-3-Llama-3.1-70B)
Configuration
The following YAML configuration was used to produce this model:
merge_method: model_stock
models:
  - model: migtissera/Tess-3-Llama-3.1-70B
    parameters:
      weight: 1.0
  - model: nbeerbower/Llama3.1-Gutenberg-Doppel-70B
    parameters:
      weight: 1.0
  - model: NexesMess/Llama_3.1_70b_Priestess_V1
    parameters:
      weight: 1.0
  - model: Steelskull/L3.3-Electra-R1-70b
    parameters:
      weight: 1.0
  - model: NexesMess/Llama_3.3_70b_DoppelGanger_R1
    parameters:
      weight: 1.0
base_model: Nexesenex/Llama_3.x_70b_SmarTracks_V1.01
dtype: bfloat16
out_dtype: bfloat16
parameters:
  int8_mask: true
  normalize: true
  rescale: false
chat_template: auto
tokenizer:
  source: union
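To rebuild the merge from this recipe, mergekit can be driven from its CLI (for example `mergekit-yaml config.yaml ./merged-model`) or from Python. The snippet below is a sketch following the Python entry point shown in the mergekit README; the API can change between mergekit versions, and the config path, output path, and option values here are assumptions:

```python
import yaml
import torch
from mergekit.config import MergeConfiguration
from mergekit.merge import MergeOptions, run_merge

# Assumption: the YAML recipe above has been saved to config.yaml
with open("config.yaml", "r", encoding="utf-8") as f:
    merge_config = MergeConfiguration.model_validate(yaml.safe_load(f))

run_merge(
    merge_config,
    out_path="./Hexagon_Purple_V2",      # assumption: any output directory works
    options=MergeOptions(
        cuda=torch.cuda.is_available(),  # use a GPU for the merge arithmetic if present
        copy_tokenizer=True,             # build the merged (union) tokenizer alongside the weights
        lazy_unpickle=False,
        low_cpu_memory=False,
    ),
)
```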
Technical Details
This model was produced with the Model Stock merge method. By combining several fine-tuned Llama 3.x 70B models around a common base, it aims for better performance across benchmarks and creative tasks. The per-model weights (all set to 1.0 here) are declared in the configuration above; Model Stock then derives the interpolation between the averaged models and the base model from the geometry of their weight differences, as sketched below.
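For intuition only, the core idea behind Model Stock can be sketched per tensor as in the toy example below: average the fine-tuned weights, then interpolate back toward the base model with a ratio derived from the angle between the task vectors. This is a simplified illustration under stated assumptions (mean pairwise cosine as the angle estimate, tiny random tensors); it is not mergekit's actual model_stock implementation and ignores the int8_mask/normalize options used above.

```python
import numpy as np

def model_stock_tensor(base: np.ndarray, finetuned: list[np.ndarray]) -> np.ndarray:
    """Toy per-tensor Model Stock merge; not mergekit's implementation."""
    k = len(finetuned)
    deltas = [w - base for w in finetuned]  # "task vectors" of each fine-tune
    # Estimate cos(theta) as the mean pairwise cosine similarity between task vectors
    cosines = [
        float(np.dot(a.ravel(), b.ravel()) / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12))
        for i, a in enumerate(deltas) for b in deltas[i + 1:]
    ]
    cos_theta = sum(cosines) / len(cosines) if cosines else 1.0
    # Interpolation ratio from the Model Stock paper: t = k*cos / ((k-1)*cos + 1)
    t = k * cos_theta / ((k - 1) * cos_theta + 1)
    w_avg = np.mean(np.stack(finetuned), axis=0)
    return t * w_avg + (1 - t) * base

# Tiny synthetic demo: five "fine-tuned" perturbations of a random base tensor
rng = np.random.default_rng(0)
base = rng.normal(size=(4, 4))
variants = [base + 0.1 * rng.normal(size=base.shape) for _ in range(5)]
print(model_stock_tensor(base, variants).shape)  # (4, 4)
```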
License
This card does not state a license. Check the licenses of the individual models included in the merge, as well as the license of the mergekit library, for details.