Athena-70B-L3-i1-GGUF Open-Source Large Language Model - Free Support for English Text Generation

Athena 70B L3 I1 GGUF

Developed by mradermacher

Athena-70B-L3 is a large language model with 70B parameters, supporting English text generation tasks and utilizing parameter-efficient fine-tuning techniques.

Large Language Model

Transformers

English #70B Parameter Quantization #IQ Efficient Quantization #English Text Generation

Downloads 141

Release Time: 6/28/2024

Model Overview

Athena-70B-L3 is a large language model based on the transformers library, focusing on text generation tasks and suitable for scenarios requiring high-quality text output.

Model Features

Parameter-Efficient Fine-Tuning

Utilizes parameter-efficient fine-tuning techniques to reduce computational resource requirements while maintaining model performance.

Multiple Quantization Versions

Offers multiple quantization versions to adapt to different hardware and performance needs, ranging from extreme compression to high-quality inference.

High-Quality Text Generation

Focuses on generating high-quality, coherent English text, suitable for various text generation tasks.

Model Capabilities

Text Generation

Text Reasoning

Use Cases

Text Generation

Content Creation

Generate high-quality articles, stories, or other creative text content.

Dialogue Systems

Used to build intelligent dialogue systems, providing coherent and natural conversation responses.

🚀 Athena-70B-L3 Quantized Model

This project provides quantized versions of the Athena-70B-L3 model, offering various options for different usage scenarios.

🚀 Quick Start

If you're new to using GGUF files, check out TheBloke's READMEs for detailed instructions, including how to concatenate multi-part files.

✨ Features

Offers weighted/imatrix quants of the base model from https://huggingface.co/AiMavenAi/Athena-70B-L3.
Provides a variety of quantized versions with different sizes and characteristics.

📚 Documentation

About

The model is a quantized version of AiMavenAi/Athena-70B-L3. Weighted/imatrix quants are provided, and static quants can be found at mradermacher/Athena-70B-L3-GGUF.

Usage

Refer to TheBloke's READMEs for detailed instructions on using GGUF files, including how to handle multi-part files.
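As a rough illustration of what handling a multi-part file involves, here is a minimal Python sketch. It assumes the parts are plain byte-wise splits that only need to be joined in order; the function name `concatenate_parts` and the file names in the usage note are hypothetical and not taken from this repository.

```python
import shutil
from pathlib import Path


def concatenate_parts(part_paths, output_path):
    """Join split file parts, in the order given, into a single file.

    Assumption: each part is a raw byte slice of the original file,
    so joining them back to back restores it.
    """
    output_path = Path(output_path)
    with output_path.open("wb") as out:
        for part in part_paths:
            with Path(part).open("rb") as src:
                # Copy in 16 MiB chunks so a multi-gigabyte part
                # is never held in memory all at once.
                shutil.copyfileobj(src, out, 16 * 1024 * 1024)
    return output_path
```

A call might look like `concatenate_parts(["model.gguf.part1of2", "model.gguf.part2of2"], "model.gguf")`, with the real part names substituted. The order of the list matters: the parts are written exactly in the sequence passed in.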

Provided Quants

The following table lists the provided quantized models, sorted by size (not necessarily by quality). IQ-quants are often preferable over similar-sized non-IQ quants:

Link	Type	Size/GB	Notes
GGUF	i1-IQ1_S	15.4	for the desperate
GGUF	i1-IQ1_M	16.9	mostly desperate
GGUF	i1-IQ2_XXS	19.2
GGUF	i1-IQ2_XS	21.2
GGUF	i1-IQ2_S	22.3
GGUF	i1-IQ2_M	24.2
GGUF	i1-Q2_K	26.5	IQ3_XXS probably better
GGUF	i1-IQ3_XXS	27.6	lower quality
GGUF	i1-IQ3_XS	29.4
GGUF	i1-IQ3_S	31.0	beats Q3_K*
GGUF	i1-Q3_K_S	31.0	IQ3_XS probably better
GGUF	i1-IQ3_M	32.0
GGUF	i1-Q3_K_M	34.4	IQ3_S probably better
GGUF	i1-Q3_K_L	37.2	IQ3_M probably better
GGUF	i1-IQ4_XS	38.0
GGUF	i1-Q4_0	40.2	fast, low quality
GGUF	i1-Q4_K_S	40.4	optimal size/speed/quality
GGUF	i1-Q4_K_M	42.6	fast, recommended
GGUF	i1-Q5_K_S	48.8
GGUF	i1-Q5_K_M	50.0
PART 1 PART 2	i1-Q6_K	58.0	practically like static Q6_K
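One way to use the sizes in the table above is to pick the largest quant whose file fits a given budget. The sketch below is only an illustration: the helper name `largest_quant_within` is hypothetical, and the budget is an assumption you supply yourself. File size is a lower bound on what is needed at run time, since the context cache and runtime overhead need extra room, so leave some headroom.

```python
# (type, size in GB) pairs copied from the table above, in table order.
QUANTS = [
    ("i1-IQ1_S", 15.4), ("i1-IQ1_M", 16.9), ("i1-IQ2_XXS", 19.2),
    ("i1-IQ2_XS", 21.2), ("i1-IQ2_S", 22.3), ("i1-IQ2_M", 24.2),
    ("i1-Q2_K", 26.5), ("i1-IQ3_XXS", 27.6), ("i1-IQ3_XS", 29.4),
    ("i1-IQ3_S", 31.0), ("i1-Q3_K_S", 31.0), ("i1-IQ3_M", 32.0),
    ("i1-Q3_K_M", 34.4), ("i1-Q3_K_L", 37.2), ("i1-IQ4_XS", 38.0),
    ("i1-Q4_0", 40.2), ("i1-Q4_K_S", 40.4), ("i1-Q4_K_M", 42.6),
    ("i1-Q5_K_S", 48.8), ("i1-Q5_K_M", 50.0), ("i1-Q6_K", 58.0),
]


def largest_quant_within(budget_gb, quants=QUANTS):
    """Return the largest (type, size) pair with size <= budget_gb, or None.

    When two quants have the same size, the one listed first in the
    table is returned, because max() keeps the first maximal item.
    """
    fitting = [q for q in quants if q[1] <= budget_gb]
    return max(fitting, key=lambda q: q[1]) if fitting else None


# Example: a hypothetical 24 GB budget with no headroom reserved.
print(largest_quant_within(24.0))
```

Size alone does not decide quality, as the Notes column shows, so treat the result as a starting point and check the notes for a better choice of similar size.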

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):

And here are Artefact2's thoughts on the matter: Artefact2's Gist

FAQ / Model Request

For answers to common questions or if you want other models to be quantized, visit mradermacher/model_requests.

📄 License

This project is licensed under the cc-by-nc-nd-4.0 license.

🔧 Technical Details

Property	Details
Base Model	AiMavenAi/Athena-70B-L3
Library Name	transformers
License	cc-by-nc-nd-4.0
Quantized By	mradermacher
Tags	autotrain, text-generation-inference, text-generation, peft

Thanks

I'm grateful to my company, nethype GmbH, for allowing me to use its servers and upgrading my workstation, which enables me to do this work in my free time. I also want to thank @nicoboss for giving me access to his private supercomputer, which allows me to provide many more high-quality imatrix quants.
