# Llama-3.1-8B-Instuct-Uz-GGUF Open-Source Model - Supports Multi-Type Text Generation in Both Uzbek and English

Llama 3.1 8B Instuct Uz GGUF

Developed by mradermacher

This is a statically quantized version based on behbudiy/Llama-3.1-8B-Instuct-Uz, supporting Uzbek and English, suitable for various text generation tasks.

Large Language Model Supports Multiple Languages#Uzbek-English bilingual #Instruction fine-tuning optimization #Lightweight quantized deployment

Downloads 241

Release Time : 9/17/2024

Model Overview

This model is an 8B-parameter large language model, specifically fine-tuned for Uzbek and English instructions, suitable for tasks such as text generation, summarization, translation, and Q&A.

Model Features

Multilingual support

Optimized specifically for Uzbek and English, supporting bilingual text generation tasks.

Multiple quantized versions

Offers various quantized versions from Q2_K to f16 to meet different hardware and performance needs.

Instruction fine-tuning

Fine-tuned on datasets like alpaca-cleaned, making it more suitable for dialogue and instruction-following tasks.

Model Capabilities

Text generation

Summarization

Translation

Q&A

Use Cases

Language processing

Uzbek text generation

Generate coherent text in Uzbek

English-Uzbek bilingual translation

Translate text between English and Uzbek

Education

Language learning assistance

Help learners practice Uzbek

🚀 Llama-3.1-8B-Instuct-Uz Quantized Model

This project provides static quantizations of the Llama-3.1-8B-Instuct-Uz model, offering various quantization types for different usage scenarios.

✨ Features

Multi-language Support: Supports both Uzbek (uz) and English (en).
Diverse Datasets: Trained on multiple datasets including yahma/alpaca-cleaned, behbudiy/alpaca-cleaned-uz, and behbudiy/translation-instruction.
Multiple Quantization Types: Offers a range of quantization types with different sizes and qualities.

📦 Model Information

Property	Details
Base Model	behbudiy/Llama-3.1-8B-Instuct-Uz
Datasets	yahma/alpaca-cleaned, behbudiy/alpaca-cleaned-uz, behbudiy/translation-instruction
Languages	uz, en
Library Name	transformers
License	llama3.1
Quantized By	mradermacher
Tags	llama, text-generation-inference, summarization, translation, question-answering

🚀 Quick Start

About

These are static quants of https://huggingface.co/behbudiy/Llama-3.1-8B-Instuct-Uz.

Weighted/imatrix quants seem not to be available (by me) at this time. If they do not show up a week or so after the static ones, I have probably not planned for them. Feel free to request them by opening a Community Discussion.

Usage

If you are unsure how to use GGUF files, refer to one of TheBloke's READMEs for more details, including on how to concatenate multi-part files.

📚 Documentation

Provided Quants

(sorted by size, not necessarily quality. IQ - quants are often preferable over similar sized non - IQ quants)

Link	Type	Size/GB	Notes
GGUF	Q2_K	3.3
GGUF	IQ3_XS	3.6
GGUF	Q3_K_S	3.8
GGUF	IQ3_S	3.8	beats Q3_K*
GGUF	IQ3_M	3.9
GGUF	Q3_K_M	4.1	lower quality
GGUF	Q3_K_L	4.4
GGUF	IQ4_XS	4.6
GGUF	Q4_K_S	4.8	fast, recommended
GGUF	Q4_K_M	5.0	fast, recommended
GGUF	Q5_K_S	5.7
GGUF	Q5_K_M	5.8
GGUF	Q6_K	6.7	very good quality
GGUF	Q8_0	8.6	fast, best quality
GGUF	f16	16.2	16 bpw, overkill

Here is a handy graph by ikawrakow comparing some lower - quality quant types (lower is better):

And here are Artefact2's thoughts on the matter: https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9

FAQ / Model Request

See https://huggingface.co/mradermacher/model_requests for some answers to questions you might have and/or if you want some other model quantized.

📄 Thanks

I thank my company, nethype GmbH, for letting me use its servers and providing upgrades to my workstation to enable this work in my free time.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご