M

Meta Llama 3.1 8B Instruct FP8

Developed by RedHatAI
FP8 quantized version of Meta-Llama-3.1-8B-Instruct, suitable for multilingual business and research applications, specially optimized for assistant-like chat scenarios.
Downloads 361.53k
Release Time : 7/23/2024

Model Overview

This model is an FP8 quantized version of Meta-Llama-3.1-8B-Instruct, significantly reducing disk size and GPU memory requirements by decreasing the bit count per parameter from 16 to 8. Suitable for multilingual text generation tasks.

Model Features

FP8 quantization
Weights and activations quantized to FP8 data type, significantly reducing memory requirements and disk usage.
Multilingual support
Supports multiple languages including English, German, French, Italian, etc.
Efficient inference
Optimized for vLLM backend, delivering efficient inference performance.

Model Capabilities

Text generation
Multilingual support
Chat assistant

Use Cases

Chat assistant
Multilingual chatbot
Can be used to build chatbots supporting multiple languages, providing natural and fluent conversational experiences.
Business applications
Customer support
Used for automated customer support systems handling multilingual customer inquiries.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase