Meta Llama 3.1 8B FP8

Developed by RedHatAI
FP8 quantized version of Meta-Llama-3.1-8B, suitable for multilingual business and research applications.
Downloads 4,154
Release Time: 7/31/2024

Model Overview

This model is a quantized version of Meta-Llama-3.1-8B. Quantizing its weights and activations to the FP8 data type significantly reduces disk size and GPU memory requirements.
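
As a rough illustration of where the savings come from, the sketch below estimates the memory needed to hold the weights alone at 16 bits versus 8 bits per parameter. It assumes a nominal 8 billion parameters and a 16-bit baseline; neither figure is stated on this page, and the estimate ignores activations, the KV cache, and any layers left unquantized.

```python
def weight_memory_gib(num_params: float, bits_per_param: int) -> float:
    """Approximate memory needed to store the weights alone, in GiB."""
    bytes_total = num_params * bits_per_param / 8
    return bytes_total / 2**30


# Assumption: nominal "8B" parameter count, not an exact figure.
NUM_PARAMS = 8e9

# Assumption: the unquantized baseline stores 16 bits per parameter.
baseline_gib = weight_memory_gib(NUM_PARAMS, 16)
fp8_gib = weight_memory_gib(NUM_PARAMS, 8)
reduction = 1 - fp8_gib / baseline_gib

print(f"16-bit weights: {baseline_gib:.2f} GiB")
print(f"FP8 weights:    {fp8_gib:.2f} GiB")
print(f"Reduction:      {reduction:.0%}")
```

Halving the bits per parameter halves the weight storage, so any deviation from exactly half in practice would come from tensors that are kept at higher precision.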

Model Features

FP8 quantization
Quantizing weights and activations to the FP8 data type reduces disk size and GPU memory requirements by approximately 50%.
Multilingual support
Supports text generation in multiple languages, including English, German, and French.
High performance recovery rate
Recovers an average of 99.14% of the original model's score on the OpenLLM benchmarks, closely matching its performance.
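
A recovery rate of this kind is usually read as the quantized model's score expressed as a percentage of the original model's score. The minimal sketch below shows that calculation. The function name and the two scores are hypothetical placeholders, not published results, and whether the reported figure is a ratio of averages or an average of per-benchmark ratios is not stated here.

```python
def recovery_rate(quantized_score: float, baseline_score: float) -> float:
    """Quantized score as a percentage of the baseline score."""
    if baseline_score <= 0:
        raise ValueError("baseline_score must be positive")
    return 100.0 * quantized_score / baseline_score


# Hypothetical placeholder scores, chosen only to illustrate the formula.
baseline_avg = 60.0
quantized_avg = 59.4

print(f"Recovery: {recovery_rate(quantized_avg, baseline_avg):.2f}%")
```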

Model Capabilities

Text generation
Multilingual support
Business applications
Research purposes

Use Cases

Business applications
Multilingual customer service chatbot
Leverage the model's multilingual support to build efficient customer service chatbots.
Enables real-time interaction in multiple languages, improving customer satisfaction.
Research purposes
Language model research
Used to study the impact of quantization on language model performance.
Provides efficient quantized models for research and experimentation.