L

Llama 4 Scout 17B 16E Instruct Bnb 8bit

Developed by bnb-community
The Llama 4 series is a multimodal AI model developed by Meta, supporting text and image interaction, utilizing a Mixture of Experts (MoE) architecture, and demonstrating leading performance in text and image comprehension.
Downloads 132
Release Time : 4/7/2025

Model Overview

This model is the int8 quantized version of Llama-4-Scout-17B-16E-Instruct, natively supporting multimodal input (text + images) and multilingual output, suitable for complex reasoning tasks.

Model Features

Multimodal Fusion
Supports simultaneous processing of text and image inputs, enabling cross-modal understanding and reasoning.
Mixture of Experts Architecture
Utilizes a 16-expert system with 17B active parameters and 1090B total parameters, balancing performance and efficiency.
Long Context Support
10M tokens context window, suitable for processing long documents and complex tasks.
Multilingual Optimization
Comprehensively tested in 12 languages, with pretraining covering 200 languages.

Model Capabilities

Multimodal dialogue
Cross-language text generation
Image comparison analysis
Code generation and explanation
Long document comprehension
Knowledge Q&A

Use Cases

Intelligent Assistant
Multimodal Customer Service
Resolve user issues through text and image interaction
Achieved 89.4 points in DocVQA testing
Education
Multilingual Learning
Supports translation and explanation in 12 languages
Book translation chrF score 42.2/36.6
Research & Development
Code Assistance
Generates and optimizes code based on requirements
MBPP benchmark score 67.8
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase