L

Llama 4 Scout 17B 16E Instruct

Developed by meta-llama
Llama 4 Scout is a multimodal AI model developed by Meta, featuring a mixture-of-experts architecture, supporting text and image interactions in 12 languages, with 17B active parameters and 109B total parameters.
Downloads 817.62k
Release Time : 4/2/2025

Model Overview

A native multimodal large language model with industry-leading performance in text and image understanding, suitable for commercial and research purposes.

Model Features

Multimodal Support
Processes both text and image inputs for cross-modal understanding and generation
Mixture-of-Experts Architecture
Utilizes 16 expert configuration, achieving 109B total parameter capacity while maintaining 17B active parameters
Long Context Processing
Supports 10M token context window, suitable for long documents and complex tasks
Multilingual Capability
Natively supports 12 languages, covering major Asian and European language families

Model Capabilities

Multilingual text generation
Image content understanding
Cross-modal reasoning
Code generation
Long document translation
Visual question answering

Use Cases

Intelligent Assistant
Multimodal Chatbot
Processes both user-uploaded images and text queries simultaneously
Generates natural language responses incorporating visual information
Content Analysis
Cross-media Content Understanding
Analyzes relationships and semantics of mixed text-image content
Enables commercial applications like advertising compliance checks
Education
Visual Math Problem Solving
Interprets problems containing mathematical formulas and charts
Achieves 70.7 score on MathVista benchmark
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase