D

Debiased Llama 4 Scout 17B 16E Instruct

Developed by hirundo-io
Llama 4 Scout is a native multimodal AI model launched by Meta, supporting multilingual text and image understanding. It adopts the Mixture of Experts architecture and has industry-leading performance in text and image understanding.
Downloads 1,716
Release Time : 4/14/2025

Model Overview

Llama 4 Scout is a multimodal model that supports multilingual text and image understanding and can be used for tasks such as visual recognition, image reasoning, and image caption generation.

Model Features

Multimodal support
Supports multilingual text and image understanding and can be used for tasks such as visual recognition, image reasoning, and image caption generation.
High performance
Adopts the Mixture of Experts architecture and has industry-leading performance in text and image understanding.
Multilingual support
Supports multiple languages such as Arabic, English, French, German, Hindi, Indonesian, Italian, Portuguese, Spanish, Tagalog, Thai, and Vietnamese.
Customizability
Supports model fine-tuning and can be customized according to specific application scenarios.

Model Capabilities

Text generation
Image analysis
Multilingual understanding
Visual reasoning
Image caption generation

Use Cases

Visual recognition
Image description
Generate a detailed description of the input image.
Achieved a relaxed_accuracy of 83.4 in the ChartQA benchmark test.
Image reasoning
Image similarity analysis
Analyze the similarity and difference between two images.
Achieved an accuracy of 69.4 in the MMMU benchmark test.
Multilingual application
Multilingual text generation
Supports text generation tasks in multiple languages.
Achieved an average/em of 90.6 in the MGSM benchmark test.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase