Mistral-Small-3.1-24B-Instruct Open-Source Model - Supports Multilingual and Visual Understanding, Enables Efficient Local Deployment and Inference

Mistral Small 3.1 24B Instruct 2503 Q5 K M GGUF

Developed by Triangle104

A 24B-parameter instruction-tuned model based on Mistral Small 3.1, supporting multilingual and visual understanding, suitable for local deployment and efficient inference.

Text-to-Image Supports Multiple LanguagesOpen Source License:Apache-2.0 #128k long-context understanding #Multimodal visual analysis #Low-latency function calling

Downloads 57

Release Time : 4/6/2025

Model Overview

Mistral Small 3.1 is a multilingual and visual understanding model with 24 billion parameters, supporting a 128k-token context window, suitable for text and visual tasks.

Model Features

Visual capabilities

The model can analyze images and provide insights based on visual content while integrating textual information.

Multilingual support

Supports dozens of languages, including English, French, German, Chinese, and more.

Agent-centric

Offers top-tier agent capabilities, supporting native function calling and JSON output.

Advanced reasoning

Equipped with state-of-the-art conversational and reasoning abilities.

128k-token context window

Supports long-document understanding and complex task processing.

Model Capabilities

Text generation

Visual understanding

Multilingual processing

Function calling

Long-document understanding

Programming and mathematical reasoning

Use Cases

Conversational agents

Low-latency conversational agent

Suitable for conversational scenarios requiring low latency.

Local inference

Local inference for sensitive data

Ideal for individuals or organizations processing sensitive data locally.

Programming and mathematics

Programming and mathematical reasoning

Suitable for programming assistance and solving mathematical problems.

Visual understanding

Image analysis

Capable of analyzing images and providing insights.

🚀 Triangle104/Mistral-Small-3.1-24B-Instruct-2503-Q5_K_M-GGUF

This project is a conversion of the Mistral model to the GGUF format, offering multilingual support and advanced vision and text capabilities.

📋 Model Information

Property	Details
Base Model	mistralai/Mistral-Small-3.1-24B-Instruct-2503
Library Name	vllm
License	apache-2.0
Pipeline Tag	image-text-to-text
Tags	llama-cpp, gguf-my-repo
Inference	false

⚠️ Important Note

If you want to learn more about how we process your personal data, please read our Privacy Policy.

🚀 Quick Start

This model was converted to GGUF format from mistralai/Mistral-Small-3.1-24B-Instruct-2503 using llama.cpp via the ggml.ai's GGUF-my-repo space. Refer to the original model card for more details on the model.

✨ Features

Vision: Vision capabilities enable the model to analyze images and provide insights based on visual content in addition to text.
Multilingual: Supports dozens of languages, including English, French, German, Greek, Hindi, Indonesian, Italian, Japanese, Korean, Malay, Nepali, Polish, Portuguese, Romanian, Russian, Serbian, Spanish, Swedish, Turkish, Ukrainian, Vietnamese, Arabic, Bengali, Chinese, Farsi.
Agent-Centric: Offers best-in-class agentic capabilities with native function calling and JSON outputting.
Advanced Reasoning: State-of-the-art conversational and reasoning capabilities.
Apache 2.0 License: Open license allowing usage and modification for both commercial and non-commercial purposes.
Context Window: A 128k context window.
System Prompt: Maintains strong adherence and support for system prompts.
Tokenizer: Utilizes a Tekken tokenizer with a 131k vocabulary size.

📦 Installation

Install llama.cpp through brew (works on Mac and Linux)

brew install llama.cpp

💻 Usage Examples

Basic Usage

Invoke the llama.cpp server or the CLI.

CLI:

llama-cli --hf-repo Triangle104/Mistral-Small-3.1-24B-Instruct-2503-Q5_K_M-GGUF --hf-file mistral-small-3.1-24b-instruct-2503-q5_k_m.gguf -p "The meaning to life and the universe is"

Server:

llama-server --hf-repo Triangle104/Mistral-Small-3.1-24B-Instruct-2503-Q5_K_M-GGUF --hf-file mistral-small-3.1-24b-instruct-2503-q5_k_m.gguf -c 2048

Advanced Usage

You can also use this checkpoint directly through the usage steps listed in the Llama.cpp repo.

Step 1: Clone llama.cpp from GitHub.

git clone https://github.com/ggerganov/llama.cpp

Step 2: Move into the llama.cpp folder and build it with LLAMA_CURL=1 flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux).

cd llama.cpp && LLAMA_CURL=1 make

Step 3: Run inference through the main binary.

./llama-cli --hf-repo Triangle104/Mistral-Small-3.1-24B-Instruct-2503-Q5_K_M-GGUF --hf-file mistral-small-3.1-24b-instruct-2503-q5_k_m.gguf -p "The meaning to life and the universe is"

./llama-server --hf-repo Triangle104/Mistral-Small-3.1-24B-Instruct-2503-Q5_K_M-GGUF --hf-file mistral-small-3.1-24b-instruct-2503-q5_k_m.gguf -c 2048

Model Capabilities and Use Cases

Building upon Mistral Small 3 (2501), Mistral Small 3.1 (2503) adds state-of-the-art vision understanding and enhances long context capabilities up to 128k tokens without compromising text performance. With 24 billion parameters, this model achieves top-tier capabilities in both text and vision tasks. This model is an instruction-finetuned version of: Mistral-Small-3.1-24B-Base-2503.

Mistral Small 3.1 can be deployed locally and is exceptionally "knowledge-dense," fitting within a single RTX 4090 or a 32GB RAM MacBook once quantized.

It is ideal for:

Fast-response conversational agents.
Low-latency function calling.
Subject matter experts via fine-tuning.
Local inference for hobbyists and organizations handling sensitive data.
Programming and math reasoning.
Long document understanding.
Visual understanding.

For enterprises requiring specialized capabilities (increased context, specific modalities, domain-specific knowledge, etc.), we will release commercial models beyond what Mistral AI contributes to the community.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご