Mistral Small 3.2 24B Instruct 2506

M

Mistral Small 3.2 24B Instruct 2506

由unsloth開發

Mistral-Small-3.2-24B-Instruct-2506是一個圖像文本到文本的模型，是Mistral-Small-3.1-24B-Instruct-2503的更新版本，在指令遵循、減少重複錯誤和函數調用等方面有所改進。

文本生成圖像

支持多種語言開源協議:Apache-2.0 #多模態指令跟隨 #低重複錯誤 #函數調用優化

下載量 1,750

發布時間 : 6/20/2025

模型概述

該模型是一個多語言、多模態的指令遵循模型，支持圖像和文本輸入，輸出為文本。適用於多種任務，包括視覺推理、函數調用和文本生成。

模型特點

改進的指令遵循能力

Small-3.2更擅長遵循精確的指令，提供更準確的響應。

減少重複錯誤

Small-3.2減少了無限生成或重複答案的情況，提高了輸出的穩定性。

增強的函數調用能力

Small-3.2的函數調用模板更加健壯，支持更復雜的任務。

多語言支持

支持24種語言，適用於全球範圍內的多語言任務。

多模態能力

支持圖像和文本輸入，適用於視覺推理任務。

模型能力

文本生成

視覺推理

函數調用

多語言處理

指令遵循

使用案例

視覺推理

遊戲場景分析

分析遊戲場景圖像，提供最佳行動建議。

模型能夠根據圖像內容生成詳細的行動建議和分析。

函數調用

人口數據查詢

根據圖像中的國家信息，調用函數查詢人口數據。

模型能夠正確識別圖像中的國家並調用函數獲取人口數據。

文本生成

指令遵循

生成符合特定指令的文本，如按字母順序排列的句子。

模型能夠嚴格遵循指令生成符合要求的文本。

🚀 Mistral-Small-3.2-24B-Instruct-2506

Mistral-Small-3.2-24B-Instruct-2506是一個圖像文本到文本的模型，是Mistral-Small-3.1-24B-Instruct-2503的一個小更新版本。它在指令遵循、減少重複錯誤和函數調用等方面有所改進，能為用戶提供更準確和高效的服務。

🚀 快速開始

本模型支持以下語言：英語、法語、德語、西班牙語、葡萄牙語、意大利語、日語、韓語、俄語、中文、阿拉伯語、波斯語、印尼語、馬來語、尼泊爾語、波蘭語、羅馬尼亞語、塞爾維亞語、瑞典語、土耳其語、烏克蘭語、越南語、印地語、孟加拉語。

許可證為Apache-2.0，庫名稱為vllm。

✨ 主要特性

Mistral-Small-3.2-24B-Instruct-2506在以下幾個方面進行了改進：

指令遵循：Small-3.2更擅長遵循精確的指令。
重複錯誤：Small-3.2減少了無限生成或重複答案的情況。
函數調用：Small-3.2的函數調用模板更加健壯（詳見此處和示例）。

在其他所有方面，Small-3.2與Mistral-Small-3.1-24B-Instruct-2503相比，表現相當或略有提升。

其關鍵特性與Mistral-Small-3.1-24B-Instruct-2503相同。

📊 基準測試結果

我們將Mistral-Small-3.2-24B與Mistral-Small-3.1-24B-Instruct-2503進行了比較。如需查看與其他類似規模模型的更多比較，請參考Mistral-Small-3.1的基準測試。

文本性能

指令遵循/聊天/語氣

模型	Wildbench v2	Arena Hard v2	IF（內部；準確率）
Small 3.1 24B Instruct	55.6%	19.56%	82.75%
Small 3.2 24B Instruct	65.33%	43.1%	84.78%

無限生成情況

Small 3.2在處理具有挑戰性、長且重複的提示時，將無限生成情況減少了一半。

模型	無限生成情況（內部；數值越低越好）
Small 3.1 24B Instruct	2.11%
Small 3.2 24B Instruct	1.29%

STEM領域

模型	MMLU	MMLU Pro（5次少樣本思維鏈）	MATH	GPQA Main（5次少樣本思維鏈）	GPQA Diamond（5次少樣本思維鏈）	MBPP Plus - Pass@5	HumanEval Plus - Pass@5	SimpleQA（總準確率）
Small 3.1 24B Instruct	80.62%	66.76%	69.30%	44.42%	45.96%	74.63%	88.99%	10.43%
Small 3.2 24B Instruct	80.50%	69.06%	69.42%	44.22%	46.13%	78.33%	92.90%	12.10%

視覺性能

模型	MMMU	Mathvista	ChartQA	DocVQA	AI2D
Small 3.1 24B Instruct	64.00%	68.91%	86.24%	94.08%	93.72%
Small 3.2 24B Instruct	62.50%	67.09%	87.4%	94.86%	92.91%

📦 安裝指南

vLLM（推薦）

我們建議使用vLLM來運行此模型。

安裝步驟

確保安裝vLLM >= 0.9.1：

pip install vllm --upgrade

這樣做應該會自動安裝mistral_common >= 1.6.2。

要檢查是否安裝成功，可以運行以下命令：

python -c "import mistral_common; print(mistral_common.__version__)"

你也可以使用現成的Docker鏡像，或者從Docker Hub獲取。

服務啟動

我們建議在服務器/客戶端環境中使用Mistral-Small-3.2-24B-Instruct-2506。

啟動服務器：

vllm serve mistralai/Mistral-Small-3.2-24B-Instruct-2506 --tokenizer_mode mistral --config_format mistral --load_format mistral --tool-call-parser mistral --enable-auto-tool-choice --limit_mm_per_prompt 'image=10' --tensor-parallel-size 2

注意：在GPU上運行Mistral-Small-3.2-24B-Instruct-2506，在bf16或fp16格式下大約需要55GB的GPU內存。

你可以使用一個簡單的Python代碼片段來測試客戶端。具體示例如下。

💻 使用示例

基礎用法

視覺推理

利用Mistral-Small-3.2-24B-Instruct-2506的視覺能力，在給定場景下做出最佳選擇。

from datetime import datetime, timedelta

from openai import OpenAI
from huggingface_hub import hf_hub_download

# Modify OpenAI's API key and API base to use vLLM's API server.
openai_api_key = "EMPTY"
openai_api_base = "http://localhost:8000/v1"

TEMP = 0.15
MAX_TOK = 131072

client = OpenAI(
    api_key=openai_api_key,
    base_url=openai_api_base,
)

models = client.models.list()
model = models.data[0].id


def load_system_prompt(repo_id: str, filename: str) -> str:
    file_path = hf_hub_download(repo_id=repo_id, filename=filename)
    with open(file_path, "r") as file:
        system_prompt = file.read()
    today = datetime.today().strftime("%Y-%m-%d")
    yesterday = (datetime.today() - timedelta(days=1)).strftime("%Y-%m-%d")
    model_name = repo_id.split("/")[-1]
    return system_prompt.format(name=model_name, today=today, yesterday=yesterday)


model_id = "mistralai/Mistral-Small-3.2-24B-Instruct-2506"
SYSTEM_PROMPT = load_system_prompt(model_id, "SYSTEM_PROMPT.txt")
image_url = "https://static.wikia.nocookie.net/essentialsdocs/images/7/70/Battle.png/revision/latest?cb=20220523172438"

messages = [
    {"role": "system", "content": SYSTEM_PROMPT},
    {
        "role": "user",
        "content": [
            {
                "type": "text",
                "text": "What action do you think I should take in this situation? List all the possible actions and explain why you think they are good or bad.",
            },
            {"type": "image_url", "image_url": {"url": image_url}},
        ],
    },
]


response = client.chat.completions.create(
    model=model,
    messages=messages,
    temperature=TEMP,
    max_tokens=MAX_TOK,
)

print(response.choices[0].message.content)
# In this situation, you are playing a Pokémon game where your Pikachu (Level 42) is facing a wild Pidgey (Level 17). Here are the possible actions you can take and an analysis of each:

# 1. **FIGHT**:
#    - **Pros**: Pikachu is significantly higher level than the wild Pidgey, which suggests that it should be able to defeat Pidgey easily. This could be a good opportunity to gain experience points and possibly items or money.
#    - **Cons**: There is always a small risk of Pikachu fainting, especially if Pidgey has a powerful move or a status effect that could hinder Pikachu. However, given the large level difference, this risk is minimal.

# 2. **BAG**:
#    - **Pros**: You might have items in your bag that could help in this battle, such as Potions, Poké Balls, or Berries. Using an item could help you capture the Pidgey or heal your Pikachu if needed.
#    - **Cons**: Using items might not be necessary given the level difference. It could be more efficient to just fight and defeat the Pidgey quickly.

# 3. **POKÉMON**:
#    - **Pros**: You might have another Pokémon in your party that is better suited for this battle or that you want to gain experience. Switching Pokémon could also be a strategic move if you want to train a lower-level Pokémon.
#    - **Cons**: Switching Pokémon might not be necessary since Pikachu is at a significant advantage. It could also waste time and potentially give Pidgey a turn to attack.

# 4. **RUN**:
#    - **Pros**: Running away could save time and conserve your Pokémon's health and resources. If you are in a hurry or do not need the experience or items, running away is a safe option.
#    - **Cons**: Running away means you miss out on the experience points and potential items or money that you could gain from defeating the Pidgey. It also means you do not get the chance to capture the Pidgey if you wanted to.

# ### Recommendation:
# Given the significant level advantage, the best action is likely to **FIGHT**. This will allow you to quickly defeat the Pidgey, gain experience points, and potentially earn items or money. If you are concerned about Pikachu's health, you could use an item from your **BAG** to heal it before or during the battle. Running away or switching Pokémon does not seem necessary in this situation.

高級用法

函數調用

Mistral-Small-3.2-24B-Instruct-2506在通過vLLM進行函數/工具調用任務方面表現出色。例如：

from openai import OpenAI
from huggingface_hub import hf_hub_download

# Modify OpenAI's API key and API base to use vLLM's API server.
openai_api_key = "EMPTY"
openai_api_base = "http://localhost:8000/v1"

TEMP = 0.15
MAX_TOK = 131072

client = OpenAI(
    api_key=openai_api_key,
    base_url=openai_api_base,
)

models = client.models.list()
model = models.data[0].id

def load_system_prompt(repo_id: str, filename: str) -> str:
    file_path = hf_hub_download(repo_id=repo_id, filename=filename)
    with open(file_path, "r") as file:
        system_prompt = file.read()
    return system_prompt

model_id = "mistralai/Mistral-Small-3.2-24B-Instruct-2506"
SYSTEM_PROMPT = load_system_prompt(model_id, "SYSTEM_PROMPT.txt")

image_url = "https://huggingface.co/datasets/patrickvonplaten/random_img/resolve/main/europe.png"

tools = [
    {
        "type": "function",
        "function": {
            "name": "get_current_population",
            "description": "Get the up-to-date population of a given country.",
            "parameters": {
                "type": "object",
                "properties": {
                    "country": {
                        "type": "string",
                        "description": "The country to find the population of.",
                    },
                    "unit": {
                        "type": "string",
                        "description": "The unit for the population.",
                        "enum": ["millions", "thousands"],
                    },
                },
                "required": ["country", "unit"],
            },
        },
    },
    {
        "type": "function",
        "function": {
            "name": "rewrite",
            "description": "Rewrite a given text for improved clarity",
            "parameters": {
                "type": "object",
                "properties": {
                    "text": {
                        "type": "string",
                        "description": "The input text to rewrite",
                    }
                },
            },
        },
    },
]

messages = [
    {"role": "system", "content": SYSTEM_PROMPT},
    {
        "role": "user",
        "content": "Could you please make the below article more concise?\n\nOpenAI is an artificial intelligence research laboratory consisting of the non-profit OpenAI Incorporated and its for-profit subsidiary corporation OpenAI Limited Partnership.",
    },
    {
        "role": "assistant",
        "content": "",
        "tool_calls": [
            {
                "id": "bbc5b7ede",
                "type": "function",
                "function": {
                    "name": "rewrite",
                    "arguments": '{"text": "OpenAI is an artificial intelligence research laboratory consisting of the non-profit OpenAI Incorporated and its for-profit subsidiary corporation OpenAI Limited Partnership."}',
                },
            }
        ],
    },
    {
        "role": "tool",
        "content": '{"action":"rewrite","outcome":"OpenAI is a FOR-profit company."}',
        "tool_call_id": "bbc5b7ede",
        "name": "rewrite",
    },
    {
        "role": "assistant",
        "content": "---\n\nOpenAI is a FOR-profit company.",
    },
    {
        "role": "user",
        "content": [
            {
                "type": "text",
                "text": "Can you tell me what is the biggest country depicted on the map?",
            },
            {
                "type": "image_url",
                "image_url": {
                    "url": image_url,
                },
            },
        ],
    }
]

response = client.chat.completions.create(
    model=model,
    messages=messages,
    temperature=TEMP,
    max_tokens=MAX_TOK,
    tools=tools,
    tool_choice="auto",
)

assistant_message = response.choices[0].message.content
print(assistant_message)
# The biggest country depicted on the map is Russia.

messages.extend([
    {"role": "assistant", "content": assistant_message},
    {"role": "user", "content": "What is the population of that country in millions?"},
])

response = client.chat.completions.create(
    model=model,
    messages=messages,
    temperature=TEMP,
    max_tokens=MAX_TOK,
    tools=tools,
    tool_choice="auto",
)

print(response.choices[0].message.tool_calls)
# [ChatCompletionMessageToolCall(id='3e92V6Vfo', function=Function(arguments='{"country": "Russia", "unit": "millions"}', name='get_current_population'), type='function')]

指令遵循

Mistral-Small-3.2-24B-Instruct-2506能夠嚴格遵循你的指令。

from openai import OpenAI
from huggingface_hub import hf_hub_download

# Modify OpenAI's API key and API base to use vLLM's API server.
openai_api_key = "EMPTY"
openai_api_base = "http://localhost:8000/v1"

TEMP = 0.15
MAX_TOK = 131072

client = OpenAI(
    api_key=openai_api_key,
    base_url=openai_api_base,
)

models = client.models.list()
model = models.data[0].id


def load_system_prompt(repo_id: str, filename: str) -> str:
    file_path = hf_hub_download(repo_id=repo_id, filename=filename)
    with open(file_path, "r") as file:
        system_prompt = file.read()
    return system_prompt


model_id = "mistralai/Mistral-Small-3.2-24B-Instruct-2506"
SYSTEM_PROMPT = load_system_prompt(model_id, "SYSTEM_PROMPT.txt")

messages = [
    {"role": "system", "content": SYSTEM_PROMPT},
    {
        "role": "user",
        "content": "Write me a sentence where every word starts with the next letter in the alphabet - start with 'a' and end with 'z'.",
    },
]

response = client.chat.completions.create(
    model=model,
    messages=messages,
    temperature=TEMP,
    max_tokens=MAX_TOK,
)

assistant_message = response.choices[0].message.content
print(assistant_message)

# Here's a sentence where each word starts with the next letter of the alphabet, starting from 'a' and ending with 'z':

# "Always brave ca

注意事項

⚠️ 重要提示

建議使用相對較低的溫度，例如temperature=0.15。

確保為模型添加系統提示，以使其更好地滿足你的需求。如果你想將該模型作為通用助手使用，建議使用SYSTEM_PROMPT.txt文件中提供的提示。

📄 許可證

本項目採用Apache-2.0許可證。

精選推薦AI模型

Llama 3 Typhoon V1.5x 8b Instruct

專為泰語設計的80億參數指令模型，性能媲美GPT-3.5-turbo，優化了應用場景、檢索增強生成、受限生成和推理任務

大型語言模型

Transformers 支持多種語言

Cadet-Tiny是一個基於SODA數據集訓練的超小型對話模型，專為邊緣設備推理設計，體積僅為Cosmo-3B模型的2%左右。

Transformers 英語

Roberta Base Chinese Extractive Qa

基於RoBERTa架構的中文抽取式問答模型，適用於從給定文本中提取答案的任務。

問答系統中文

AIbase

智啟未來，您的人工智能解決方案智庫

English 简体中文繁體中文にほんご

© 2025AIbase