M

Magistral Small 2506 Vision

Developed by OptimusePrime
Magistral-Small-2506-Vision is an inference fine-tuned version based on Mistral Small 3.1 with GRPO training, an experimental checkpoint with visual capabilities.
Downloads 125
Release Time : 6/13/2025

Model Overview

This model is an inference fine-tuned version of Mistral Small 3.1 with GRPO training. It ported the visual encoder of Mistral Small 3.1, enabling it to process images. Despite being fine-tuned only on text data, it still shows moderate improvement in multimodal benchmarks.

Model Features

Multilingual support
Supports multiple languages such as English, French, German, Spanish, Portuguese, Italian, Japanese, Korean, Russian, Chinese, Arabic, Persian, Indonesian, Malay, Nepali, Polish, Romanian, Serbian, Swedish, Turkish, Ukrainian, Vietnamese, Hindi, and Bengali.
Visual ability
By porting the visual encoder of Mistral Small 3.1, the model is enabled to process images.
Generalization of reasoning ability
Despite being fine-tuned only on text data, it still shows moderate improvement in multimodal benchmarks, indicating that the reasoning ability can be generalized to multimodal data.

Model Capabilities

Text generation
Image analysis
Multimodal reasoning

Use Cases

Multimodal tasks
Image description generation
Generate descriptive text based on the input image.
Multimodal question answering
Answer questions by combining image and text inputs.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase