K

Kimi VL A3B Thinking 2506

Developed by moonshotai
Kimi-VL-A3B-Thinking-2506 is an upgraded version of Kimi-VL-A3B-Thinking, with significant improvements in multimodal reasoning, visual perception and understanding, video scene processing, etc. It supports higher-resolution images and can achieve more intelligent thinking while consuming fewer tokens.
Downloads 515
Release Time : 6/21/2025

Model Overview

This is a multimodal vision-language model that focuses on image-text to text tasks and has powerful visual understanding and reasoning capabilities.

Model Features

More intelligent thinking, less token consumption
Achieve better accuracy in multimodal reasoning benchmark tests, while the average required thinking length is reduced by 20%
Improved visual perception and understanding capabilities
Achieve the same or better capabilities in general visual perception and understanding, surpassing or matching the capabilities of non-thinking models
Video scene processing capabilities
Improvements in video reasoning and understanding benchmark tests, setting a new technical level for open-source models
High-resolution support
Supports a total of 3.2 million pixels per image, which is 4 times that of the previous version, bringing significant improvements in high-resolution perception and OS agent grounding benchmark tests

Model Capabilities

Multimodal reasoning
Visual perception
Image understanding
Video understanding
High-resolution image processing
Long text processing
Mathematical reasoning
Document processing

Use Cases

Visual question answering
Image content recognition
Identify objects or scenes in the image
Such as accurately identifying the breed of a cat
Video understanding
Video content analysis
Understand the scenes and actions in the video
Achieve an accuracy of 65.2 in the VideoMMMU benchmark test
Mathematical reasoning
Visual mathematical problem solving
Solve mathematical problems containing visual elements
Achieve an accuracy of 80.1 in the MathVista_MINI benchmark test
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase