I

Internvl3 38B

Developed by FriendliAI
InternVL3-38B is an advanced multimodal large language model that excels in multimodal perception, reasoning, and other capabilities. It shows significant improvements compared to previous models and also expands multimodal capabilities such as tool use and GUI agents.
Downloads 166
Release Time : 4/12/2025

Model Overview

InternVL3-38B is a multimodal large language model with powerful multimodal perception and reasoning capabilities, supporting various application scenarios such as tool use and GUI agents.

Model Features

Advanced multimodal capabilities
Compared to InternVL 2.5, InternVL3 demonstrates more excellent multimodal perception and reasoning capabilities, and also extends multimodal capabilities to areas such as tool use, GUI agents, industrial image analysis, and 3D visual perception.
Excellent language performance
Compared with the Qwen2.5 Chat model, thanks to native multimodal pre - training, the InternVL3 series performs better in overall text performance.
Flexible model architecture
Adopts the 'ViT - MLP - LLM' paradigm, integrating the new incremental pre - trained InternViT and various pre - trained large language models, such as InternLM 3 and Qwen 2.5.
Efficient training strategy
Proposes a native multimodal pre - training method that integrates language and visual learning into one pre - training stage; uses high - quality and diverse training data in the supervised fine - tuning stage; adopts the Mixed Preference Optimization (MPO) method to improve reasoning performance.

Model Capabilities

Multimodal perception
Multimodal reasoning
Tool use
GUI agent
Industrial image analysis
3D visual perception
Text generation
Image analysis

Use Cases

Multimodal reasoning
Multimodal reasoning task
Performs well in multiple multimodal reasoning benchmark tests.
InternVL3-38B scores 4.5 points higher than its corresponding model.
GUI operation
GUI agent
Supports GUI operation tasks.
Industrial image analysis
Industrial image analysis
Supports industrial image analysis tasks.
Featured Recommended AI Models
ยฉ 2025AIbase