T

TBAC VLR1 3B Preview

Developed by TencentBAC
A multimodal language model fine-tuned by Tencent PCG Basic Algorithm Center, optimized based on Qwen2.5-VL-3B-Instruct, achieving state-of-the-art performance in multiple multimodal reasoning benchmarks among models of the same scale
Downloads 328
Release Time : 4/16/2025

Model Overview

A vision-language model enhanced with Group Relative Policy Optimization (GRPO) technology to improve multimodal reasoning capabilities

Model Features

GRPO Optimization Technology
Utilizes Group Relative Policy Optimization technology to enhance multimodal reasoning capabilities
Leading Performance
Achieves state-of-the-art performance in multiple multimodal reasoning benchmarks among models of the same scale
Mathematical Reasoning Capability
Excels in mathematical reasoning benchmarks such as MathVista

Model Capabilities

Multimodal Understanding
Vision-Language Reasoning
Mathematical Problem Solving
Logical Reasoning
Image-Text Generation

Use Cases

Education
Math Problem Solving
Analyzes questions containing mathematical formulas and diagrams
Achieves a score of 64.8 on the MathVista benchmark
Research
Multimodal Reasoning Research
Used for research on vision-language reasoning tasks
Achieves an average score of 35.7 in comprehensive evaluations
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase