Model Selection

Multimodal image-text processing

# Multimodal image-text processing

A vision-language model fine-tuned on a mathematical multimodal dataset, with text generation and scoring functions, designed specifically for mathematical problem analysis and feedback.

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase