# Joint training of vision and language

Math LLaVA
Math-LLaVA-13B is an open-source multimodal large language model, fine-tuned from LLaVA-1.5-13B on the MathV360K dataset. It is suited to multimodal reasoning and question-answering tasks.
Text-to-Image, Transformers
Zhiqiang007
106
5
© 2025 AIbase