AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Visual-text fusion

# Visual-text fusion

Mixtral AI Vision 128k 7b
MIT
A multimodal model that combines visual and language abilities, achieving image-text interaction through a merging method
Image-to-Text Transformers English
M
LeroyDyer
384
4
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase