# High-precision Visual Features
Nomic Embed Vision V1.5
Apache-2.0
A high-performance vision embedding model that shares its embedding space with nomic-embed-text-v1.5, enabling multimodal applications.
Text-to-Image
Transformers, English

nomic-ai
27.85k
161
VinVL Base Image Captioning
Apache-2.0
Microsoft's VinVL pre-trained foundation model, designed specifically for image captioning, with strong vision-language understanding.
Image-to-Text
michelecafagna26
45
1
Featured Recommended AI Models