# Vision-Language Bidirectional Understanding
Blip Image Captioning Base
BSD-3-Clause
BLIP is a pretrained vision-language model for image captioning. It supports both conditional text generation (the caption continues a supplied text prompt) and unconditional text generation (the caption is produced from the image alone).
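The two generation modes can be illustrated with a minimal sketch using the Hugging Face Transformers classes `BlipProcessor` and `BlipForConditionalGeneration`. The Hub identifier `Salesforce/blip-image-captioning-base` is an assumption inferred from the model name and author listed here, and the helper names `processor_kwargs` and `caption` are hypothetical, introduced only for illustration.

```python
from typing import Optional

# Assumed Hugging Face Hub id, inferred from the model name and author on this page.
MODEL_ID = "Salesforce/blip-image-captioning-base"


def processor_kwargs(prompt: Optional[str] = None) -> dict:
    """Build processor arguments (hypothetical helper).

    Conditional captioning passes a text prefix that the caption continues;
    unconditional captioning passes no text at all.
    """
    kwargs = {"return_tensors": "pt"}
    if prompt:
        kwargs["text"] = prompt
    return kwargs


def caption(image_path: str, prompt: Optional[str] = None,
            max_new_tokens: int = 30) -> str:
    """Caption one image file, optionally conditioned on a text prompt."""
    # Imported lazily so the helper above works without the heavy dependencies.
    from PIL import Image
    from transformers import BlipForConditionalGeneration, BlipProcessor

    processor = BlipProcessor.from_pretrained(MODEL_ID)
    model = BlipForConditionalGeneration.from_pretrained(MODEL_ID)

    image = Image.open(image_path).convert("RGB")
    inputs = processor(images=image, **processor_kwargs(prompt))
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return processor.decode(output_ids[0], skip_special_tokens=True)
```

Calling `caption("example.jpg")` would request an unconditional caption, while `caption("example.jpg", prompt="a photography of")` would request a caption that continues the given prefix. The file name `example.jpg` is a placeholder, and the first call downloads the model weights.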
Image-to-Text
Transformers
Salesforce
2.8M
688
© 2025 AIbase