Veld Base
Developed by KETI-AIR
Pre-trained visual encoder-text decoder model supporting Korean and English
Downloads: 40
Release Time: 11/2/2022
Model Overview
VELD is a pre-trained vision-language model for image-to-text tasks. It handles both Korean and English.
Model Features
Multilingual Support
Handles vision-language tasks in both Korean and English
Pre-trained Model
Pre-trained on large-scale data, ready for downstream tasks
Vision-Language Understanding
Capable of understanding image content and generating relevant text descriptions
Model Capabilities
Image understanding
Multilingual text generation
Vision-language representation learning
Use Cases
Content Generation
Image Captioning
Automatically generate Korean or English descriptions for images
Multimodal Applications
Visual Question Answering
Answer questions based on image content