Veld Base

Developed by KETI-AIR
Pre-trained visual encoder-text decoder model supporting Korean and English
Downloads 40
Release Time: 11/2/2022

Model Overview

VELD is a pre-trained vision-language model that pairs a visual encoder with a text decoder. It targets image-to-text tasks and supports both Korean and English.

Model Features

Multilingual Support: Handles vision-language processing in both Korean and English
Pre-trained Model: Pre-trained on large-scale data and intended as a starting point for downstream tasks
Vision-Language Understanding: Interprets image content and generates relevant text descriptions

Model Capabilities

Image understanding
Multilingual text generation
Vision-language representation learning

Use Cases

Content Generation
Image Captioning: Automatically generate Korean or English descriptions for images

Multimodal Applications
Visual Question Answering: Answer questions based on image content
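
The image captioning use case can be sketched as a small wrapper that requests one caption per supported language. This is only an illustration of the calling pattern, not VELD's actual API: the `Captioner` signature, `caption_image`, and `placeholder_captioner` are all hypothetical names introduced here, and the placeholder returns a fixed string instead of running a model.

```python
from typing import Callable, Dict, Iterable

# Hypothetical backend signature: (image_path, language_code) -> caption text.
Captioner = Callable[[str, str], str]

# The two languages the model description lists (Korean and English).
SUPPORTED_LANGUAGES = ("ko", "en")


def caption_image(
    image_path: str,
    captioner: Captioner,
    languages: Iterable[str] = SUPPORTED_LANGUAGES,
) -> Dict[str, str]:
    """Return one caption per requested language, keyed by language code."""
    captions: Dict[str, str] = {}
    for lang in languages:
        if lang not in SUPPORTED_LANGUAGES:
            raise ValueError(f"unsupported language: {lang!r}")
        captions[lang] = captioner(image_path, lang)
    return captions


def placeholder_captioner(image_path: str, language: str) -> str:
    # Stand-in for a real model call (assumption): no image is read here.
    return f"[{language}] caption for {image_path}"


if __name__ == "__main__":
    for lang, text in caption_image("cat.jpg", placeholder_captioner).items():
        print(lang, text)
```

In practice, `placeholder_captioner` would be replaced by a function that loads the image and calls the model; keeping the backend behind a plain callable lets the language handling be tested without model weights.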