Open-source Veld - base model, a free image - text intelligent conversion application supporting Korean and English

Veld Base

Developed by KETI-AIR

Pre-trained visual encoder-text decoder model supporting Korean and English

Downloads 40

Release Time : 11/2/2022

Model Overview

VELD is a multilingual vision-language pre-trained model focused on image-to-text conversion tasks, supporting Korean and English processing.

Multilingual Support

Specialized support for Korean and English visual language processing

Pre-trained Model

Pre-trained on large-scale data, ready for downstream tasks

Vision-Language Understanding

Capable of understanding image content and generating relevant text descriptions

Image understanding

Multilingual text generation

Vision-language representation learning

Content Generation

Image Captioning

Automatically generate Korean or English descriptions for images

Multimodal Applications

Visual Question Answering

Answer questions based on image content

Property	Details
Language Support	English, Korean, Multilingual
License	Apache - 2.0
Tags	Vision, Language; Pretrained Model; Image - to - Text
EOS Token

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base