vit-base-patch16-224-turkish-gpt2 Open-source Model - Free Generation of Turkish Image Descriptions

Vit Base Patch16 224 Turkish Gpt2

Developed by atasoglu

This is a vision encoder-decoder model based on ViT and Turkish GPT2 for generating Turkish image descriptions.

Downloads 20

Release Time : 4/6/2024

Model Overview

The model combines Google's ViT image encoder with a Turkish GPT2 text decoder, specifically fine-tuned for Turkish image captioning tasks.

Bilingual Model Architecture

Combines a Vision Transformer encoder with a Turkish GPT2 decoder

Turkish Language Support

Specifically optimized for Turkish image caption generation

End-to-End Image Captioning

Can directly generate coherent Turkish descriptions from images

Image Understanding

Turkish Text Generation

Image Caption Generation

Assistive Technology

Visual Assistance

Generating image descriptions for visually impaired individuals

Provides Turkish descriptions of image content

Content Creation

Social Media Content Generation

Automatically generates Turkish descriptions for uploaded images

Simplifies content creation workflow

Property	Details
Model Type	Vision encoder-decoder model
Training Data	atasoglu/flickr8k-turkish
Metrics	rouge
Library Name	transformers
Pipeline Tag	image-to-text
Tags	image-to-text, image-captioning
Base Model	google/vit-base-patch16-224, ytu-ce-cosmos/turkish-gpt2

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base