Vit Base Patch16 224 Turkish Gpt2 Medium
This is a vision encoder-decoder model based on ViT and Turkish GPT-2 for generating Turkish image captions.
Release Time: 4/6/2024
Model Overview
This model combines a vision encoder (ViT) and a text decoder (Turkish GPT-2), specifically designed to generate Turkish descriptions for images.
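The encoder-decoder pairing described above can typically be loaded through the Hugging Face `transformers` library. The following is a minimal sketch, not the author's documented usage. It assumes the checkpoint is published in the `VisionEncoderDecoderModel` format and that the repository also ships an image-processor config, a tokenizer, and generation settings. The page does not state the Hub repository id, so `model_id` is left as a parameter, and `caption_image` is a hypothetical helper name.

```python
def caption_image(image_path, model_id, max_new_tokens=32):
    """Return a Turkish caption for the image at `image_path`.

    `model_id` is the Hub repository id of the checkpoint; it is not given
    on this page, so the caller has to supply it.
    """
    # Heavy dependencies are imported inside the function so that merely
    # defining it does not require them to be installed.
    import torch
    from PIL import Image
    from transformers import (
        AutoImageProcessor,
        AutoTokenizer,
        VisionEncoderDecoderModel,
    )

    # Assumption: the repository contains all three components.
    model = VisionEncoderDecoderModel.from_pretrained(model_id)
    processor = AutoImageProcessor.from_pretrained(model_id)
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model.eval()

    # The ViT encoder expects RGB input; the processor resizes and normalizes.
    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(images=image, return_tensors="pt").pixel_values

    # The GPT-2 decoder generates tokens while cross-attending to the
    # encoder's image features.
    with torch.no_grad():
        output_ids = model.generate(
            pixel_values=pixel_values, max_new_tokens=max_new_tokens
        )

    return tokenizer.decode(output_ids[0], skip_special_tokens=True).strip()


# Example call (placeholder id, replace with the real repository id):
# print(caption_image("photo.jpg", "<hub-repo-id>"))
```

If the checkpoint does not store a decoder start token or padding token in its config, `generate` may need those passed explicitly; that detail depends on how the model was exported.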
Model Features
Turkish Image Captioning
Image captioning capability specifically optimized for Turkish.
Vision-Language Model
Multimodal architecture combining a vision encoder and a language decoder.
Fine-tuned Pre-trained Models
Fine-tuned from pre-trained ViT and Turkish GPT-2 models.
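As a rough illustration of what the decoder attends over, the sketch below computes the encoder's output sequence length. It assumes the name "Patch16 224" follows the usual ViT convention (224x224 input, 16x16 patches) and that the encoder prepends a single classification token, as standard ViT checkpoints do. The function name `vit_sequence_length` is hypothetical.

```python
def vit_sequence_length(image_size: int = 224, patch_size: int = 16) -> int:
    """Number of encoder output positions: one per patch plus one [CLS] token."""
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    patches_per_side = image_size // patch_size  # 224 // 16 = 14
    return patches_per_side ** 2 + 1             # 14 * 14 + 1 = 197


print(vit_sequence_length())  # 197
```

Under these assumptions, each generated caption token would cross-attend to 197 image feature vectors.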
Model Capabilities
Image caption generation
Turkish text generation
Visual content understanding
Use Cases
Assistive Technology
Visual Assistance
Providing Turkish descriptions of image content for visually impaired individuals.
Content Creation
Social Media Content Generation
Automatically generating Turkish descriptions for uploaded images.