V

Vit Base Patch16 224 Turkish Gpt2 Medium

Developed by atasoglu
This is a vision encoder-decoder model based on ViT and Turkish GPT-2 for generating Turkish image captions.
Downloads 14
Release Time : 4/6/2024

Model Overview

This model combines a vision encoder (ViT) and a text decoder (Turkish GPT-2), specifically designed to generate Turkish descriptions for images.

Model Features

Turkish Image Captioning
Image captioning capability specifically optimized for Turkish.
Vision-Language Model
Multimodal architecture combining vision encoder and language decoder.
Fine-tuned Pre-trained Models
Fine-tuned based on pre-trained ViT and Turkish GPT-2 models.

Model Capabilities

Image caption generation
Turkish text generation
Visual content understanding

Use Cases

Assistive Technology
Visual Assistance
Providing Turkish descriptions of image content for visually impaired individuals
Content Creation
Social Media Content Generation
Automatically generating Turkish descriptions for uploaded images
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase