ShareCaptioner Open-source Image Captioning Generation Model - Free Deployment for High-quality Image Captions

Sharecaptioner

Developed by Lin-Chen

ShareCaptioner is an open-source image description generation model. It is based on the improved InternLM-Xcomposer-7B base model and fine-tuned on the ShareGPT4V dataset assisted by GPT4-Vision. It can generate high-quality image descriptions.

Image-to-Text

Transformers

#GPT4-Vision Fine-tuning #High-precision Image Description #448x448 Resolution

Downloads 401

Release Time : 12/13/2023

Model Overview

ShareCaptioner is an open-source model focused on generating high-quality image descriptions, providing support for the fields of computer vision and natural language processing.

Model Features

High-quality Image Description

Can generate detailed and accurate image descriptions, with quality approaching that of GPT4-Vision

448x448 High-resolution Support

Supports processing image inputs with a resolution of 448x448

Open-source and Fine-tunable

The model is completely open-source and supports users to further fine-tune it to meet specific needs

Model Capabilities

Image Understanding

Natural Language Generation

Multimodal Processing

Use Cases

Computer Vision

Automatic Image Annotation

Generate detailed descriptive labels for image datasets

Improve the efficiency and quality of dataset annotation

Assistive Technology

Visual Impairment Assistance

Provide image content descriptions for visually impaired users

Enhance the barrier-free access experience

Property	Details
Model Type	ShareCaptioner is an open - source captioner fine - tuned on GPT4 - Vision - assisted [ShareGPT4V](https://huggingface.co/datasets/Lin - Chen/ShareGPT4V) detailed caption data with a resolution of 448x448. It is based on the improved [InternLM - Xcomposer - 7B](https://github.com/InternLM/InternLM - XComposer) base model.
Model Date	ShareCaptioner was trained in Nov 2023.
Paper or Resources for More Information	[Project] [Paper] [Code]

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Sharecaptioner

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 ShareCaptioner Model Card

🚀 Quick Start

✨ Features

📚 Documentation

Model Details

Intended Use

Finetuning Dataset

📄 License