Otpensource Vision

Developed by hateslopacademy
A vision-language model based on Bllossom/llama-3.2-Korean-Bllossom-AICA-5B. It supports Korean and English and specializes in image-to-text and text classification tasks in the fashion domain.
Downloads 14
Release Time: 1/25/2025

Model Overview

otpensource-vision is a multimodal model that combines vision and language capabilities. It analyzes fashion elements in images and generates structured text descriptions, and it also handles text-only natural language processing tasks.

Model Features

Multilingual Visual Understanding
Processes visual-language input in Korean and English and extracts fashion-related information from images.
Fashion Domain Optimization
Trained on professional fashion datasets; strong at analyzing fashion elements such as clothing category, color, and season.
Structured Output
Generates structured output in JSON format, which simplifies system integration and downstream processing.
Business-Friendly License
Released under the CC-BY-4.0 license, which permits commercial use.
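
As an illustration of the Structured Output feature, the following minimal sketch shows how a downstream system might pull a JSON object out of the model's text response and check it before use. The key names (`category`, `color`, `season`), the helper `parse_fashion_json`, and the sample string are all assumptions made for this example; the model's actual output schema is not documented here.

```python
import json

# Hypothetical field names. The source only states that the model describes
# clothing categories, colors, and seasons; the real keys may differ.
REQUIRED_KEYS = ("category", "color", "season")


def parse_fashion_json(raw_text):
    """Extract the first JSON object from a text response and check its keys."""
    start = raw_text.find("{")
    if start == -1:
        raise ValueError("no JSON object found in model output")
    # raw_decode parses one JSON value and ignores any text that follows it.
    record, _ = json.JSONDecoder().raw_decode(raw_text[start:])
    missing = [key for key in REQUIRED_KEYS if key not in record]
    if missing:
        raise ValueError(f"missing keys: {missing}")
    return record


# Made-up response text, used only to exercise the parser.
sample_output = 'Result: {"category": "trench coat", "color": "beige", "season": "fall"} done'
record = parse_fashion_json(sample_output)
print(record["category"], record["color"], record["season"])
```

Validating the keys up front means a malformed or partial response fails loudly instead of propagating into a product database.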

Model Capabilities

Image-to-Text
Fashion Element Analysis
Multilingual Text Generation
Sentiment Analysis
Text Classification

Use Cases

E-Commerce
Product Auto-Tagging
Analyzes product images automatically and generates structured descriptions covering category, color, and other attributes.
Product information can be generated in JSON format.
Fashion Recommendation System
Recommends style-matching fashion items to users based on visual analysis.
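
One simple way to turn image-derived tags into recommendations is to rank catalog items by tag overlap. The sketch below uses Jaccard similarity, which is a technique chosen for this example and not something the model card describes. The functions `jaccard` and `recommend`, the tag sets, and the catalog entries are all hypothetical.

```python
def jaccard(a, b):
    """Jaccard similarity of two tag collections: |A & B| / |A | B|."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def recommend(query_tags, catalog, top_k=2):
    """Return up to top_k catalog item names ranked by tag overlap."""
    scored = [(jaccard(query_tags, tags), name) for name, tags in catalog.items()]
    # Highest score first; ties are broken alphabetically for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [name for score, name in scored[:top_k] if score > 0]


# Made-up catalog; in practice the tags would come from the model's analysis.
catalog = {
    "wool coat": {"outerwear", "beige", "winter"},
    "linen shirt": {"top", "white", "summer"},
    "trench coat": {"outerwear", "beige", "fall"},
}
print(recommend({"outerwear", "beige", "fall"}, catalog))
```

Tag overlap ignores visual nuance such as cut or texture, so a production system would likely combine it with other signals.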
Content Generation
Social Media Content Creation
Generates descriptive text from fashion images automatically.