Clothes Image Detection
Vision Transformer (ViT)-based clothing image classification model with approximately 78% accuracy
Downloads 412
Release Time : 2/8/2025
Model Overview
This model uses the ViT architecture to classify clothing images and can recognize 15 common clothing categories, including coats, dresses, jeans, etc.
Model Features
High-precision Classification
Achieves 78% accuracy in 15-category clothing classification tasks
ViT Architecture
Based on the Vision Transformer architecture, utilizing self-attention mechanisms for image processing
Multi-category Recognition
Can recognize 15 common clothing categories
Model Capabilities
Clothing Image Classification
Visual Feature Extraction
Multi-category Recognition
Use Cases
E-commerce
Product Auto-classification
Automatically classify clothing product images for e-commerce platforms
Approximately 78% accuracy
Fashion Analysis
Clothing Style Recognition
Identify clothing styles and categories in images
F1 score 0.78 (weighted average)
Featured Recommended AI Models
Š 2025AIbase