C

CLIP ViT L 14 Spectrum Icons 20k

Developed by JianLiao
A vision-language model fine-tuned based on CLIP ViT-L/14, optimized for abstract image-text retrieval tasks
Downloads 1,576
Release Time : 1/5/2025

Model Overview

This model is fine-tuned on 23,000 abstract image-text pairs, enhancing text-to-image and image-to-text retrieval performance, particularly suitable for handling abstract visual features

Model Features

Abstract Visual Feature Understanding
Enhanced understanding of abstract icons and symbols through fine-tuning on a dedicated dataset
Efficient Retrieval Capability
Achieves R@1 of 70% and R@5 over 96% in bidirectional image-text retrieval tasks
Domain Adaptability
Optimized performance in specific domains while maintaining the generalization capability of the base model

Model Capabilities

Zero-shot image classification
Text-to-image retrieval
Image-to-text retrieval
Abstract visual feature matching

Use Cases

Information Retrieval
Icon Library Search
Retrieve matching icon images through natural language descriptions
R@1 accuracy approximately 70%
Content Management
Automatic Image Tagging
Generate descriptive text labels for abstract icons
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase