V

Vit SO400M 16 SigLIP I18n 256

Developed by timm
A SigLIP (Sigmoid Loss for Language-Image Pre-training) model trained on the multilingual WebLI dataset, supporting multilingual image classification tasks.
Downloads 82
Release Time : 10/9/2024

Model Overview

This model is a contrastive image-text model based on the SigLIP architecture, specifically designed for zero-shot image classification tasks with multilingual support.

Model Features

Multilingual Support
Trained with a multilingual tokenizer, supporting image classification tasks in multiple languages.
Sigmoid Loss Function
Utilizes Sigmoid loss for language-image pre-training, enhancing the model's classification performance.
Zero-shot Classification Capability
Capable of classifying images into new categories without specific training.

Model Capabilities

Zero-shot Image Classification
Multilingual Text Understanding
Image Feature Extraction

Use Cases

Image Classification
Multilingual Image Labeling
Classify images using multilingual text labels
Accurately identifies image content and matches multilingual labels
Cross-language Image Search
Search for related images using queries in different languages
Enables cross-language image retrieval capabilities
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase