Model Selection

Image Feature Extraction

# Image Feature Extraction

Openvision Vit Base Patch8 160

OpenVision-ViT-Tiny is a fully open, cost-effective advanced visual encoder, part of the OpenVision family, focusing on multimodal learning.

Image Classification

Openvision Vit Small Patch8 384

OpenVision is a fully open, cost-effective family of advanced vision encoders focused on multimodal learning.

Multimodal Fusion

Openvision Vit Small Patch16 224

OpenVision is a fully open, cost-effective family of advanced vision encoders focused on multimodal learning.

Image Enhancement

Openvision Vit Tiny Patch16 160

OpenVision is a fully open, cost-effective advanced visual encoder family focused on multimodal learning.

Multimodal Fusion

Sam2 Hiera Tiny.fb R896 2pt1

SAM2 model based on the HieraDet image encoder, focusing on image feature extraction tasks.

Object Detection

Sam2 Hiera Small.fb R896

SAM2 model based on the HieraDet image encoder, focused on image feature extraction tasks.

Image Segmentation

Sam2 Hiera Base Plus.fb R896 2pt1

SAM2 model weights based on HieraDet image encoder, focused on image feature extraction tasks

Image Segmentation

Sam2 Hiera Base Plus.fb R896

SAM2 model based on the HieraDet image encoder, focused on image feature extraction tasks.

Image Segmentation

Mambavision T2 1K

The first hybrid computer vision model combining the strengths of Mamba and Transformer, enhancing visual feature modeling through redesigned Mamba formulations and incorporating self-attention modules in the Mamba architecture to improve long-range spatial dependency modeling.

Image Classification

Vit Base Patch16 224.orig In21k

An image classification model based on Vision Transformer, pretrained on ImageNet-21k, suitable for feature extraction and fine-tuning

Image Classification

Eva02 Small Patch14 224.mim In22k

EVA02 feature/representation model, pretrained on ImageNet-22k via masked image modeling, suitable for image classification and feature extraction tasks.

Image Classification

Eva02 Base Patch14 224.mim In22k

EVA02 base version visual representation model, pre-trained on ImageNet-22k through masked image modeling, suitable for image classification and feature extraction tasks.

Image Classification

Face Discriminator 2

A face classification model fine-tuned based on ResNet-50, achieving an accuracy of 94.16% on the evaluation set

Image Classification

Google Vit Base Patch16 224 Cartoon Face Recognition

A cartoon face recognition model fine-tuned based on the Google Vision Transformer (ViT) architecture, excelling in image classification tasks

Vit Small Patch8 224.dino

Self-supervised image feature extraction model based on Vision Transformer (ViT), trained using the DINO method

Image Classification

Vit Large Patch32 224.orig In21k

An image classification model based on Vision Transformer (ViT) architecture, pretrained on the ImageNet-21k dataset, suitable for feature extraction and fine-tuning scenarios.

Image Classification

Vit Base Patch16 224.dino

A Vision Transformer (ViT) image feature model trained with self-supervised DINO method, suitable for image classification and feature extraction tasks.

Image Classification

Vit Base Patch8 224.dino

A vision Transformer (ViT) image feature model trained with the self-supervised DINO method, suitable for image classification and feature extraction tasks.

Image Classification

A ResNet-50 model pre-trained using the DINO self-supervised learning method, suitable for visual feature extraction tasks

Image Classification

RegNet is an image classification model designed through neural architecture search, trained on the ImageNet-1k dataset.

Image Classification

RegNet model trained on imagenet-1k, an efficient vision model designed via neural architecture search

Image Classification

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase