Model Selection

Lightweight vision model

# Lightweight vision model

Vintern 3B Beta GGUF

Vintern-3B-beta is a multilingual foundation model that supports English, Vietnamese, and Chinese, and is mainly used for image-text to text conversion tasks.

Transformers Supports Multiple Languages

Aimv2 Large Patch14 336.apple Pt Dist

AIM-v2 is an efficient image encoder implemented based on the timm library, suitable for various computer vision tasks.

Image Classification

YOLOS is an object detection model based on Vision Transformer (ViT), trained with DETR loss, and performs excellently on the COCO dataset.

Object Detection

Swin Tiny Patch4 Window7 224 Cifar10

A tiny model based on Swin Transformer architecture, specifically fine-tuned for CIFAR-10 image classification tasks

Image Classification

Deit Tiny Patch16 224 Finetuned Main Gpu 20e Final

Lightweight image classification model based on DeiT-tiny architecture, achieving 98.56% validation accuracy after fine-tuning on a custom image dataset

Image Classification

Autotrain Pick A Card 3726099222

This is a multi-category image classification model trained via AutoTrain, demonstrating outstanding performance on the validation set with an accuracy of 90.9%.

Image Classification

Autotrain Weather Classification 3723199089

This is a multi-class image classification model trained via AutoTrain, specifically designed for weather classification tasks.

Image Classification

Swin Tiny Patch4 Window7 224 Finetuned Ai Not

Fine-tuned model based on Swin Transformer architecture for AI-generated content detection tasks

Image Classification

A three-class image classification model trained with AutoTrain, achieving 95% accuracy on the validation set

Image Classification

Swin Tiny Patch4 Window7 224 Finetuned Aiornot Baseline

A vision model based on the Swin Transformer Tiny architecture, fine-tuned on an unknown dataset for image classification tasks

Image Classification

Swin Tiny Patch4 Window7 224 Finetuned Fluro Cls

Fine-tuned model based on Swin Transformer Tiny architecture for image classification tasks

Image Classification

Swin Tiny Patch4 Window7 224 Finetuned Woody LeftGR Clean 130epochs

An image classification model based on the Swin Transformer Tiny architecture, fine-tuned on a custom image dataset for 130 epochs, with an accuracy of 90.23%.

Image Classification

Autotrain Cat Vs Dogs 1858163503

This is a binary classification model trained using AutoTrain, specifically designed to distinguish between images of cats and dogs.

Image Classification

Vit Small Patch16 224

ViT-tiny model converted from timm codebase, suitable for image classification tasks

Image Classification

Vit Tiny Patch16 224

ViT-Tiny model converted from the timm repository, suitable for image classification tasks, with usage consistent with the ViT-base model

Image Classification

Visual Transformer Chihuahua Cookies

An image classification model based on the Vision Transformer architecture, specifically designed to distinguish between images of Chihuahuas and cookies

Image Classification

peterbonnesoeur

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase