Model Selection

High-Precision Image Classification

# High-Precision Image Classification

Mambavision B 21K

The first hybrid computer vision model combining the strengths of Mamba and Transformer, enhancing visual feature modeling efficiency through reconstructed Mamba formulas and introducing self-attention modules at the end of the Mamba architecture to improve long-range spatial dependency modeling.

Image Classification

Smart Tv Hand Gestures Image Detection

A smart TV gesture recognition model based on the Vision Transformer architecture, capable of accurately classifying 9 common gestures.

Image Classification

Aimv2 3B Patch14 448

AIMv2 is a series of vision models pretrained with multimodal autoregressive objectives, demonstrating excellent performance across multiple visual understanding benchmarks.

Image Classification

Aimv2 1B Patch14 448

AIMv2 is a series of vision models pretrained with multimodal autoregressive objectives, achieving outstanding performance across multiple vision understanding benchmarks.

Image Classification

Aimv2 Huge Patch14 448

AIMv2 is a series of vision models pretrained with multimodal autoregressive objectives, demonstrating excellent performance across multiple benchmarks.

Image Classification

Aimv2 Large Patch14 448

AIMv2 is a series of vision models based on multimodal autoregressive objective pretraining, excelling in multiple benchmarks

Image Classification

Aimv2 3B Patch14 336

AIMv2 is a series of vision models pretrained with multimodal autoregressive objectives, achieving excellent performance across multiple multimodal understanding benchmarks.

Image Classification

Aimv2 1B Patch14 336

AIMv2 is a series of vision models pretrained with multimodal autoregressive objectives, achieving outstanding performance in multiple multimodal understanding benchmarks.

Image Classification

Aimv2 Huge Patch14 336

AIMv2 is a series of vision models pretrained with multimodal autoregressive objectives, achieving outstanding performance across multiple visual understanding benchmarks.

Image Classification

Aimv2 Large Patch14 336

AIMv2 is a series of vision models based on multimodal autoregressive objective pretraining, excelling in various vision tasks.

Image Classification

Aimv2 3B Patch14 224

AIMv2 is a series of vision models pretrained with multimodal autoregressive objectives, achieving outstanding performance in multiple benchmarks

Image Classification

Aimv2 1B Patch14 224

AIMv2 is a series of vision models pretrained with multimodal autoregressive objectives, excelling in various vision tasks.

Image Classification

Aimv2 Large Patch14 224

AIMv2 is a series of vision models pretrained with multimodal autoregressive objectives, excelling in various vision tasks.

Image Classification

An image classification model generated based on HuggingPics, capable of recognizing world-famous landmarks

Image Classification

Efficientnet B4

EfficientNet is a mobile-friendly pure convolutional model that uniformly scales depth, width, and resolution dimensions, trained on the ImageNet-1k dataset.

Image Classification

Autotrain Furryornot 2093267379

This is a binary classification model trained with AutoTrain, designed to determine whether objects in images are furry.

Image Classification

Pond Image Classification 9

This is an image classification model built on PyTorch and HuggingPics, specifically designed for pond scene classification.

Image Classification

CarViT is a car logo classifier based on the Vision Transformer architecture, capable of recognizing logos from 40 different car manufacturers.

Image Classification

Architectural Styles

This model is designed to identify five prevalent architectural styles from the early to mid-20th century, trained on a database of over 700 images.

Image Classification

gatecitypreservation

Cricket Baseball Smrn

This is an image classification model based on the PyTorch framework, capable of accurately distinguishing between cricket and baseball images.

Image Classification

ResNet50 v1.5 is an improved version of the original ResNet50 v1 model, achieving approximately 0.5% higher top1 accuracy by adjusting convolution strides.

Image Classification

Convnext Base 384 22k 1k

ConvNeXT is a pure convolutional model inspired by vision Transformer designs, pretrained on ImageNet-22k and fine-tuned on ImageNet-1k, outperforming Transformers.

Image Classification

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase