# High-Precision Image Classification

Mambavision B 21K
Other
The first hybrid computer vision model combining the strengths of Mamba and Transformer, enhancing visual feature modeling efficiency through reconstructed Mamba formulas and introducing self-attention modules at the end of the Mamba architecture to improve long-range spatial dependency modeling.
Image Classification Transformers
M
nvidia
1,395
4
Smart Tv Hand Gestures Image Detection
Apache-2.0
A smart TV gesture recognition model based on the Vision Transformer architecture, capable of accurately classifying 9 common gestures.
Image Classification Transformers
S
dima806
65
1
Aimv2 3B Patch14 448
AIMv2 is a series of vision models pretrained with multimodal autoregressive objectives, demonstrating excellent performance across multiple visual understanding benchmarks.
Image Classification
A
apple
161
12
Aimv2 1B Patch14 448
AIMv2 is a series of vision models pretrained with multimodal autoregressive objectives, achieving outstanding performance across multiple vision understanding benchmarks.
Image Classification
A
apple
71
0
Aimv2 Huge Patch14 448
AIMv2 is a series of vision models pretrained with multimodal autoregressive objectives, demonstrating excellent performance across multiple benchmarks.
Image Classification
A
apple
1,672
3
Aimv2 Large Patch14 448
AIMv2 is a series of vision models based on multimodal autoregressive objective pretraining, excelling in multiple benchmarks
Image Classification
A
apple
2,210
5
Aimv2 3B Patch14 336
AIMv2 is a series of vision models pretrained with multimodal autoregressive objectives, achieving excellent performance across multiple multimodal understanding benchmarks.
Image Classification
A
apple
23
2
Aimv2 1B Patch14 336
AIMv2 is a series of vision models pretrained with multimodal autoregressive objectives, achieving outstanding performance in multiple multimodal understanding benchmarks.
Image Classification
A
apple
52
0
Aimv2 Huge Patch14 336
AIMv2 is a series of vision models pretrained with multimodal autoregressive objectives, achieving outstanding performance across multiple visual understanding benchmarks.
Image Classification
A
apple
188
0
Aimv2 Large Patch14 336
AIMv2 is a series of vision models based on multimodal autoregressive objective pretraining, excelling in various vision tasks.
Image Classification
A
apple
6,177
3
Aimv2 3B Patch14 224
AIMv2 is a series of vision models pretrained with multimodal autoregressive objectives, achieving outstanding performance in multiple benchmarks
Image Classification
A
apple
57
3
Aimv2 1B Patch14 224
AIMv2 is a series of vision models pretrained with multimodal autoregressive objectives, excelling in various vision tasks.
Image Classification
A
apple
299
7
Aimv2 Large Patch14 224
AIMv2 is a series of vision models pretrained with multimodal autoregressive objectives, excelling in various vision tasks.
Image Classification
A
apple
759
50
Maravillas
An image classification model generated based on HuggingPics, capable of recognizing world-famous landmarks
Image Classification Transformers
M
Chasty
17
0
Efficientnet B4
Apache-2.0
EfficientNet is a mobile-friendly pure convolutional model that uniformly scales depth, width, and resolution dimensions, trained on the ImageNet-1k dataset.
Image Classification Transformers
E
google
5,528
1
Autotrain Furryornot 2093267379
This is a binary classification model trained with AutoTrain, designed to determine whether objects in images are furry.
Image Classification Transformers
A
micole66
20
0
Pond Image Classification 9
This is an image classification model built on PyTorch and HuggingPics, specifically designed for pond scene classification.
Image Classification Transformers
P
SummerChiam
28
0
Carvit
CarViT is a car logo classifier based on the Vision Transformer architecture, capable of recognizing logos from 40 different car manufacturers.
Image Classification Transformers
C
abdusah
73
1
Architectural Styles
This model is designed to identify five prevalent architectural styles from the early to mid-20th century, trained on a database of over 700 images.
Image Classification Transformers
A
gatecitypreservation
38
5
Cricket Baseball Smrn
This is an image classification model based on the PyTorch framework, capable of accurately distinguishing between cricket and baseball images.
Image Classification Transformers
C
smaranjitghose
31
0
Test Model
Apache-2.0
ResNet50 v1.5 is an improved version of the original ResNet50 v1 model, achieving approximately 0.5% higher top1 accuracy by adjusting convolution strides.
Image Classification Transformers
T
mchochowski
18
0
Convnext Base 384 22k 1k
Apache-2.0
ConvNeXT is a pure convolutional model inspired by vision Transformer designs, pretrained on ImageNet-22k and fine-tuned on ImageNet-1k, outperforming Transformers.
Image Classification Transformers
C
facebook
797
3
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase