V

Vit Base 224 In21k Ft Cifar10

Developed by edumunozsala
A Spanish-language image classification model based on the Vision Transformer architecture, fine-tuned on the CIFAR-10 dataset with an accuracy of 97%.
Downloads 16
Release Time : 6/11/2022

Model Overview

This model was trained using Amazon SageMaker and Hugging Face deep learning containers. The base model is Vision Transformer (base size), pre-trained on the ImageNet-21k dataset and fine-tuned on the CIFAR-10 dataset for image classification tasks.

Model Features

High Accuracy
Achieves 97% accuracy on the CIFAR-10 dataset.
Based on Vision Transformer
Uses Transformer encoder architecture, suitable for image classification tasks.
Fine-tuned Model
Pre-trained on ImageNet-21k and fine-tuned on the CIFAR-10 dataset.

Model Capabilities

Image classification
High-precision recognition

Use Cases

Image recognition
CIFAR-10 Image Classification
Used for classifying images in the CIFAR-10 dataset.
97% accuracy
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase