H

Hiera Base 224 In1k Hf

Developed by facebook
Hiera is a hierarchical vision Transformer model that is fast, powerful, and concise. It surpasses state-of-the-art performance in a wide range of image and video tasks while significantly improving runtime speed.
Downloads 188
Release Time : 5/12/2024

Model Overview

Hiera is a streamlined hierarchical vision Transformer optimized for image classification tasks, achieving high efficiency through simplified architecture and MAE training methods.

Model Features

Efficient Hierarchical Design
Adopts a hierarchical structure that reduces feature count in early layers and spatial resolution in later layers, significantly improving runtime efficiency.
Simplified Architecture
Removes redundant modules found in traditional vision Transformers and teaches the model spatial biases through MAE training, maintaining a clean architecture.
High Performance
Achieves breakthroughs in multiple image and video recognition tasks, surpassing state-of-the-art accuracy.

Model Capabilities

Image Classification
Feature Extraction
Masked Image Modeling

Use Cases

Computer Vision
Image Classification
Classifies and identifies input images
Example output: 'Tabby cat'
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase