H

Hiera Huge 224 Hf

Developed by facebook
Hiera is an efficient hierarchical vision Transformer model that excels in image and video tasks with fast runtime
Downloads 41
Release Time : 5/12/2024

Model Overview

Hiera is a hierarchical vision Transformer model designed for simplicity and efficiency. It simplifies redundant modules in traditional vision Transformers through MAE training and surpasses existing technologies in multiple image and video recognition tasks

Model Features

Hierarchical Design
Adopts a hierarchical architecture, reducing feature quantity in early layers and spatial resolution in deep layers to improve efficiency
Simplified Architecture
Simplifies or removes redundant modules in traditional Transformers through MAE training, maintaining high efficiency
High Performance
Outperforms existing technologies in multiple image and video recognition tasks while significantly improving runtime speed

Model Capabilities

Image classification
Feature extraction
Masked image modeling

Use Cases

Computer Vision
Image Classification
Classifies and identifies image content
Performs excellently on benchmarks like ImageNet-1K
Feature Extraction
Extracts multi-level feature representations from images
Can be used for transfer learning in downstream vision tasks
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase