R

RADIO

Developed by nvidia
A visual feature extraction model developed by NVIDIA that converts images into embedding vectors for downstream tasks
Downloads 5,166
Release Time : 12/11/2023

Model Overview

An image feature extraction model based on Vision Transformer architecture, supporting flexible input resolutions, with generated embeddings suitable for computer vision tasks such as image classification and semantic segmentation

Model Features

Flexible input resolution
Supports input resolutions up to 2048x2048 (in 16-pixel increments), adapting to various application scenarios
Dual output features
Simultaneously outputs global features (summary) and local spatial features (spatial_features) to meet different task requirements
Large-scale pre-training
Pre-trained on the DataComp dataset with 128 billion internet images, possessing powerful feature extraction capabilities

Model Capabilities

Image feature extraction
Image classification
Semantic segmentation
Visual embedding generation

Use Cases

Computer Vision
Image classification
Using RADIO-extracted image embeddings as input for downstream classifiers
Semantic segmentation
Utilizing RADIO's spatial features for dense prediction tasks
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase