Model Selection

Attention Distillation

# Attention Distillation

X2I is a multimodal diffusion Transformer model capable of converting various input modalities (text, images, videos, audio, speech) into image outputs.

Text-to-Image Other

Deit Small Patch16 224

DeiT is a more efficiently trained Vision Transformer model, pre-trained and fine-tuned on the ImageNet-1k dataset at 224x224 resolution, suitable for image classification tasks.

Image Classification

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase