Model Selection

Multimodal Adaptation

# Multimodal Adaptation

Sam2 Hiera Small.fb R896

SAM2 model based on the HieraDet image encoder, focused on image feature extraction tasks.

Image Segmentation

Resnet101 Clip.yfcc15m

CLIP-style dual-modal model trained on YFCC-15M dataset, compatible with both open_clip and timm frameworks

Image Classification

Mambavision B 1K

PAVE is a model focused on repairing and adapting video large language models, aiming to enhance the conversion capability between video and text.

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase