# Low video memory optimization

SmolVLM Instruct GGUF

SmolVLM is a compact open-source multimodal model that accepts image and text inputs and generates text outputs. It is designed for efficiency and is well suited to on-device applications.

Tags: Transformers, English

Llama Joycaption Beta One Hf Llava GGUF

A free, open image-captioning vision-language model (VLM) for the community. It can be used in training diffusion models and supports diverse image styles and content.

© 2025 AIbase