U

UI TARS 1.5 7B 4bit

Developed by mlx-community
UI-TARS-1.5-7B-4bit is a multimodal model focused on image-text-to-text conversion tasks, supporting the English language.
Downloads 184
Release Time : 4/25/2025

Model Overview

This model is converted from ByteDance-Seed/UI-TARS-1.5-7B to the MLX format, primarily designed for interactive tasks between images and text.

Model Features

Multimodal Support
Capable of handling interactive tasks between images and text.
MLX Format
Converted to MLX format for easier execution in specific environments.
4-bit Quantization
The model is 4-bit quantized to reduce resource consumption.

Model Capabilities

Image-Text Generation
Multimodal Interaction

Use Cases

Image Captioning
Image Content Description
Generate detailed textual descriptions based on input images.
Multimodal Interaction
Image-based Q&A
Answer related questions based on image content.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase