UI TARS 1.5 7B 4bit
U
UI TARS 1.5 7B 4bit
Developed by mlx-community
UI-TARS-1.5-7B-4bit is a multimodal model focused on image-text-to-text conversion tasks, supporting the English language.
Downloads 184
Release Time : 4/25/2025
Model Overview
This model is converted from ByteDance-Seed/UI-TARS-1.5-7B to the MLX format, primarily designed for interactive tasks between images and text.
Model Features
Multimodal Support
Capable of handling interactive tasks between images and text.
MLX Format
Converted to MLX format for easier execution in specific environments.
4-bit Quantization
The model is 4-bit quantized to reduce resource consumption.
Model Capabilities
Image-Text Generation
Multimodal Interaction
Use Cases
Image Captioning
Image Content Description
Generate detailed textual descriptions based on input images.
Multimodal Interaction
Image-based Q&A
Answer related questions based on image content.
Featured Recommended AI Models