E

Erax VL 2B V1.5 GGUF

Developed by mradermacher
EraX-VL-2B-V1.5 is a multimodal model supporting Vietnamese, English, and Chinese, with capabilities for image-to-text and image-text-to-text conversion.
Downloads 95
Release Time : 12/29/2024

Model Overview

EraX-VL-2B-V1.5 is a multimodal model based on the transformers library, primarily used for image-to-text and image-text-to-text tasks, supporting Vietnamese, English, and Chinese.

Model Features

Multimodal Support
Supports joint processing of images and text, capable of converting image content into textual descriptions.
Multilingual Support
Supports processing in three languages: Vietnamese, English, and Chinese.
Diverse Quantization Versions
Offers multiple quantized versions suitable for different hardware and performance needs.

Model Capabilities

Image-to-text
Image-text-to-text
Multilingual Processing
Optical Character Recognition

Use Cases

Insurance
Insurance Document Processing
Automatically identifies and converts image and text content in insurance documents.
Optical Character Recognition
Document OCR
Converts images in scanned documents into editable text.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase