Erax VL 2B V1.5 GGUF
EraX-VL-2B-V1.5 is a multimodal model supporting Vietnamese, English, and Chinese, with capabilities for image-to-text and image-text-to-text conversion.
Downloads 95
Release Time : 12/29/2024
Model Overview
EraX-VL-2B-V1.5 is a multimodal model based on the transformers library, primarily used for image-to-text and image-text-to-text tasks, supporting Vietnamese, English, and Chinese.
Model Features
Multimodal Support
Supports joint processing of images and text, capable of converting image content into textual descriptions.
Multilingual Support
Supports processing in three languages: Vietnamese, English, and Chinese.
Diverse Quantization Versions
Offers multiple quantized versions suitable for different hardware and performance needs.
Model Capabilities
Image-to-text
Image-text-to-text
Multilingual Processing
Optical Character Recognition
Use Cases
Insurance
Insurance Document Processing
Automatically identifies and converts image and text content in insurance documents.
Optical Character Recognition
Document OCR
Converts images in scanned documents into editable text.
Featured Recommended AI Models
Š 2025AIbase