EraX-VL-2B-V1.5-Q4_K_M-GGUF Open-source Multimodal Model - Supports Trilingual Visual Question Answering

Erax VL 2B V1.5 Q4 K M GGUF

Developed by Ngoac

This is a multimodal visual question answering model supporting Vietnamese, English, and Chinese, converted to GGUF format based on erax-ai/EraX-VL-2B-V1.5.

Text-to-Image Supports Multiple LanguagesOpen Source License:Apache-2.0 #Multimodal Visual Question Answering #Vietnamese OCR Support #Insurance Document Parsing

Downloads 55

Release Time : 1/2/2025

Model Overview

This model is a visual question answering (VQA) model capable of processing image and text inputs to generate relevant answers. It is particularly suitable for scenarios such as insurance and optical character recognition (OCR).

Model Features

Multilingual Support

Supports visual question answering tasks in three languages: Vietnamese, English, and Chinese.

GGUF Format Optimization

Converted to GGUF format for efficient operation on tools like llama.cpp.

Multimodal Capability

Capable of processing both image and text inputs for cross-modal understanding.

Industry Application Optimization

Specifically optimized for applications such as insurance and OCR.

Model Capabilities

Visual Question Answering

Image Understanding

Multilingual Processing

Text Generation

Use Cases

Insurance

Insurance Document Processing

Automatically identify and analyze information in insurance documents.

Healthcare

Prescription Recognition

Recognize text and content in medical prescriptions.

Property	Details
Model Type	Converted to GGUF format from `erax-ai/EraX-VL-2B-V1.5`
Training Data	Not specified in the provided document
Supported Languages	Vietnamese, English, Chinese
Pipeline Tag	Visual - Question - Answering

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Erax VL 2B V1.5 Q4 K M GGUF

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 Ngoac/EraX-VL-2B-V1.5-Q4_K_M-GGUF

🚀 Quick Start

✨ Features

📦 Installation

💻 Usage Examples

Basic Usage

CLI:

Server:

Advanced Usage

Step 1: Clone llama.cpp from GitHub.

Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL = 1` flag along with other hardware - specific flags (for ex: `LLAMA_CUDA = 1` for Nvidia GPUs on Linux).

Step 3: Run inference through the main binary.

📄 License

Erax VL 2B V1.5 Q4 K M GGUF

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 Ngoac/EraX-VL-2B-V1.5-Q4_K_M-GGUF

🚀 Quick Start

✨ Features

📦 Installation

💻 Usage Examples

Basic Usage

CLI:

Server:

Advanced Usage

Step 1: Clone llama.cpp from GitHub.

Step 2: Move into the llama.cpp folder and build it with LLAMA_CURL = 1 flag along with other hardware - specific flags (for ex: LLAMA_CUDA = 1 for Nvidia GPUs on Linux).

Step 3: Run inference through the main binary.

📄 License

Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL = 1` flag along with other hardware - specific flags (for ex: `LLAMA_CUDA = 1` for Nvidia GPUs on Linux).