EraX-VL-7B-V1.5-GGUF Open-Source Multimodal Model - Supports Multiple Languages to Solve Insurance and OCR Tasks

EraX-VL-7B-V1.5-GGUF

Quantized by mradermacher (base model by erax-ai)

Quantized version of EraX-VL-7B-V1.5, supporting Vietnamese, English, and Chinese, suitable for tasks like insurance and OCR.

Image-to-Text | Supports Multiple Languages | Open Source License: Apache-2.0
#Multimodal OCR #Vietnamese Insurance Analysis #Visual Text Generation

Downloads 190

Release Time: 1/1/2025

Model Overview

This is a multimodal model supporting tasks such as image-to-text conversion and optical character recognition (OCR), particularly suited for the insurance industry.

Model Features

Multilingual Support

Supports three languages: Vietnamese, English, and Chinese.

Multimodal Capabilities

Combines visual and language processing abilities, suitable for tasks like image-to-text conversion.

Multiple Quantization Versions

Offers various quantization versions from Q2_K to f16 to meet different needs.

Insurance Industry Optimization

Specially optimized for applications in the insurance industry.

Model Capabilities

Image-to-text

Optical Character Recognition

Multilingual Text Generation

Visual Question Answering

Use Cases

Insurance

Insurance Document Processing

Automatically identifies and processes text information in insurance documents.

Document Processing

Multilingual OCR

Recognizes document content in Vietnamese, English, and Chinese.

🚀 EraX-VL-7B-V1.5 Static Quants

This project provides static quants of the EraX-VL-7B-V1.5 model, supporting multiple languages and offering various quantized versions for different usage scenarios.

🚀 Quick Start

The project offers static quantizations of the model from https://huggingface.co/erax-ai/EraX-VL-7B-V1.5. If you're new to using GGUF files, refer to TheBloke's READMEs for detailed guidance, including how to concatenate multi-part files.
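
As a minimal sketch of fetching a single quant, the snippet below uses `hf_hub_download` from the `huggingface_hub` package. The repository id and the file-name pattern are assumptions based on how such uploads are commonly named; neither is stated in this document, so check the repository's file list before relying on them.

```python
# Assumed, not confirmed by this document: the repo id and file-name pattern.
REPO_ID = "mradermacher/EraX-VL-7B-V1.5-GGUF"
MODEL_NAME = "EraX-VL-7B-V1.5"


def quant_filename(quant_type: str) -> str:
    """Build the assumed file name for a quant type such as 'Q4_K_M'."""
    return f"{MODEL_NAME}.{quant_type}.gguf"


def download_quant(quant_type: str, local_dir: str = "models") -> str:
    """Download one quant file and return its local path (needs network access)."""
    # Imported lazily so quant_filename() works without the package installed.
    from huggingface_hub import hf_hub_download

    return hf_hub_download(
        repo_id=REPO_ID,
        filename=quant_filename(quant_type),
        local_dir=local_dir,
    )


# Example call (requires network and `pip install huggingface_hub`):
#   path = download_quant("Q4_K_M")
```

For image input, the `mmproj-fp16` file listed under Provided Quants would be downloaded the same way, under the same naming assumption.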

✨ Features

Multilingual Support: Supports languages such as Vietnamese (vi), English (en), and Chinese (zh).
Multiple Quantized Versions: Provides a variety of quantized versions with different sizes and qualities.
Multimodal Capabilities: Tagged as multimodal and suitable for tasks like image-to-text.

📦 Installation

The original model card lists no installation steps.

📚 Documentation

About

Static quants of https://huggingface.co/erax-ai/EraX-VL-7B-V1.5. Weighted/imatrix quants seem not to be available (by me) at this time. If they do not show up a week or so after the static ones, I have probably not planned for them. Feel free to request them by opening a Community Discussion.

Usage

If you are unsure how to use GGUF files, refer to one of TheBloke's READMEs for more details, including on how to concatenate multi-part files.
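
Concatenating multi-part files can be sketched in Python as follows. This assumes the parts are plain byte-wise splits of one file, which is the case a shell `cat` would handle; it does not apply to files produced by a dedicated GGUF splitting tool. The part names in the usage comment are hypothetical. The final check relies on the fact that every GGUF file begins with the four magic bytes `GGUF`.

```python
import shutil
from pathlib import Path

GGUF_MAGIC = b"GGUF"  # first four bytes of every GGUF file


def concatenate_parts(parts: list[Path], output: Path) -> Path:
    """Join split files, in the order given, into a single output file."""
    with output.open("wb") as out:
        for part in parts:
            with part.open("rb") as src:
                # Stream in chunks so large files never sit fully in memory.
                shutil.copyfileobj(src, out)
    return output


def looks_like_gguf(path: Path) -> bool:
    """Cheap sanity check: does the file start with the GGUF magic bytes?"""
    with path.open("rb") as f:
        return f.read(4) == GGUF_MAGIC


# Example with hypothetical part names:
#   parts = [Path("model.gguf.part1of2"), Path("model.gguf.part2of2")]
#   merged = concatenate_parts(parts, Path("model.gguf"))
#   assert looks_like_gguf(merged)
```

The order of `parts` matters: passing them out of sequence produces a file of the right size but with corrupt contents, which the magic-byte check only catches if the first part is misplaced.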

Provided Quants

(Sorted by size, not necessarily quality. IQ-quants are often preferable over similar-sized non-IQ quants.)

Link	Type	Size/GB	Notes
GGUF	mmproj-fp16	1.5	vision supplement
GGUF	Q2_K	3.1
GGUF	Q3_K_S	3.6
GGUF	Q3_K_M	3.9	lower quality
GGUF	Q3_K_L	4.2
GGUF	IQ4_XS	4.4
GGUF	Q4_K_S	4.6	fast, recommended
GGUF	Q4_K_M	4.8	fast, recommended
GGUF	Q5_K_S	5.4
GGUF	Q5_K_M	5.5
GGUF	Q6_K	6.4	very good quality
GGUF	Q8_0	8.2	fast, best quality
GGUF	f16	15.3	16 bpw, overkill
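
One way to read the table is to pick the largest quant that fits a memory budget. The sketch below uses the sizes listed above and reserves room for the `mmproj-fp16` file; the `overhead_gb` allowance for context and runtime buffers is a hypothetical placeholder, not a measured figure, and file size is only a rough proxy for actual memory use.

```python
# File sizes in GB, copied from the Provided Quants table.
QUANT_SIZES_GB = {
    "Q2_K": 3.1,
    "Q3_K_S": 3.6,
    "Q3_K_M": 3.9,
    "Q3_K_L": 4.2,
    "IQ4_XS": 4.4,
    "Q4_K_S": 4.6,
    "Q4_K_M": 4.8,
    "Q5_K_S": 5.4,
    "Q5_K_M": 5.5,
    "Q6_K": 6.4,
    "Q8_0": 8.2,
    "f16": 15.3,
}
MMPROJ_GB = 1.5  # vision supplement, loaded alongside the main model


def largest_quant_that_fits(budget_gb: float, overhead_gb: float = 1.0):
    """Return the largest quant whose file fits the budget, or None.

    overhead_gb is an assumed allowance for context and runtime buffers.
    """
    available = budget_gb - MMPROJ_GB - overhead_gb
    candidates = {name: size for name, size in QUANT_SIZES_GB.items()
                  if size <= available}
    if not candidates:
        return None
    return max(candidates, key=candidates.get)


print(largest_quant_that_fits(12.0))  # 9.5 GB left for the model: Q8_0
```

Size is not the only criterion: the notes column marks Q4_K_S and Q4_K_M as the recommended fast options, so a smaller quant may still be the better choice.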

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better): [graph not included]

And here are Artefact2's thoughts on the matter: https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9

FAQ / Model Request

See https://huggingface.co/mradermacher/model_requests for some answers to questions you might have and/or if you want some other model quantized.

Thanks

I thank my company, nethype GmbH, for letting me use its servers and providing upgrades to my workstation to enable this work in my free time. Additional thanks to @nicoboss for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.

📄 License

This project is licensed under the Apache-2.0 license.

Property	Details
Base Model	erax-ai/EraX-VL-7B-V1.5
Languages	Vietnamese (`vi`), English (`en`), Chinese (`zh`)
Library Name	transformers
License	apache-2.0
Quantized By	mradermacher
Tags	erax, multimodal, erax-vl-7B, insurance, ocr, vietnamese, bcg, image-to-text
