Erax VL 7B V2.0 Preview I1 GGUF
This is the result of applying weighted/importance-matrix (imatrix) quantization to the EraX-VL-7B-V2.0-Preview model. Multiple quantization versions are offered to suit different needs.
Image-to-Text | Supports Multiple Languages | Open Source License: Apache-2.0
#Multimodal Q&A #Vietnamese OCR #Medical Document Processing
Downloads 246
Release Time: 1/12/2025
Model Overview
A multimodal vision-language model supporting Vietnamese, English, and Chinese, suitable for various document and image understanding tasks
Model Features
Multilingual Support
Native processing capabilities for Vietnamese, English, and Chinese
Multimodal Capabilities
Ability to process both text and visual information for image-text understanding
Multiple Quantization Versions
Offers quantization options ranging from 1-bit to 6-bit to meet different hardware requirements
Document Understanding
Specially optimized for OCR, handwriting recognition, and ancient text processing
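As a rough guide to the 1-bit to 6-bit range listed under Multiple Quantization Versions, the weight size of a quantized model can be approximated as parameters × bits per weight / 8 bytes. The Python sketch below applies that arithmetic. It assumes "7B" means roughly 7 billion parameters and that every weight uses the same bit width. Real imatrix GGUF files mix bit widths across tensors and add metadata, and a vision-language model also needs memory for its vision encoder and context, so treat the numbers as a lower-bound estimate. The function names are hypothetical and are not part of any library.

```python
from typing import Optional, Sequence

# Assumption: "7B" in the model name is taken as about 7 billion parameters.
N_PARAMS = 7e9


def estimate_weight_size_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in GiB.

    Uses params * bits / 8 bytes and ignores file metadata,
    mixed-precision tensors, and runtime memory.
    """
    if n_params <= 0 or bits_per_weight <= 0:
        raise ValueError("n_params and bits_per_weight must be positive")
    return n_params * bits_per_weight / 8 / 2**30


def widest_fitting_bits(
    budget_gib: float,
    candidates: Sequence[int] = (1, 2, 3, 4, 5, 6),
    n_params: float = N_PARAMS,
) -> Optional[int]:
    """Return the largest candidate bit width whose estimated weight
    size fits in budget_gib, or None if none of them fit."""
    fitting = [
        bits
        for bits in candidates
        if estimate_weight_size_gib(n_params, bits) <= budget_gib
    ]
    return max(fitting) if fitting else None


if __name__ == "__main__":
    for bits in (1, 2, 3, 4, 5, 6):
        size = estimate_weight_size_gib(N_PARAMS, bits)
        print(f"{bits}-bit: about {size:.2f} GiB of weights")
```

Picking the widest bit width that fits the memory budget reflects the usual trade-off: lower bit widths reduce memory use at the cost of output quality, so the smallest versions are generally a last resort for constrained hardware.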
Model Capabilities
Optical Character Recognition (OCR)
Handwriting Recognition
Ancient Text Processing
Visual Question Answering
Document Q&A
Multilingual Text Generation
Image Content Understanding
Use Cases
Insurance Industry
Traffic Accident Report Processing
Automatically analyze accident scene images and report texts
Improves claims processing efficiency
Medical Field
Radiology Report Generation
Generate diagnostic reports from medical images
Assists doctors in diagnosis
Ancient Text Digitization
Ancient Text Recognition
Recognize and transcribe characters in ancient texts
Promotes cultural heritage preservation
© 2025 AIbase