Erax VL 7B V2.0 Preview GGUF
EraX-VL-7B-V2.0-Preview is a multimodal foundation model supporting Vietnamese, English, and Chinese, suitable for various vision-language tasks.
Downloads 162
Release Time : 1/11/2025
Model Overview
This is a 7B-parameter-scale multimodal model focused on vision-language tasks, supporting multiple languages and application scenarios such as insurance, optical character recognition, radiology, etc.
Model Features
Multilingual Support
Supports processing in three languages: Vietnamese, English, and Chinese.
Multimodal Capabilities
Combines vision and language processing abilities, suitable for joint tasks involving images and text.
Multiple Quantized Versions
Offers various quantized versions to accommodate different hardware and performance needs.
Model Capabilities
Image-to-text
Visual Question Answering
Document Question Answering
Handwriting Recognition
Ancient Text Processing
Use Cases
Insurance
Traffic Accident Processing
Used for processing image and text data related to traffic accidents.
Medical
Radiology Analysis
Used for analyzing radiology images and related text reports.
Document Processing
Optical Character Recognition
Used for extracting text information from images.
Handwriting Recognition
Used for recognizing handwritten text.
Featured Recommended AI Models
Š 2025AIbase