Glm 4vq
4-bit quantized version of GLM-4V-9B, supporting multimodal multilingual understanding with memory usage under 9G, outperforming multiple mainstream models
Image-to-Text
Transformers Supports Multiple Languages#Multimodal Document QA#Low-Memory Visual Inference#12-Language Support

Downloads 440
Release Time : 6/10/2024
Model Overview
Quantized version based on GLM-4V-9B, specializing in document, image, and chart QA tasks, supporting 12-language interaction with excellent performance across multiple benchmarks
Model Features
Efficient Quantization
4-bit quantized version uses less than 9GB memory, can run on free Google Colab
Multilingual Support
Supports interaction in 12 languages, with optimal performance in English and Chinese
Outstanding Performance
Outperforms mainstream models like GPT-4-turbo and Gemini 1.0 Pro in document and image QA tasks
Long Context Support
Supports context length up to 8K tokens
Model Capabilities
Document Understanding
Image Analysis
Chart Parsing
Multilingual Text Generation
Visual QA
Multimodal Reasoning
Use Cases
Education
Textbook Content Analysis
Analyze graphic-text content in textbooks and answer related questions
Accurately understands charts and text content in textbooks
Business
Business Report Analysis
Automatically extract and analyze key data and charts in business reports
Quickly generates report summaries and key metrics
Featured Recommended AI Models