GLM 4 32B 0414 8bit
This model is an 8-bit quantized MLX format conversion from THUDM/GLM-4-32B-0414, supporting Chinese and English text generation tasks.
Downloads 222
Release Time : 4/22/2025
Model Overview
A 32B-parameter large language model based on the GLM-4 architecture, processed with 8-bit quantization and converted to the MLX framework format for efficient inference on Apple silicon devices.
Model Features
8-bit Quantization
Reduces model memory usage through 8-bit quantization while maintaining good inference quality.
MLX Framework Support
Optimized for Apple silicon via the MLX framework, enabling efficient operation on Mac devices.
Bilingual Support
Natively supports text generation tasks in both Chinese and English.
Model Capabilities
Text Generation
Dialogue Systems
Content Creation
Use Cases
Intelligent Assistants
Chatbot
Build bilingual (Chinese-English) chatbots
Smooth and natural conversational experience
Content Creation
Article Generation
Generate coherent articles in Chinese or English based on prompts
Diverse content output aligned with the topic
Featured Recommended AI Models
Š 2025AIbase