A

Acip Qwen25 3b

Developed by MerantixMomentum
Compressible version of Qwen2.5-3B provided by the ACIP project, supporting dynamic model size adjustment while maintaining performance
Downloads 31
Release Time : 4/15/2025

Model Overview

Compressible model based on Qwen2.5-3B, achieving flexible parameter compression and quantization through ACIP technology, suitable for multilingual text generation tasks

Model Features

Dynamic Compressibility
Supports real-time adjustment of model compression ratio (0-100%) via the size_ratio parameter, with reversible compression operations
Quantization Support
Integrates bitsandbytes' 4-bit quantization scheme to further reduce GPU memory usage
Multilingual Support
Natively supports text generation tasks in 13 languages

Model Capabilities

Text Generation
Model Compression
Quantized Inference

Use Cases

Resource-Constrained Deployment
Edge Device Deployment
Deploy large models to devices with limited GPU memory through compression and quantization
Can reduce GPU memory usage by over 60%
Multilingual Applications
Multilingual Text Generation
Supports text generation and creation in 13 languages
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase