BAGEL 7B MoT DF11
The BAGEL-7B-MoT model utilizing DFloat11 lossless compression technology reduces volume by 32% while maintaining bit-level output consistency
Downloads 428
Release Time : 5/25/2025
Model Overview
Based on the BAGEL-7B-MoT model, DFloat11 compression technology achieves model size reduction, suitable for scenarios requiring efficient storage and operation
Model Features
DFloat11 lossless compression
Uses dynamic-length floating-point compression technology to reduce model size by 32% while maintaining 100% accuracy
Efficient GPU operation
Hardware-aware algorithm design enables real-time weight decompression on GPUs, maintaining high inference speed
Huffman coding optimization
Applies Huffman coding to the exponent bits of BFloat16 model weights to achieve high compression rates
Model Capabilities
Text generation
Efficient compressed model inference
Use Cases
Efficient inference
Large model deployment
Deploying large language models in resource-constrained environments
Model size reduced by 32%, memory usage decreased
Featured Recommended AI Models
Š 2025AIbase