D

Deepseek R1 AWQ

Developed by cognitivecomputations
AWQ quantized version of DeepSeek R1 model, optimized for float16 overflow issues and supports efficient inference deployment
Downloads 30.46k
Release Time : 1/21/2025

Model Overview

AWQ quantized version based on DeepSeek-R1 foundation model, suitable for text generation tasks with bilingual (Chinese-English) processing capabilities

Model Features

Efficient Quantization
Utilizes AWQ quantization technology to significantly reduce computational resource requirements while maintaining model performance
Overflow Fix
Modified model code to fix overflow issues when using float16
High-performance Deployment
Supports efficient deployment via vLLM with performance benchmarks across various GPU configurations

Model Capabilities

Text generation
Bilingual processing (Chinese-English)
Long-context reasoning

Use Cases

Text generation
Content creation
Generate various types of textual content
Dialogue systems
Build intelligent conversational agents
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase