D

Deepseek R1 0528 FP4

Developed by nvidia
A quantized version of the DeepSeek R1 0528 model from DeepSeek AI, an autoregressive language model based on an optimized Transformer architecture, which can be used for commercial and non-commercial purposes.
Downloads 372
Release Time : 6/3/2025

Model Overview

This model is the FP4 quantized version of DeepSeek R1 0528, which reduces disk size and GPU memory requirements and is suitable for text generation tasks.

Model Features

FP4 Quantization
By quantizing weights and activations to the FP4 data type, it reduces the storage and computational resource requirements, reducing the disk size and GPU memory requirements by approximately 1.6 times.
Optimized Transformer Architecture
Based on an optimized Transformer architecture, it is an autoregressive language model suitable for efficient text generation tasks.
Commercial and Non-commercial Use
The model can be used for commercial and non-commercial purposes, following the MIT license.

Model Capabilities

Text Generation
Language Model Inference

Use Cases

Text Generation
Basic Text Completion
Generate coherent text completion based on the given prompt.
Generate coherent text that fits the context.
Question Answering System
Answer questions raised by users, such as factual questions or reasoning questions.
Generate accurate or reasonable answers.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase