D

Deepseek R1

Developed by deepseek-ai
DeepSeek-R1 is the first-generation inference model launched by DeepSeek. Through large-scale reinforcement learning training, it performs excellently in mathematics, code, and reasoning tasks.
Downloads 1.7M
Release Time : 1/20/2025

Model Overview

DeepSeek-R1 is a large-scale language model based on the MoE architecture, trained through two-stage reinforcement learning and supervised fine-tuning, focusing on improving complex reasoning abilities.

Model Features

Pure Reinforcement Learning Training
The DeepSeek-R1-Zero version is completely trained through reinforcement learning without supervised fine-tuning, demonstrating naturally emerging reasoning abilities.
Two-stage Training Process
It includes two RL stages for discovering reasoning patterns and aligning human preferences, as well as two SFT stages as ability seeds.
Powerful Reasoning Ability
It performs excellently in mathematics, code, and complex reasoning tasks, comparable to OpenAI-o1.
Knowledge Distillation Support
It supports distilling the reasoning ability of large models into small models to improve the performance of small models.

Model Capabilities

Solving Complex Mathematical Problems
Code Generation and Understanding
Long Text Reasoning
Multi-step Logical Reasoning
Self-verification and Reflection
Thought Chain Generation

Use Cases

Education
Mathematical Problem Solving
Solve complex mathematical problems, including proof questions and calculation questions.
It performs excellently in mathematical benchmark tests.
Programming
Code Generation
Generate functional code based on problem descriptions.
It achieves a Pass@1-COT of 65.9% on LiveCodeBench.
Research
Scientific Reasoning
Handle complex scientific problems and reasoning tasks.
It achieves an accuracy of 71.5% in the GPQA-Diamond test.
Featured Recommended AI Models
ยฉ 2025AIbase