D

Deepseek R1 Bf16

Developed by opensourcerelease
DeepSeek-R1 is the first-generation inference model, which performs excellently in mathematics, code, and reasoning tasks, and its performance is comparable to that of OpenAI-o1.
Downloads 1,486
Release Time : 1/21/2025

Model Overview

DeepSeek-R1 is a large language model focusing on mathematics, code, and reasoning tasks. It is trained through reinforcement learning and cold-start data, and has excellent reasoning ability and self-verification ability.

Model Features

Pure reinforcement learning training
Directly train the model through reinforcement learning without supervised fine-tuning (SFT) as an initial step.
Self-verification ability
The model has self-verification and reflection abilities and can generate long thought chains to solve complex problems.
Distillation support
Supports distilling the inference ability of large models into small models to improve the performance of small models.
128K long context
Supports a context length of up to 128K, suitable for processing long documents and complex tasks.

Model Capabilities

Mathematical reasoning
Code generation
Complex problem solving
Long text processing
Self-verification
Thought chain generation

Use Cases

Education
Mathematics problem solving
Solve high school mathematics competition questions
Achieved 79.8% pass@1 in the AIME 2024 test
Programming education
Generate programming exercises and solutions
Achieved 65.9% pass@1 in the LiveCodeBench test
Software development
Code generation
Generate functional code according to requirements
Achieved a score of 2029 in the Codeforces test
Code debugging
Analyze and fix errors in the code
Solved 49.2% of the problems in the SWE Verified test
Research
Scientific problem solving
Solve complex scientific problems
Achieved 71.5% pass@1 in the GPQA-Diamond test
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase