Deepseek R1 Llama 8B F32 GGUF
Developed by prithivMLmods
DeepSeek-R1-Llama-8B-F32-GGUF is a GGUF conversion of DeepSeek-R1-Distill-Llama-8B, a Llama-based model distilled from DeepSeek-R1. The DeepSeek-R1 line was developed with large-scale reinforcement learning and exhibits self-verification, reflection, and extended chain-of-thought reasoning.
Downloads 326
Release Time: 6/1/2025
Model Overview
This model is a GGUF conversion of DeepSeek-R1-Distill-Llama-8B. Its reasoning behavior derives from DeepSeek-R1, whose precursor DeepSeek-R1-Zero was trained with reinforcement learning without supervised fine-tuning, and it can explore chain-of-thought reasoning to solve complex problems.
Model Features
Direct Reinforcement Learning Training
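The underlying DeepSeek-R1 research applied reinforcement learning directly to a base model, without supervised fine-tuning as a preliminary step; this 8B model inherits the resulting reasoning behavior through distillation.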
Chain-of-Thought Reasoning
Capable of exploring chain-of-thought reasoning to solve complex problems.
Self-Verification and Reflection
Features self-verification, reflection, and the ability to generate extended chain-of-thought reasoning.
Multi-Precision Quantization
Provides GGUF files in three floating-point precisions: BF16, FP16, and FP32.
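As a rough sizing aid for the three precisions listed above, the sketch below estimates weight storage from bytes per parameter. It assumes a round 8 billion parameters (taken from the "8B" in the model name) and ignores GGUF metadata and tokenizer overhead, so real file sizes will differ somewhat. `estimate_weights_gib` is a hypothetical helper, not part of any library.

```python
# Bytes used to store a single weight at each precision.
BYTES_PER_PARAM = {"BF16": 2, "FP16": 2, "FP32": 4}

# Assumption: a round 8 billion parameters, taken from the "8B" in the model name.
ASSUMED_PARAM_COUNT = 8_000_000_000


def estimate_weights_gib(precision: str, param_count: int = ASSUMED_PARAM_COUNT) -> float:
    """Return the approximate size of the raw weights in GiB (metadata excluded)."""
    return param_count * BYTES_PER_PARAM[precision] / 2**30


if __name__ == "__main__":
    for name in BYTES_PER_PARAM:
        print(f"{name}: ~{estimate_weights_gib(name):.1f} GiB")
```

Under these assumptions the FP32 file needs about twice the storage and memory of the BF16 or FP16 files, which is the main practical trade-off when choosing between them.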
Model Capabilities
Text Generation
Chain-of-Thought Reasoning
Self-Verification
Reflection
Use Cases
Complex Problem Solving
Mathematical Reasoning
Solving complex mathematical problems through chain-of-thought reasoning.
Logical Reasoning
Performing logical reasoning and verification.
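For the reasoning use cases above, it is often convenient to separate the model's chain of thought from its final answer. DeepSeek-R1 style models typically wrap their reasoning in `<think>...</think>` tags; the sketch below assumes this GGUF build follows that format. `split_reasoning` is a hypothetical helper name, and the sample text is made up for illustration rather than real model output.

```python
import re

# Matches the first <think>...</think> span, including newlines inside it.
_THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(text: str) -> tuple[str, str]:
    """Split generated text into (reasoning, answer).

    If no <think> block is present, the reasoning is returned as an
    empty string and the whole text is treated as the answer.
    """
    match = _THINK_RE.search(text)
    if match is None:
        return "", text.strip()
    reasoning = match.group(1).strip()
    # Everything outside the matched block is treated as the answer.
    answer = (text[:match.start()] + text[match.end():]).strip()
    return reasoning, answer


if __name__ == "__main__":
    # Illustrative sample only, not actual model output.
    sample = "<think>17 * 3 = 51, then 51 + 4 = 55.</think>\nThe result is 55."
    reasoning, answer = split_reasoning(sample)
    print("Reasoning:", reasoning)
    print("Answer:", answer)
```

Keeping the reasoning separate lets an application log or inspect the verification steps while showing only the final answer to the user.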