Deepseek R1 0528 Distilled Qwen3 Gguf
D
Deepseek R1 0528 Distilled Qwen3 Gguf
Developed by ertghiu256
Fine-tuned based on the Qwen 3 model with 4B parameters to improve inference and problem-solving abilities
Downloads 142
Release Time : 6/16/2025
Model Overview
This model is fine-tuned on a specific dataset based on the Qwen 3 model with 4B parameters. It is mainly used for text generation tasks and enhances inference and problem-solving abilities.
Model Features
Training acceleration
Using Unsloth and Huggingface's TRL library, the training speed is doubled.
Multi-purpose capabilities
Supports various tasks such as general inference, code generation, and problem solving.
Model Capabilities
Text generation
Logical reasoning
Code generation
Problem solving
Use Cases
Inference and problem solving
General inference
Perform general logical inference tasks
Code generation
Code generation
Generate programming code (Note: Not specifically trained for HTML code)
The generated HTML code may not perform well.
Featured Recommended AI Models