Deepseek R1 Distill Qwen 1.5B Turkish
This model is a fine-tuned version of DeepSeek-R1-Distill-Qwen-1.5B on the Turkish-R1 dataset, mainly used for Turkish-related reasoning tasks.
Downloads 124
Release Time : 2/7/2025
Model Overview
This model is an inference model optimized for Turkish, fine-tuned on a specific dataset, suitable for Turkish text processing tasks.
Model Features
Turkish optimization
Specifically fine-tuned for Turkish, improving the ability to process Turkish text
Distilled model
Based on knowledge distillation technology, reducing the model size while maintaining performance
Multi-GPU training
Distributed training using 8 GPUs, improving training efficiency
Model Capabilities
Turkish text understanding
Turkish text generation
Turkish reasoning tasks
Use Cases
Natural language processing
Turkish text analysis
Used to analyze Turkish text content
The loss value on the evaluation set is 1.1396
Turkish question-answering system
Build a Turkish question-answering application
Featured Recommended AI Models
Š 2025AIbase