🚀 OpenR1-Qwen-7B-Turkish 🚀
This is a fine-tuned version of Qwen2.5-Instruct trained on WiroAI/dolphin-r1-turkish. It aims to address limitations in language reasoning and performance on low-resource languages, as a contribution to the open-source community.
🚀 Quick Start
```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "WiroAI/OpenR1-Qwen-7B-Turkish"

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# "Find the value of x that satisfies the equation 4x+5 = 6x+7."
prompt = "$4x+5 = 6x+7$ denklemini sağlayan $x$ değerini bul."
messages = [
    # "Please think step by step and answer."
    {"role": "system", "content": "Lütfen adım adım düşün ve cevapla."},
    {"role": "user", "content": prompt}
]
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=4096
)
# Strip the prompt tokens so only the newly generated tokens are decoded.
generated_ids = [
    output_ids[len(input_ids):]
    for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
print(response)
```
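Reasoning models in the R1 family typically emit their chain of thought before the final answer. A minimal sketch for post-processing the Quick Start `response`, assuming the model wraps its reasoning in DeepSeek-R1-style `<think>...</think>` tags (verify this against actual outputs; the tag convention is an assumption, not confirmed by this card):

```python
def split_reasoning(response: str) -> tuple[str, str]:
    """Split an R1-style response into (reasoning, final_answer).

    Assumes the chain of thought is wrapped in <think>...</think> tags;
    if no closing tag is found, the whole response is treated as the answer.
    """
    if "</think>" in response:
        reasoning, _, answer = response.partition("</think>")
        reasoning = reasoning.replace("<think>", "").strip()
        return reasoning, answer.strip()
    return "", response.strip()

# Hypothetical example output for the Quick Start prompt above.
example = "<think>6x - 4x = 5 - 7, so 2x = -2, x = -1.</think>\nx = -1"
reasoning, answer = split_reasoning(example)
```

This keeps the full reasoning trace available for inspection while letting you display only the final answer to end users.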
✨ Features
Overview
- DeepSeek's distilled models sometimes reason in Chinese or English even when prompted in another language.
- Open-source models still need improvement on relatively low-resource languages.
- We are motivated to reproduce R1 and contribute to the community.
Training
- We trained the model on WiroAI/dolphin-r1-turkish for 2 epochs. We used a learning rate of 1e-5 and a max sequence length of 4096. The training followed a cosine learning rate schedule with a 10% warm-up phase.
- Training took 3 days on an 8xA6000 ADA cluster.
- Normally, the R1 team compares the performance of OpenR1 models to DeepSeek-Distill-Qwen-7B and OpenThinker-7B using lighteval. However, since those datasets are MATH-oriented only, we won't disclose the default results, as no conclusive findings can be drawn from them.
You can find the training and evaluation code at: https://github.com/huggingface/open-r1/
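The hyperparameters above could be expressed as a recipe along these lines (a hypothetical sketch in the style of open-r1's YAML configs; field names and the base model id `Qwen/Qwen2.5-7B-Instruct` are assumptions, so check the linked repository for the exact schema):

```yaml
# Hypothetical SFT recipe reflecting the hyperparameters described above.
model_name_or_path: Qwen/Qwen2.5-7B-Instruct   # assumed base model
dataset_name: WiroAI/dolphin-r1-turkish
num_train_epochs: 2
learning_rate: 1.0e-05
lr_scheduler_type: cosine
warmup_ratio: 0.1
max_seq_length: 4096
```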
📚 Documentation
Evaluation
- We observed that the reasoning process has slightly improved. Our model thinks more clearly in Turkish compared to DeepSeek's reasoning model.
- This model was trained for experimental purposes, and any benchmark evaluation is highly appreciated. Please note that this model will produce more tokens compared to normal models and will consume more VRAM during inference.
- If you are willing to evaluate this model, please ensure that it is allowed to produce enough tokens. Generate-until requests that restrict the output to fewer than 4000 tokens will lead to poor results.
- We believe that democratized and culturally improved open-source models will be achieved through sharing and experiments!
🤗 Community
We would like to thank the Hugging Face staff and everyone who contributed to the Open-R1 project!
📄 License
This project is licensed under the Apache 2.0 license.
Citation
```bibtex
@misc{WiroAI2025OpenR1Turkish,
  title={OpenR1-Qwen-7B-Turkish},
  author={Bezir, Abdullah and Asmazo{\u{g}}lu, Cengiz},
  year={2025},
  url={https://huggingface.co/WiroAI/OpenR1-Qwen-7B-Turkish}
}
```
Information Table
| Property | Details |
|---|---|
| Model Type | Fine-tuned version of Qwen2.5-Instruct |
| Training Data | WiroAI/dolphin-r1-turkish |
⚠️ Important Note
This model will produce more tokens compared to normal models and consume more VRAM during inference. When evaluating, make sure the model is allowed to generate enough tokens (at least 4000).
💡 Usage Tip
For complex reasoning tasks, use a larger `max_new_tokens` value so the model can fully express its reasoning process.