OpenR1-Qwen-7B-SFT-Instruct Open-source Model - Focused on mathematical tasks, freely assisting in solving mathematical problems

Openr1 Qwen 7B SFT Instruct

Developed by InfiniAILab

A version fine-tuned on the OpenR1-Math-220k dataset based on the Qwen2.5-7B-Instruct model, focusing on mathematics-related tasks.

Large Language Model

Transformers

#Mathematical reasoning fine-tuning #Instruction fine-tuning #Qwen2.5 optimization

Downloads 396

Release Time : 3/8/2025

Model Overview

This model is further trained on a mathematics dataset through the SFT (Supervised Fine-Tuning) method based on Qwen2.5-7B-Instruct, aiming to improve the performance of mathematics-related tasks.

Model Features

Enhanced mathematical ability

Fine-tuned on the OpenR1-Math-220k dataset to improve the performance of mathematics-related tasks

Instruction following

Inherits the instruction understanding and execution ability of the base model

Efficient training

Uses the TRL framework for supervised fine-tuning, with high training efficiency

Model Capabilities

Mathematics problem solving

Instruction understanding and execution

Text generation

Use Cases

Education

Mathematics problem solving

Solve various mathematics problems, including algebra and geometry

Fine-tuned based on the mathematics dataset, expected to perform better on mathematics tasks

General AI assistant

Instruction execution

Understand and execute various user instructions

Inherits the instruction following ability of the base model

Property	Details
TRL	0.16.0.dev0
Transformers	4.49.0
Pytorch	2.5.1
Datasets	3.3.2
Tokenizers	0.21.0

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Openr1 Qwen 7B SFT Instruct

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 OpenR1-Qwen-7B-SFT-Instruct

🚀 Quick Start

✨ Features

📦 Installation

📚 Documentation

Training procedure

Framework versions

📄 License

📚 Citations