Openr1 Qwen 7B SFT Instruct
A version fine-tuned on the OpenR1-Math-220k dataset based on the Qwen2.5-7B-Instruct model, focusing on mathematics-related tasks.
Downloads 396
Release Time : 3/8/2025
Model Overview
This model is further trained on a mathematics dataset through the SFT (Supervised Fine-Tuning) method based on Qwen2.5-7B-Instruct, aiming to improve the performance of mathematics-related tasks.
Model Features
Enhanced mathematical ability
Fine-tuned on the OpenR1-Math-220k dataset to improve the performance of mathematics-related tasks
Instruction following
Inherits the instruction understanding and execution ability of the base model
Efficient training
Uses the TRL framework for supervised fine-tuning, with high training efficiency
Model Capabilities
Mathematics problem solving
Instruction understanding and execution
Text generation
Use Cases
Education
Mathematics problem solving
Solve various mathematics problems, including algebra and geometry
Fine-tuned based on the mathematics dataset, expected to perform better on mathematics tasks
General AI assistant
Instruction execution
Understand and execute various user instructions
Inherits the instruction following ability of the base model
Featured Recommended AI Models