P

Phi 4 Reasoning

Developed by microsoft
Phi-4 Reasoning is a cutting-edge open-weight reasoning model based on Phi-4, fine-tuned with supervised chain-of-thought trajectory datasets and trained via reinforcement learning, specializing in mathematics, science, and programming skills.
Downloads 11.31k
Release Time : 4/9/2025

Model Overview

Phi-4 Reasoning is a language model focused on mathematical reasoning, science, and programming, trained with high-quality and advanced reasoning data, suitable for memory/computation-constrained environments and latency-sensitive scenarios.

Model Features

Trained on High-Quality Reasoning Data
Fine-tuned with supervised chain-of-thought trajectory datasets and trained via reinforcement learning, specializing in mathematics, science, and programming skills.
Long-context Support
Supports context lengths of up to 32k tokens, suitable for handling complex queries and long-text reasoning.
Safety Alignment
Adopts robust post-training safety methods through supervised fine-tuning to ensure model responses comply with safety and ethical guidelines.

Model Capabilities

Mathematical Reasoning
Scientific Question Answering
Code Generation
Chat Dialogue
Logical Reasoning

Use Cases

Education
Mathematical Problem Solving
Solving Olympiad-level math problems, such as AIME competition questions.
Achieved 75.3 points on AIME 2024
Programming
Code Generation
Generating functional code to solve programming competition problems.
Achieved 53.8 points on LiveCodeBench
Research
Scientific Question Answering
Answering graduate-level scientific questions, such as those in the GPQA-Diamond dataset.
Achieved 65.8 points on GPQA-D
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase