Thinkygemma 4b
T
Thinkygemma 4b
Developed by xsanskarx
A pseudo-reasoning expert model fine-tuned from Google Gemma-3-4b-pt, designed for structured reasoning/pseudo-inductive reasoning
Downloads 19
Release Time : 3/14/2025
Model Overview
This model is a fine-tuned version of Google Gemma-3-4b-it, aiming to mimic an excellent reasoner, focusing on structured reasoning and pseudo-inductive reasoning tasks.
Model Features
Structured Reasoning Capability
Designed for structured reasoning and pseudo-inductive reasoning, capable of generating logically coherent reasoning processes.
Efficient Fine-Tuning
Utilizes LoRA fine-tuning technique (r = 128, alpha = 256), completing training in just 9 hours on a single NVIDIA H100.
High-Quality Training Data
Trained on 25,000 validated Chain-of-Thought (CoT) trajectories, sourced from DeepSeek R1 and Qwen QWQ.
Model Capabilities
Text Generation
Structured Reasoning
Pseudo-Inductive Reasoning
Use Cases
Education
Logical Reasoning Teaching
Used to generate logical reasoning examples, helping students understand the problem-solving process of complex issues.
Generates coherent reasoning chains, demonstrating step-by-step problem-solving processes.
Research
Reasoning Capability Research
Used to study the reasoning capabilities and pseudo-reasoning behaviors of AI models.
Provides analyzable reasoning trajectories, aiding in understanding model reasoning mechanisms.
Featured Recommended AI Models