T

Thinkygemma 4b

Developed by xsanskarx
A pseudo-reasoning expert model fine-tuned from Google Gemma-3-4b-pt, designed for structured reasoning/pseudo-inductive reasoning
Downloads 19
Release Time : 3/14/2025

Model Overview

This model is a fine-tuned version of Google Gemma-3-4b-it, aiming to mimic an excellent reasoner, focusing on structured reasoning and pseudo-inductive reasoning tasks.

Model Features

Structured Reasoning Capability
Designed for structured reasoning and pseudo-inductive reasoning, capable of generating logically coherent reasoning processes.
Efficient Fine-Tuning
Utilizes LoRA fine-tuning technique (r = 128, alpha = 256), completing training in just 9 hours on a single NVIDIA H100.
High-Quality Training Data
Trained on 25,000 validated Chain-of-Thought (CoT) trajectories, sourced from DeepSeek R1 and Qwen QWQ.

Model Capabilities

Text Generation
Structured Reasoning
Pseudo-Inductive Reasoning

Use Cases

Education
Logical Reasoning Teaching
Used to generate logical reasoning examples, helping students understand the problem-solving process of complex issues.
Generates coherent reasoning chains, demonstrating step-by-step problem-solving processes.
Research
Reasoning Capability Research
Used to study the reasoning capabilities and pseudo-reasoning behaviors of AI models.
Provides analyzable reasoning trajectories, aiding in understanding model reasoning mechanisms.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase