Deductive Reasoning Qwen 32B
MIT
A model trained through reinforcement fine-tuning based on Qwen 2.5 32B Instruct, specifically designed to solve challenging deductive reasoning problems in the Temporal Clue dataset.
Large Language Model
Transformers English