FairyR1-32B is an efficient large language model developed by Peking University DS-LAB, based on DeepSeek-R1-Distill-Qwen-32B. It achieves a balance between high performance and low-cost inference through an innovative 'distillation-fusion' process.
Large Language Model
Transformers English