Seerattention Decode Qwen3 4B AttnGates
Provide the AttnGate weights for the decoding phase in the SeerAttention-R paper, supporting the inference tasks of the Qwen3-4B model
Downloads 4,295
Release Time : 6/9/2025
Model Overview
This model contains the attention gate weights for the decoding phase in the SeerAttention-R paper, used to enhance the inference ability of the Qwen3-4B model
Model Features
Attention Optimization in Decoding Phase
Provide attention gate weights for the decoding phase to optimize the inference process
Multi-budget Support
Support inference tasks under different token budgets
Compatibility with Qwen3 Series
Designed specifically for the Qwen3-4B model
Model Capabilities
Inference Task Optimization
Attention Mechanism Enhancement
Text Generation
Use Cases
Academic Inference
AIME Math Contest Problem Solving
Solve AIME math contest problems
Achieve an accuracy of 55.42 - 72.50% under different token budgets
GPQA Question Answering
Solve GPQA test questions
Achieve an accuracy of 39.61 - 56.19% under different token budgets
Mathematical Problem Solving
MATH500 Math Problem Solving
Solve math problems in the MATH500 dataset
Achieve an accuracy of 84.80 - 93.93% under different token budgets
Featured Recommended AI Models
Š 2025AIbase