Decision Tree Reward Gemma 2 27B
Other
A decision tree reward model fine-tuned based on Gemma-2-27B, used to evaluate the quality of content generated by language models, with outstanding performance on the RewardBench leaderboard.
Large Language Model
Transformers English