AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
RLHF Reward Model

# RLHF Reward Model

Prometheus 7b V2.0
Apache-2.0
Prometheus 2 is a language model based on Mistral-Instruct, specifically designed for fine-grained evaluation and reinforcement learning from human feedback, serving as an alternative to GPT-4 evaluation.
Large Language Model Transformers English
P
prometheus-eval
13.07k
91
Hh Rlhf Rm Open Llama 3b
A reward model trained based on the LMFlow framework. It is trained on the HH - RLHF dataset (only the useful part) with open_llama_3b as the base model and has good generalization ability.
Large Language Model Transformers
H
weqweasdas
483
18
Toxicitymodel
Apache-2.0
ToxicityModel is a fine-tuned model based on RoBERTa, designed to assess the toxicity level of English sentences.
Text Classification Transformers English
T
nicholasKluge
133.56k
12
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase