# RLHF Reward Model
Prometheus 7b V2.0
Apache-2.0
Prometheus 2 is a language model based on Mistral-Instruct, specifically designed for fine-grained evaluation and reinforcement learning from human feedback, serving as an alternative to GPT-4 evaluation.
Large Language Model
Transformers English

P
prometheus-eval
13.07k
91
Hh Rlhf Rm Open Llama 3b
A reward model trained based on the LMFlow framework. It is trained on the HH - RLHF dataset (only the useful part) with open_llama_3b as the base model and has good generalization ability.
Large Language Model
Transformers

H
weqweasdas
483
18
Toxicitymodel
Apache-2.0
ToxicityModel is a fine-tuned model based on RoBERTa, designed to assess the toxicity level of English sentences.
Text Classification
Transformers English

T
nicholasKluge
133.56k
12
Featured Recommended AI Models