AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
RLHF alignment

# RLHF alignment

Llama 3.1 405B Instruct
Llama 3.1 is a multilingual large language model series developed by Meta, including 8B, 70B, and 405B scales, supporting multilingual text generation and code generation tasks.
Large Language Model Transformers Supports Multiple Languages
L
meta-llama
34.83k
569
Llama 3.1 405B FP8
Meta Llama 3.1 is a multilingual large language model collection, including 8B, 70B, and 405B parameter pre-trained and instruction-tuned generative models, supporting 8 languages with outstanding performance on industry benchmarks.
Large Language Model Transformers Supports Multiple Languages
L
meta-llama
540
115
Gpt2 Large Harmless Reward Model
MIT
A large GPT2 model trained on the Anthropic/hh - rlhf harmless dataset, specifically for harmful response detection or reinforcement learning from human feedback (RLHF).
Large Language Model Transformers
G
Ray2333
1,489
3
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase