Model Selection

RLHF alignment

# RLHF alignment

Llama 3.1 405B Instruct

Llama 3.1 is a multilingual large language model series developed by Meta, including 8B, 70B, and 405B scales, supporting multilingual text generation and code generation tasks.

Large Language Model

Transformers Supports Multiple Languages

Llama 3.1 405B FP8

Meta Llama 3.1 is a multilingual large language model collection, including 8B, 70B, and 405B parameter pre-trained and instruction-tuned generative models, supporting 8 languages with outstanding performance on industry benchmarks.

Large Language Model

Transformers Supports Multiple Languages

Gpt2 Large Harmless Reward Model

A large GPT2 model trained on the Anthropic/hh - rlhf harmless dataset, specifically for harmful response detection or reinforcement learning from human feedback (RLHF).

Large Language Model

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase