G

Gpt2 Open Instruct V1 Anthropic Hh Rlhf

Developed by jtatman
A dialogue model fine-tuned on the Anthropic/hh-rlhf dataset based on GPT2-open-instruct, excelling in responding to prompts in dialogue scenarios
Downloads 125
Release Time : 7/22/2023

Model Overview

This model is a fine-tuned version of vicgalle/gpt2-open-instruct-v1 on a subset of the Anthropic/hh-rlhf dataset, primarily used for instruction response in dialogue scenarios

Model Features

Dialogue Scenario Optimization
Specifically optimized for the 'Human:' and 'Assistant:' dialogue format
Short-text Generation Advantage
Performs better in short-text reply scenarios
RLHF Adaptation
Reconstructed the language model head through partial RLHF adapters

Model Capabilities

Dialogue Generation
Instruction Response
Short-text Generation

Use Cases

Dialogue System
Dialogue Response Generation
Generate dialogue responses based on user input
Achieved a loss value of 2.1534 on the evaluation set
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase