
Stable Vicuna 13b Delta

Developed by CarperAI
StableVicuna-13B is a Vicuna-13B v0 model fine-tuned with Reinforcement Learning from Human Feedback (RLHF), using Proximal Policy Optimization (PPO), on various dialogue and instruction datasets.
Downloads 31
Release Time: 4/26/2023

Model Overview

StableVicuna-13B is an autoregressive language model based on the LLaMA transformer architecture, specializing in text generation for dialogue tasks.
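To make "autoregressive" concrete, here is a minimal sketch of greedy decoding: the model is asked for next-token scores, the best token is appended, and the loop repeats until an end marker appears. Everything in it is hypothetical: `toy_model`, the score table, and the `<s>` / `</s>` markers stand in for a real LLaMA-style network, which would condition on the whole context rather than only the last token.

```python
from typing import Callable, Dict, List

# Hypothetical next-token scores standing in for a real language model.
TOY_NEXT_TOKEN_SCORES: Dict[str, Dict[str, float]] = {
    "<s>": {"hello": 0.9, "world": 0.1},
    "hello": {"world": 0.8, "</s>": 0.2},
    "world": {"</s>": 1.0},
}


def toy_model(tokens: List[str]) -> Dict[str, float]:
    """Return next-token scores for the sequence so far.

    This toy only looks at the last token; a real transformer
    attends to every token in the context.
    """
    return TOY_NEXT_TOKEN_SCORES.get(tokens[-1], {"</s>": 1.0})


def generate(
    model: Callable[[List[str]], Dict[str, float]],
    prompt: List[str],
    max_new_tokens: int = 10,
    eos: str = "</s>",
) -> List[str]:
    """Greedy autoregressive decoding: append one token per step."""
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        scores = model(tokens)
        next_token = max(scores, key=scores.get)  # pick the highest-scoring token
        tokens.append(next_token)
        if next_token == eos:
            break
    return tokens


print(generate(toy_model, ["<s>"]))  # ['<s>', 'hello', 'world', '</s>']
```

Real systems usually sample from the score distribution instead of always taking the maximum, but the one-token-at-a-time loop is the same.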

Model Features

Reinforcement Learning Fine-tuning
Fine-tuned with Reinforcement Learning from Human Feedback (RLHF), using Proximal Policy Optimization (PPO) as the optimization algorithm, on multiple dialogue and instruction datasets.
Multi-dataset Training
Trained on high-quality datasets such as OpenAssistant, GPT4All, and Alpaca.
Dialogue Optimization
Specializes in text generation for dialogue tasks and is tuned to produce coherent, meaningful responses.
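For readers unfamiliar with PPO, the sketch below shows the standard clipped surrogate objective for a single sample. It is an illustration of the general algorithm, not CarperAI's training code: the function name and the default `epsilon=0.2` are assumptions, and the page does not state the hyperparameters actually used.

```python
def ppo_clipped_objective(ratio: float, advantage: float, epsilon: float = 0.2) -> float:
    """Standard PPO clipped surrogate for one sample.

    ratio     -- new_policy_prob / old_policy_prob for the sampled action
    advantage -- estimated advantage (in RLHF, derived from a reward model)
    epsilon   -- clipping range (0.2 is a common default, assumed here)
    """
    # Keep the probability ratio inside [1 - epsilon, 1 + epsilon].
    clipped_ratio = max(1.0 - epsilon, min(ratio, 1.0 + epsilon))
    # Take the more pessimistic of the clipped and unclipped terms.
    return min(ratio * advantage, clipped_ratio * advantage)


# A large policy change with positive advantage is capped at (1 + epsilon) * advantage.
print(ppo_clipped_objective(ratio=1.5, advantage=1.0))
```

The clipping removes the incentive to move the policy far from the one that generated the samples, which is what keeps each fine-tuning update small.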

Model Capabilities

Text generation
Dialogue systems
Instruction following

Use Cases

Dialogue systems
Intelligent Assistant
Used to build intelligent assistants capable of understanding and responding to user instructions and questions.
Text generation
Code Generation
Generates code snippets, such as Python scripts, from user instructions.
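As a rough illustration of the assistant use case, the helper below assembles a multi-turn prompt that ends where the model is expected to continue. The turn markers `### Human:` and `### Assistant:` and the name `build_prompt` are assumptions for this sketch; this page does not specify a prompt template, so check the model's own documentation before relying on them.

```python
from typing import List, Tuple

# Assumed turn markers; not specified on this page.
HUMAN_TAG = "### Human:"
ASSISTANT_TAG = "### Assistant:"


def build_prompt(history: List[Tuple[str, str]], user_message: str) -> str:
    """Format earlier (user, assistant) turns plus a new user message.

    The prompt ends with the assistant marker so that an autoregressive
    model continues with the assistant's reply.
    """
    parts: List[str] = []
    for user_turn, assistant_turn in history:
        parts.append(f"{HUMAN_TAG} {user_turn}")
        parts.append(f"{ASSISTANT_TAG} {assistant_turn}")
    parts.append(f"{HUMAN_TAG} {user_message}")
    parts.append(ASSISTANT_TAG)
    return "\n".join(parts)


prompt = build_prompt(
    history=[("Hi", "Hello! How can I help?")],
    user_message="Write a Python script that prints the numbers 1 to 5.",
)
print(prompt)
```

The resulting string would be passed to whatever text-generation interface serves the model, and the generated continuation is the assistant's reply.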