đ ConvAI-9b: A Conversational AI Model
ConvAI-9b is a powerful conversational AI model with 9 billion parameters, designed to excel in various conversational applications and provide high - quality interactions.
đ Quick Start
This README provides a comprehensive overview of the ConvAI-9b conversational AI model, including its details, description, training data, intended uses, limitations, and evaluation metrics.
⨠Features
- Fine - Tuned Model: ConvAI-9b is a fine - tuned model based on HuggingFaceH4/zephyr-7b-beta and mistral-community/Mistral-7B-v0.2, offering enhanced conversational capabilities.
- Custom Training Data: It was trained on a custom dataset of AI - user conversations, following a specific format for better performance.
- Multiple Use Cases: Suitable for a wide range of conversational AI applications, such as chatbots, virtual assistants, interactive storytelling, and educational tools.
đĻ Installation
No installation steps are provided in the original document, so this section is skipped.
đģ Usage Examples
No code examples are provided in the original document, so this section is skipped.
đ Documentation
đ Model Details
Property |
Details |
Model Name |
ConvAI-9b |
Authors |
CreitinGameplays |
Date |
April 18th, 2024 |
đ Model Description
ConvAI-9b is a fine - tuned conversational AI model with 9 billion parameters. It is based on the following models:
đ Training Data
The model was fine - tuned on a custom dataset of conversations between an AI assistant and a user. The dataset format followed a specific structure:
<|system|> (system prompt, e.g.: You are a helpful AI language model called ChatGPT, your goal is helping users with their questions) </s> <|user|> (user prompt) </s>
đ¯ Intended Uses
ConvAI-9b is intended for use in conversational AI applications, such as:
- Chatbots
- Virtual assistants
- Interactive storytelling
- Educational tools
â ī¸ Limitations
- Like any other language model, ConvAI-9b may generate incorrect or misleading responses.
- It may exhibit biases present in the training data.
- The model's performance can be affected by the quality and format of the input text.
đ Evaluation
Metrics |
Value |
ARC |
57.50 |
HellaSwag |
80.34 |
TruthfulQA |
49.54 |
Winogrande |
76.24 |
More detailed evaluation here
đ§ Technical Details
No technical details are provided in the original document, so this section is skipped.
đ License
The model is released under the MIT license.