B

Bagel 8b V1.0

Developed by jondurbin
An 8B parameter model built on Meta's Llama-3, fine-tuned using the Bagel dataset, and adopting a single llama-3 instruction format.
Downloads 14.11k
Release Time : 4/24/2024

Model Overview

This model is a fine-tuned version of llama-3-8b using the Bagel dataset, focusing on instruction following and context understanding tasks.

Model Features

Single instruction format
Adopting the llama-3-instruct prompt template simplifies the prompt format and improves the consistency of instruction following.
Multi-task support
Supports multiple tasks such as question answering, summary generation, and function call, suitable for a wide range of application scenarios.
Fine-tuning with high-quality data
Fine-tuning using the Bagel dataset, combined with multiple high-quality data sources, enhances the model's performance and generalization ability.

Model Capabilities

Text generation
Instruction following
Question answering
Summary generation
Function call
Chain-of-thought reasoning

Use Cases

Education
Question answering system
Used in the education field's question answering system to answer students' questions and provide explanations.
Performs well in MT-Bench, with an average score of 7.296875.
Content generation
Summary generation
Automatically generates summaries of text content, suitable for news, research papers, etc.
Supports summary generation of about 130 words.
Programming assistance
Function call
Generates function call code based on user input, supporting multiple programming languages.
Can accurately generate function call parameters in JSON format.
Featured Recommended AI Models
ยฉ 2025AIbase