L

Llama 3 Gutenberg 8B

Developed by nbeerbower
A fine-tuned model based on Llama-3-8b, optimized using the Gutenberg DPO dataset, suitable for text generation tasks.
Downloads 18
Release Time : 5/5/2024

Model Overview

This model is a text generation model based on the Llama-3-8b architecture, fine-tuned using the DPO (Direct Preference Optimization) method on the Gutenberg dataset to enhance instruction-following and text generation capabilities.

Model Features

DPO fine-tuning optimization
Fine-tuned using the Direct Preference Optimization method on the Gutenberg dataset to improve the model's instruction-following capability.
LoRA efficient training
Utilizes LoRA (Low-Rank Adaptation) technology for efficient fine-tuning, reducing computational resource requirements.
Multi-task evaluation
Evaluated on multiple benchmarks (IFEval, BBH, MATH, etc.), demonstrating diverse text generation capabilities.

Model Capabilities

Text generation
Instruction following
Multi-turn dialogue

Use Cases

Education
Educational Q&A system
Used to build Q&A systems in the education field to answer student questions.
Achieved 31.45% accuracy in the MMLU-PRO test
Content creation
Creative writing assistance
Assists writers in creative writing and content generation.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase