U

UNA TheBeagle 7b V1

Developed by fblgit
TheBeagle is a 7-billion-parameter model trained on The Bagel dataset, optimized with DPO (Direct Preference Optimization) and UNA (Unified Neural Architecture) techniques, demonstrating excellent performance in multi-task scenarios.
Downloads 88
Release Time : 1/9/2024

Model Overview

This model is a 7-billion-parameter large language model optimized with a carefully selected DPO paired dataset, based on Intel's neural-chat model, and has shown outstanding performance in multiple benchmark tests.

Model Features

DPO Optimization
Trained with Direct Preference Optimization techniques on a carefully selected DPO paired dataset
UNA Architecture
Optimizes perceptron layers using Unified Neural Architecture, with a learning rate set to 3.5e-7
High Performance
Achieves excellent results in multiple benchmarks including ARC, GSM8K, and HellaSwag
Data Decontamination
The dataset undergoes rigorous decontamination to ensure training quality

Model Capabilities

Text generation
Question answering
Mathematical reasoning
Commonsense reasoning
Logical reasoning

Use Cases

Academic Research
Natural Language Processing Research
Can be used for language model performance comparison and new technology validation
Performs excellently in multiple benchmark tests
Educational Applications
Mathematical Problem Solving
Solves mathematical problems such as those in GSM8K
Achieves an exact match rate of 72.1%
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase