UNA TheBeagle 7b V1
TheBeagle is a 7-billion-parameter model trained on The Bagel dataset and tuned with DPO (Direct Preference Optimization) and UNA (Uniform Neural Alignment). It performs well across multi-task benchmarks.
Release Time: 1/9/2024
Model Overview
This 7-billion-parameter large language model is based on Intel's neural-chat model and was fine-tuned on a carefully selected set of DPO pairs. It scores well on several benchmarks, including ARC, GSM8K, and HellaSwag.
Model Features
DPO Optimization
Trained with Direct Preference Optimization on a carefully selected dataset of DPO pairs
UNA Alignment
Applies Uniform Neural Alignment (UNA) to the perceptron layers, with a learning rate of 3.5e-7
High Performance
Achieves excellent results in multiple benchmarks including ARC, GSM8K, and HellaSwag
Data Decontamination
The dataset undergoes rigorous decontamination to ensure training quality
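The DPO feature listed above can be illustrated with a minimal sketch of the standard per-pair DPO loss. This is not the training code used for this model: the function name dpo_loss, its argument names, and the default beta of 0.1 are assumptions chosen for illustration. Inputs are summed log-probabilities of the chosen and rejected responses under the policy being trained and under a frozen reference model.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-pair DPO loss: -log(sigmoid(beta * (chosen_margin - rejected_margin))).

    beta=0.1 is an illustrative default, not a value taken from this model card.
    """
    # How much more (or less) the policy likes each response than the reference does.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp
    logits = beta * (chosen_margin - rejected_margin)

    # -log(sigmoid(x)) equals softplus(-x); branch to avoid overflow in exp().
    if logits >= 0:
        return math.log1p(math.exp(-logits))
    return -logits + math.log1p(math.exp(logits))
```

When the policy and reference agree exactly, the loss is log 2. It falls below that value as the policy raises the chosen response relative to the rejected one, and rises above it in the opposite case.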
Model Capabilities
Text generation
Question answering
Mathematical reasoning
Commonsense reasoning
Logical reasoning
Use Cases
Academic Research
Natural Language Processing Research
Can serve as a baseline for comparing language models and validating new training techniques
Scores well on benchmarks such as ARC, GSM8K, and HellaSwag
Educational Applications
Mathematical Problem Solving
Solves grade-school math word problems such as those in GSM8K
Achieves an exact-match rate of 72.1% on GSM8K
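To make the exact-match figure concrete, here is a rough sketch of how such a rate could be computed for numeric answers. It is an assumption-laden illustration, not the evaluation harness behind the reported score: the helper names extract_final_number and exact_match_rate are hypothetical, and the rule of comparing the last number in each string is a simplification.

```python
import re

# Matches integers and decimals, optionally negative, with thousands separators.
_NUMBER = re.compile(r"-?\d[\d,]*(?:\.\d+)?")


def extract_final_number(text):
    """Return the last number found in text as a string without commas, or None."""
    matches = _NUMBER.findall(text)
    if not matches:
        return None
    return matches[-1].replace(",", "")


def exact_match_rate(predictions, references):
    """Fraction of predictions whose final number equals the reference's final number."""
    if not references or len(predictions) != len(references):
        raise ValueError("predictions and references must be non-empty and equal in length")
    hits = 0
    for pred, ref in zip(predictions, references):
        p = extract_final_number(pred)
        r = extract_final_number(ref)
        # A prediction with no number at all counts as a miss.
        if p is not None and r is not None and float(p) == float(r):
            hits += 1
    return hits / len(references)
```

Under this definition, a rate of 0.721 would mean that 721 of every 1,000 problems ended in the correct final number.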