Llama2 7b Mmlu
Continuously trained on the MMLU dataset based on the Llama-2-7b-hf model to improve MMLU metrics while maintaining stability in other indicators
Downloads 120
Release Time : 10/10/2023
Model Overview
This model is an improved version of Llama-2-7b-hf, continuously trained on the mmlu_recall dataset, focusing on enhancing performance in MMLU benchmark tests while ensuring other capability metrics remain unaffected.
Model Features
MMLU performance improvement
Through continuous training on the mmlu_recall dataset, the MMLU metric reached 60.04, showing significant improvement compared to the original version
Multi-task capability retention
While improving MMLU performance, it maintains stable performance in other benchmark tests such as ARC and HellaSwag
Open-source license
Adopts the Apache-2.0 license, allowing for commercial and research use
Model Capabilities
Text generation
Knowledge Q&A
Language understanding
Reasoning ability
Use Cases
Education
Academic Q&A system
Used to answer various academic questions, especially those requiring broad knowledge
Excellent performance in MMLU benchmark tests
Research
Model performance research
Study the impact of continuous training on specific metrics
Achieved improvement in specific metrics without affecting other capabilities
Featured Recommended AI Models