Llama3.1 1B Neo BAAI 1000k
Llama3.1-Neo-1B-100w is an efficient language model pruned to 1.4B parameters from Meta-Llama-3.1-8B-Instruct and fine-tuned using the LLM-Neo method (combining LoRA and knowledge distillation). The training data consists of 1 million samples from BAAI/Infinity-Instruct.
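The card does not spell out the distillation objective used by LLM-Neo, so as a rough illustration only, here is a minimal sketch of the standard soft-target knowledge-distillation loss (temperature-softened KL divergence between teacher and student logits); the function names and the temperature value are assumptions for illustration, not details confirmed by this card:

```python
import math

def softmax(logits, temperature=1.0):
    """Temperature-softened softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def kd_loss(teacher_logits, student_logits, temperature=2.0):
    """Soft-target distillation loss: KL(teacher || student) on
    temperature-softened distributions, scaled by T^2 so gradients
    keep a comparable magnitude across temperatures."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return kl * temperature ** 2
```

In LLM-Neo this kind of distillation signal is combined with LoRA, so only low-rank adapter weights are updated while the student learns to match the teacher's output distribution.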
Release Time: 9/10/2024
Model Overview
This model is a compact large language model produced through pruning and knowledge distillation, focused on text generation tasks and suitable for a variety of natural language processing scenarios.
Model Features
Efficient Parameter Knowledge Distillation
Uses the LLM-Neo method, which combines LoRA with knowledge distillation, to significantly reduce the parameter count while preserving performance.
Lightweight Design
Pruned from 8B to 1.4B parameters, greatly reducing computational resource requirements.
High-Quality Fine-tuning Data
Fine-tuned with 1 million carefully selected samples from the BAAI/Infinity-Instruct dataset.
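The card does not state which pruning criterion reduced the model from 8B to 1.4B parameters. Purely to illustrate the general idea behind pruning, here is a minimal magnitude-pruning sketch that zeroes out the smallest-magnitude weights; the function and the keep ratio are hypothetical and not the actual procedure used for this model:

```python
def prune_by_magnitude(weights, keep_ratio):
    """Keep only the largest-magnitude weights (a keep_ratio fraction),
    zeroing the rest. A toy stand-in for real structured pruning."""
    k = max(1, int(len(weights) * keep_ratio))
    # threshold = magnitude of the k-th largest weight
    threshold = sorted((abs(w) for w in weights), reverse=True)[k - 1]
    return [w if abs(w) >= threshold else 0.0 for w in weights]
```

Real pruning pipelines typically operate on whole structures (attention heads, hidden dimensions, layers) rather than individual scalars, then recover quality with fine-tuning, as described above.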
Model Capabilities
Text Generation
Question Answering
Instruction Following
Knowledge Reasoning
Use Cases
Education
Academic Q&A System
Answers students' questions across academic subjects
Achieved 31.58% accuracy on the CEVAL advanced mathematics subset
Business
Accounting Knowledge Q&A
Handles basic accounting-related questions
Achieved 24.49% accuracy on the CEVAL accounting subset
General AI Assistant
Daily Problem Solving
Answers various questions in daily life
Achieved 58.43% accuracy on the PIQA benchmark