Einstein-v6.1-Llama3-8B開源大模型 - 免費支持STEM領域任務應用

首頁

Einstein V6.1 Llama3 8B

由Weyaxi開發

基於Meta-Llama-3-8B在多樣化科學數據集上微調的大語言模型，專注於STEM領域任務

大型語言模型

Transformers

英語開源協議:其他 #STEM領域專家 #多學科知識庫 #科學推理增強

下載量 70

發布時間 : 4/19/2024

模型概述

該模型是在Llama-3-8B基礎上通過多階段微調優化的科學專用模型，擅長處理物理、化學、生物、數學等STEM學科問題

模型特點

STEM領域優化

在物理、化學、生物、數學等科學數據集上專門微調，顯著提升STEM任務表現

多階段指令微調

採用ChatML格式的多樣化指令數據集進行訓練，增強對話和指令跟隨能力

高性能推理

在多個科學基準測試中表現優異，如AI2 ARC(62.46%)、MMLU(66.19%)等

模型能力

科學問答

數學問題求解

物理概念解釋

化學方程式生成

生物學術語解釋

多步驟推理

技術文檔生成

使用案例

教育

科學作業輔導

幫助學生解答物理、化學等學科的作業問題

在ScienceQA等測試集上表現良好

STEM概念解釋

用通俗語言解釋複雜的科學概念

研究

文獻理解輔助

幫助研究人員快速理解科學文獻中的關鍵概念

實驗設計建議

基於已有研究提供實驗設計思路

🚀 🔬 Einstein-v6.1-Llama3-8B

這是一個基於多樣化數據集對 meta-llama/Meta-Llama-3-8B 進行全量微調的模型。它使用 8xRTX3090 + 1xRTXA6000 顯卡，藉助 axolotl 工具完成微調。該模型的訓練由 sablo.ai 贊助。

image/png

🚀 快速開始

模型基礎信息

屬性	詳情
模型類型	Einstein-v6.1-Llama3-8B
基礎模型	meta-llama/Meta-Llama-3-8B
訓練數據	allenai/ai2_arc、camel-ai/physics、camel-ai/chemistry等眾多數據集

提示模板

你可以在使用該模型時使用 ChatML 提示模板：

ChatML

<|im_start|>system
{system}<|im_end|>
<|im_start|>user
{user}<|im_end|>
<|im_start|>assistant
{asistant}<|im_end|>

這個提示模板可以作為聊天模板使用，這意味著你可以使用 tokenizer.apply_chat_template() 方法來格式化消息：

messages = [
    {"role": "system", "content": "You are helpful AI asistant."},
    {"role": "user", "content": "Hello!"}
]
gen_input = tokenizer.apply_chat_template(message, return_tensors="pt")
model.generate(**gen_input)

✨ 主要特性

基於 Meta-Llama-3-8B 基礎模型進行全量微調，在多個科學領域數據集上進行訓練，具備廣泛的科學知識。
使用 ChatML 提示模板，方便與模型進行交互。
有多種量化版本可供選擇，適應不同的應用場景。

📚 詳細文檔

axolotl 配置

查看 axolotl 配置

axolotl 版本: 0.4.0

base_model: meta-llama/Meta-Llama-3-8B
model_type: LlamaForCausalLM
tokenizer_type: AutoTokenizer

load_in_8bit: false
load_in_4bit: false
strict: false

chat_template: chatml
datasets:
  - path: data/merged_all.json
    ds_type: json
    type: alpaca
    conversation: chatml

  - path: data/gpteacher-instruct-special-alpaca.json
    ds_type: json
    type: gpteacher
    conversation: chatml

  - path: data/wizardlm_evol_instruct_70k_random_half.json
    ds_type: json
    type: alpaca
    conversation: chatml

  - path: data/capybara_sharegpt.json
    ds_type: json
    type: sharegpt
    conversation: chatml

  - path: data/synthia-v1.3_sharegpt_12500.json
    ds_type: json
    type: sharegpt
    conversation: chatml  

  - path: data/cot_alpaca_gpt4_extracted_openhermes_2.5_sharegpt.json
    ds_type: json
    type: sharegpt
    conversation: chatml

  - path: data/slimorca_dedup_filtered_95k_sharegpt.json
    ds_type: json
    type: sharegpt
    conversation: chatml  

  - path: data/airoboros_3.2_without_contextual_slimorca_orca_sharegpt.json
    ds_type: json
    type: sharegpt
    conversation: chatml  

  - path: data/allenai_wild_chat_gpt4_english_toxic_random_half_4k_sharegpt.json
    ds_type: json
    type: sharegpt
    strict: false
    conversation: chatml  

  - path: data/pippa_bagel_repo_3k_sharegpt.json
    ds_type: json
    type: sharegpt
    conversation: chatml  

  - path: data/gpt4_data_lmys_1m_sharegpt.json
    ds_type: json
    type: sharegpt
    conversation: chatml  

  - path: data/sharegpt_gpt4_english.json
    ds_type: json
    type: sharegpt
    conversation: chatml

  - path: data/no_robots_sharegpt.json
    ds_type: json
    type: sharegpt
    strict: false
    conversation: chatml

  - path: data/oasst_top1_from_fusechatmixture_sharegpt.json
    ds_type: json
    type: sharegpt
    strict: false
    conversation: chatml

  - path: data/everythinglm-data-v3_sharegpt.json
    ds_type: json
    type: sharegpt
    strict: false
    conversation: chatml

dataset_prepared_path: last_run_prepared
val_set_size: 0.002

output_dir: ./Einstein-v6.1-Llama3-8B-model

sequence_len: 8192
sample_packing: true
pad_to_sequence_len: true
eval_sample_packing: false

wandb_project: Einstein
wandb_entity:
wandb_watch:
wandb_name: Einstein-v6.1-Llama3-2-epoch
wandb_log_model:
hub_model_id: Weyaxi/Einstein-v6.1-Llama3-8B

save_safetensors: true

gradient_accumulation_steps: 4
micro_batch_size: 1
num_epochs: 2
optimizer: adamw_bnb_8bit # look
lr_scheduler: cosine
learning_rate: 0.000005 # look

train_on_inputs: false
group_by_length: false
bf16: true
fp16: false
tf32: false

gradient_checkpointing: true
early_stopping_patience:
resume_from_checkpoint:
local_rank:
logging_steps: 1
xformers_attention:
flash_attention: true

warmup_steps: 10
evals_per_epoch: 2
eval_table_size:
eval_table_max_new_tokens: 128
saves_per_epoch: 2
debug:

deepspeed: zero3_bf16_cpuoffload_params.json
weight_decay: 0.0
fsdp:
fsdp_config:
special_tokens:
  bos_token: "<s>"
  eos_token: "<|im_end|>"
  unk_token: "<unk>"
  pad_token: <|end_of_text|> # changed
tokens:
  - "<|im_start|>"

數據集使用情況

本模型訓練使用的數據集列在模型卡片的元數據部分。請注意，元數據中提到的某些數據集可能已經根據各種標準進行了過濾。過濾過程的結果及其輸出位於本倉庫的數據文件夾中： Weyaxi/Einstein-v6.1-Llama3-8B/data

量化版本

GGUF @bartowski
- https://huggingface.co/bartowski/Einstein-v6.1-Llama3-8B-GGUF
ExLlamaV2 @bartowski
- https://huggingface.co/bartowski/Einstein-v6.1-Llama3-8B-exl2
AWQ @solidrust
- https://huggingface.co/solidrust/Einstein-v6.1-Llama3-8B-AWQ

評估結果

Open LLM Leaderboard 評估結果

詳細結果可查看此處

指標	值
平均值	68.60
AI2 推理挑戰 (25 次樣本學習)	62.46
HellaSwag (10 次樣本學習)	82.41
MMLU (5 次樣本學習)	66.19
TruthfulQA (0 次樣本學習)	55.10
Winogrande (5 次樣本學習)	79.32
GSM8k (5 次樣本學習)	66.11

Open LLM Leaderboard v2 評估結果

詳細結果可查看此處

指標	值
平均值	19.99
IFEval (0 次樣本學習)	45.68
BBH (3 次樣本學習)	29.38
MATH Lvl 5 (4 次樣本學習)	5.74
GPQA (0 次樣本學習)	4.25
MuSR (0 次樣本學習)	11.23
MMLU-PRO (5 次樣本學習)	23.68