Vapor_v2_7B開源大語言模型 - 支持13種語言處理的高效交流工具

首頁

Vapor V2 7B

由FourOhFour開發

基於Qwen/Qwen2.5-7B模型在多語言數據集上微調的大語言模型，支持13種語言處理

大型語言模型

Transformers

開源協議:Apache-2.0 #多語言對話 #長文本處理 #指令微調

下載量 60

發布時間 : 9/20/2024

模型概述

這是一個基於Qwen2.5-7B模型微調的多語言大語言模型，專注於對話生成和指令跟隨任務，在多種專業領域數據集上進行了訓練

模型特點

多語言支持

支持13種語言的文本生成和理解，包括主要亞洲和歐洲語言

長上下文處理

支持8192個token的長上下文處理能力

多領域知識

在醫學、軍事、推理等多個專業領域數據集上進行訓練

高效訓練

使用flash attention和梯度檢查點等技術優化訓練效率

模型能力

多語言文本生成

指令跟隨

對話系統

知識問答

專業領域諮詢

使用案例

智能助手

多語言客服機器人

為跨國企業提供多語言客戶服務支持

教育

語言學習助手

幫助學習者練習多種語言的寫作和對話

專業諮詢

醫學信息諮詢

提供基礎醫學知識和健康建議

軍事生存指南

提供軍事和野外生存相關專業知識

🚀 輸出模型（outputs/out）

本項目基於 transformers 庫，所訓練的模型是 Qwen/Qwen2.5 - 7B 在特定數據集上的微調版本。該模型支持多種語言，包括中文、英文、法文、西班牙文等。本項目使用 axolotl 工具進行訓練，以下是詳細介紹。

查看 axolotl 配置

axolotl 版本：0.4.1

base_model: Qwen/Qwen2.5-7B
model_type: AutoModelForCausalLM
tokenizer_type: AutoTokenizer

load_in_8bit: false
load_in_4bit: false
strict: false

datasets:
  - path: PocketDoc/Dans-MemoryCore-CoreCurriculum-Small
    type: sharegpt
    conversation: chatml
  - path: NewEden/Kalo-Opus-Instruct-22k-Refusal-Murdered
    type: sharegpt
    conversation: chatml
  - path: Epiculous/Synthstruct-Gens-v1.1-Filtered-n-Cleaned
    type: sharegpt
    conversation: chatml
  - path: NewEden/Gryphe-Sonnet-3.5-35k-Subset
    type: sharegpt
    conversation: chatml
  - path: Nitral-AI/Reasoning-1shot_ShareGPT
    type: sharegpt
    conversation: chatml
  - path: Nitral-AI/GU_Instruct-ShareGPT
    type: sharegpt
    conversation: chatml
  - path: Nitral-AI/Medical_Instruct-ShareGPT
    type: sharegpt
    conversation: chatml
  - path: AquaV/Resistance-Sharegpt
    type: sharegpt
    conversation: chatml
  - path: AquaV/US-Army-Survival-Sharegpt
    type: sharegpt
    conversation: chatml
  - path: Gryphe/Sonnet3.5-SlimOrcaDedupCleaned
    type: sharegpt
    conversation: chatml

chat_template: chatml

val_set_size: 0.002
output_dir: ./outputs/out

adapter:
lora_r:
lora_alpha:
lora_dropout:
lora_target_linear:

sequence_len: 8192
sample_packing: true
eval_sample_packing: false
pad_to_sequence_len: true

plugins:
  - axolotl.integrations.liger.LigerPlugin
liger_rope: true
liger_rms_norm: true
liger_swiglu: true
liger_fused_linear_cross_entropy: true

wandb_project: qwen7B
wandb_entity:
wandb_watch:
wandb_name: qwen7B
wandb_log_model:

gradient_accumulation_steps: 32
micro_batch_size: 1
num_epochs: 2
optimizer: adamw_bnb_8bit
lr_scheduler: cosine
learning_rate: 0.00001
weight_decay: 0.05

train_on_inputs: false
group_by_length: false
bf16: auto
fp16:
tf32: true

gradient_checkpointing: true
early_stopping_patience:
resume_from_checkpoint:
local_rank:
logging_steps: 1
xformers_attention:
flash_attention: true

warmup_ratio: 0.1
evals_per_epoch: 4
eval_table_size:
eval_max_new_tokens: 128
saves_per_epoch: 2

debug:
deepspeed: 
fsdp:
fsdp_config:

special_tokens:
  pad_token: <pad>

🚀 快速開始

此模型是 Qwen/Qwen2.5 - 7B 在特定數據集上的微調版本，在評估集上取得了以下結果：

損失值：0.7923

📚 詳細文檔

模型描述

更多信息待補充。

預期用途與限制

更多信息待補充。

訓練和評估數據

更多信息待補充。

🔧 技術細節

訓練超參數

訓練過程中使用了以下超參數：

學習率：1e - 05
訓練批次大小：1
評估批次大小：1
隨機種子：42
分佈式類型：多GPU
設備數量：4
梯度累積步數：32
總訓練批次大小：128
總評估批次大小：4
優化器：Adam，β=(0.9, 0.999)，ε = 1e - 08
學習率調度器類型：餘弦
學習率調度器熱身步數：46
訓練輪數：2

訓練結果

訓練損失	輪數	步數	驗證損失
1.0297	0.0043	1	1.1468
0.8512	0.2515	58	0.8729
0.8496	0.5030	116	0.8193
0.8175	0.7546	174	0.8033
0.7868	1.0041	232	0.7961
0.8119	1.2555	290	0.7934
0.799	1.5069	348	0.7926
0.7891	1.7583	406	0.7923