14B-Qwen2.5-Freya-x1开源文本生成模型 - 免费支持指令理解任务

首页

14B Qwen2.5 Freya X1

由 Sao10K 开发

基于Qwen2.5-14B和Qwen2.5-14B-Instruct模型的多阶段训练模型，专注于文本生成和指令理解任务。

大型语言模型

Transformers

开源协议:其他 #多步LoRA训练 #长文本生成 #文学创作优化

下载量 252

发布时间 : 12/31/2024

模型简介

该模型采用两阶段训练方法，首先在文学和原始文本上进行LoRA训练，然后在指令数据上进一步微调，旨在提高文本生成质量和指令遵循能力。

模型特点

多阶段训练

采用两阶段训练方法，先基础训练后指令微调，提升模型性能

高效微调

使用LoRA适配器进行参数高效微调，降低训练成本

长上下文支持

支持16384个token的上下文长度

优化训练

采用多种优化技术如闪光注意力、梯度检查点等提升训练效率

模型能力

文本生成

指令理解

文学创作

对话系统

使用案例

内容创作

文学创作

生成小说、散文等文学作品

基于清理后的文学数据集训练，可生成较高质量的文学内容

对话系统

智能助手

构建遵循指令的对话助手

通过指令数据微调，提高指令理解和执行能力

🚀 14B-Qwen2.5-Freya-v1

本项目围绕Qwen 2.5的基础模型进行微调训练，探索了多步训练等方法，生成了14B-Qwen2.5-Freya-v1模型。该模型在特定的文本数据上进行训练，为相关领域的应用提供了新的选择。

Freya 训练失败时的我

✨ 主要特性

Freya-S1

在Qwen 2.5基础模型上，基于约1.1GB的文学和原始文本进行LoRA训练。
尽可能对文本和文献进行了清理，但可能仍存在一些问题。

Freya-S2

先将第一个LoRA应用于Qwen 2.5 Instruct，然后在此基础上继续训练。
降低了LoRA的秩，主要是因为模型以指令学习为主，还有一些细节暂不赘述。

📦 安装指南

文档中未提及安装相关内容，暂无法提供安装指南。

💻 使用示例

📚 详细文档

训练信息

训练总时长约为10小时，在8xH100节点上完成，由新加坡政府或其他机构赞助。感谢内政部的国民服役津贴。
如需联系，请访问：https://sao10k.carrd.co/

模型相关信息

属性	详情
模型名称	14B-Qwen2.5-Freya-v1
基础模型	Qwen/Qwen2.5-14B
标签	generated_from_trainer
许可证	qwen

基于Axolotl构建

查看axolotl配置

axolotl版本: 0.6.0

base_model:
- s1: Qwen/Qwen2.5-14B
- s2: Qwen/Qwen2.5-14B-Instruct
model_type: AutoModelForCausalLM
tokenizer_type: AutoTokenizer

load_in_8bit: false
load_in_4bit: false
strict: false
sequence_len: 16384
bf16: auto
fp16:
tf32: false
flash_attention: true
special_tokens:
  
adapter: lora # 16-bit
lora_r:
- s1: 64
- s2: 32
lora_alpha: 64
lora_dropout: 0.2
lora_fan_in_fan_out:
peft_use_rslora: true
lora_target_linear: true
  
# 数据
dataset_prepared_path: dataset_run_freya
datasets:
# S1 - 写作/补全
  - path: datasets/eBooks-cleaned-75K
    type: completion
  - path: datasets/novels-clean-dedupe-10K
    type: completion
# S2 - 指令学习
  - path: datasets/10k-amoral-full-fixed-sys.json
    type: chat_template
    chat_template: chatml
    roles_to_train: ["gpt"]
    field_messages: conversations
    message_field_role: from
    message_field_content: value
    train_on_eos: turn
  - path: datasets/44k-hespera-smartshuffle.json
    type: chat_template
    chat_template: chatml
    roles_to_train: ["gpt"]
    field_messages: conversations
    message_field_role: from
    message_field_content: value
    train_on_eos: turn
  - path: datasets/5k_rpg_adventure_instruct-sys.json
    type: chat_template
    chat_template: chatml
    roles_to_train: ["gpt"]
    field_messages: conversations
    message_field_role: from
    message_field_content: value
    train_on_eos: turn
shuffle_merged_datasets: true
warmup_ratio: 0.1

plugins:
  - axolotl.integrations.liger.LigerPlugin
liger_rope: true
liger_rms_norm: true
liger_layer_norm: true
liger_glu_activation: true
liger_fused_linear_cross_entropy: true

# 迭代次数
num_epochs:
- s1: 1
- s2: 2

# 采样
sample_packing: true
pad_to_sequence_len: true
train_on_inputs: false
group_by_length: false

# 批处理
gradient_accumulation_steps: 4
micro_batch_size: 2
gradient_checkpointing: unsloth

# 评估
val_set_size: 0.025
evals_per_epoch: 5
eval_table_size:
eval_max_new_tokens: 256
eval_sample_packing: false
eval_batch_size: 1

# 优化器
optimizer: paged_ademamix_8bit
lr_scheduler: cosine
learning_rate:
- s1: 0.000002
- s2: 0.000004
weight_decay: 0.2
max_grad_norm: 10.0

# 垃圾回收
gc_steps: 10

# 其他
deepspeed: ./deepspeed_configs/zero2.json