Samantha-PL-AG-Mistral-7B-v0.2: an open-source conversational model for fluent, natural Polish dialogue

Samantha PL AG Mistral 7B V0.2

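Developed by AdamGrzesik

A Polish conversational model fine-tuned from Mistral-7B-v0.2, aimed at fluent, natural dialogue

Large language model #Polish conversational assistant #Long-text processing #Low-resource fine-tuning

Downloads: 77

Release date: 3/29/2024

Model overview

This model is a version of Mistral-7B-v0.2 fine-tuned on the Samanta-PL-AG-axolotl dataset and optimized for Polish dialogue scenarios.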

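Model features

Long-context support

Fine-tuned with a sequence length of 16384 tokens (sequence_len: 16384) for long-context processing

Efficient training

Trained with flash attention and gradient checkpointing to reduce memory use and improve training efficiency

Polish-language optimization

Fine-tuned specifically for Polish dialogue scenarios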

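Model capabilities

Polish text generation

Dialogue systems

Long-text understanding

Use cases

Dialogue systems

Polish chat assistant

Can serve as an intelligent conversational partner for Polish-speaking users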

🚀 AdamGrzesik/Samantha-PL-AG-Mistral-7B-v0.2

This model is a fine-tuned version of alpindale/Mistral-7B-v0.2-hf on the Samanta-PL-AG-axolotl dataset.

See axolotl config

axolotl version: 0.4.0

base_model: alpindale/Mistral-7B-v0.2-hf
model_type: MistralForCausalLM
tokenizer_type: LlamaTokenizer
is_mistral_derived_model: true

load_in_8bit: false
load_in_4bit: false
strict: false

datasets:
  - path: /workspace/datasets/Samantha-PL-AG-axolotl.json
    type: sharegpt

chat_template: chatml

dataset_prepared_path: last_run_prepared
val_set_size: 0.001
output_dir: /workspace/Samantha

sequence_len: 16384
sample_packing: true
pad_to_sequence_len: true

wandb_project: 
wandb_entity:
wandb_watch:
wandb_run_id:
wandb_log_model:

gradient_accumulation_steps: 8
micro_batch_size: 3
num_epochs: 4
adam_beta2: 0.95
adam_epsilon: 0.00001
max_grad_norm: 1.0
lr_scheduler: cosine
learning_rate: 0.000005
optimizer: adamw_bnb_8bit

train_on_inputs: false
group_by_length: false
bf16: true
fp16: false
tf32: false

gradient_checkpointing: true
gradient_checkpointing_kwargs:
  use_reentrant: true
early_stopping_patience:
resume_from_checkpoint:
local_rank:
logging_steps: 1
xformers_attention:
flash_attention: true

warmup_steps: 10

eval_steps: 73
eval_table_size:
eval_table_max_new_tokens:
eval_sample_packing: false
saves_per_epoch: 
save_steps: 73
save_total_limit: 2
debug:
deepspeed: deepspeed_configs/zero3_bf16.json
weight_decay: 0.1
fsdp:
fsdp_config:
special_tokens:
  eos_token: "<|im_end|>"
tokens:
  - "<|im_start|>"
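
To make the batch and schedule settings in the config above concrete, the following Python sketch works through the arithmetic they imply. The config does not state how many GPUs were used or how many optimizer steps the run took, so `num_gpus` and `total_steps` are hypothetical parameters. The schedule function is a generic linear-warmup, cosine-decay-to-zero formula used for illustration; it is not code taken from axolotl and may differ in detail from what the run actually executed.

```python
import math

# Values copied from the axolotl config above.
MICRO_BATCH_SIZE = 3
GRADIENT_ACCUMULATION_STEPS = 8
SEQUENCE_LEN = 16384
LEARNING_RATE = 0.000005
WARMUP_STEPS = 10


def sequences_per_optimizer_step(num_gpus: int) -> int:
    """Packed sequences consumed per optimizer step.

    num_gpus is a hypothetical input: the config does not state the GPU count.
    """
    return MICRO_BATCH_SIZE * GRADIENT_ACCUMULATION_STEPS * num_gpus


def max_tokens_per_optimizer_step(num_gpus: int) -> int:
    """Upper bound on tokens per optimizer step.

    With sample_packing and pad_to_sequence_len enabled, each sequence
    holds at most SEQUENCE_LEN tokens.
    """
    return sequences_per_optimizer_step(num_gpus) * SEQUENCE_LEN


def cosine_lr(step: int, total_steps: int) -> float:
    """Learning rate at a given optimizer step: linear warmup, then cosine decay.

    total_steps is a hypothetical input; the schedule shape is a generic
    assumption, not axolotl's implementation.
    """
    if step < WARMUP_STEPS:
        return LEARNING_RATE * step / WARMUP_STEPS
    progress = (step - WARMUP_STEPS) / max(1, total_steps - WARMUP_STEPS)
    progress = min(1.0, progress)
    return LEARNING_RATE * 0.5 * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    # Assuming a single GPU, purely for illustration.
    print(sequences_per_optimizer_step(num_gpus=1))
    print(max_tokens_per_optimizer_step(num_gpus=1))
    # The peak learning rate is reached at the end of warmup.
    print(cosine_lr(step=WARMUP_STEPS, total_steps=100))
```

Under the single-GPU assumption, each optimizer step covers 3 × 8 = 24 packed sequences, or at most 24 × 16384 = 393216 tokens; multiply by the real GPU count to get the actual figures.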

📚 Detailed documentation

Model information

Attribute	Details
Base model	alpindale/Mistral-7B-v0.2-hf
Model type	MistralForCausalLM
Tokenizer type	LlamaTokenizer
Mistral-derived model	true
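
Because the config sets chat_template: chatml, with <|im_start|> as an added token and <|im_end|> as the EOS token, prompts for this model would plausibly follow the ChatML layout. The sketch below builds such a prompt string by hand. The function name `build_chatml_prompt` and the example system message are hypothetical, and the exact whitespace expected by the published tokenizer is an assumption; if the tokenizer ships its own chat template, applying that through the tokenizer is the safer route.

```python
from typing import Dict, List

IM_START = "<|im_start|>"  # listed under `tokens` in the config
IM_END = "<|im_end|>"      # configured as the EOS token


def build_chatml_prompt(
    messages: List[Dict[str, str]],
    add_generation_prompt: bool = True,
) -> str:
    """Render a list of {"role", "content"} messages in ChatML layout.

    Hypothetical helper for illustration; not part of the model repository.
    """
    parts = []
    for message in messages:
        parts.append(f"{IM_START}{message['role']}\n{message['content']}{IM_END}\n")
    if add_generation_prompt:
        # Open an assistant turn so the model continues with its reply.
        parts.append(f"{IM_START}assistant\n")
    return "".join(parts)


if __name__ == "__main__":
    conversation = [
        # The system message is an invented example, not taken from the dataset.
        {"role": "system", "content": "Jesteś pomocną asystentką o imieniu Samantha."},
        {"role": "user", "content": "Cześć! Jak się masz?"},
    ]
    print(build_chatml_prompt(conversation))
```

Generation should stop when the model emits <|im_end|>, which is why the config registers it as the EOS token.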