# 🚀 Pre-trained Language Model Merge
This repository contains a merge of pre-trained language models created with [mergekit](https://github.com/arcee-ai/mergekit). By combining several pre-trained models, the merge aims to improve performance and broaden applicability across natural language processing tasks.
## Base Model Information
| Attribute | Details |
| --- | --- |
| Base models | DavidAU/MN-GRAND-Gutenberg-Lyra4-Lyra-12B-DARKNESS, mistralai/Mistral-Nemo-Base-2407, mistralai/Mistral-Nemo-Instruct-2407, redrix/sororicide-12B-Farer-Mell-Unslop, mergekit-community/MN-Chthonia-12B, yamatazen/EtherealAurora-12B-v2, mergekit-community/MN-Anathema-12B, mergekit-community/MN-Ephemeros-12B, jtatman/mistral_nemo_12b_reasoning_psychology_lora |
| Library | transformers |
| Tags | mergekit, merge |
## 🚀 Merge Details

### Merge Method

This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method, with [mistralai/Mistral-Nemo-Instruct-2407](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407) as the base model.
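A merge defined by a mergekit YAML file (see the configuration below) can be run either with the `mergekit-yaml` command-line tool or through mergekit's Python interface. The following is a minimal, non-authoritative sketch of the latter; it assumes the configuration from this card has been saved as `config.yaml`, and `./merged-model` is a placeholder output path.

```python
# Minimal sketch of running a merge through mergekit's Python interface.
# Assumes the YAML configuration from this card is saved as config.yaml;
# "./merged-model" is a placeholder output directory.
import yaml

from mergekit.config import MergeConfiguration
from mergekit.merge import MergeOptions, run_merge

with open("config.yaml", "r", encoding="utf-8") as f:
    merge_config = MergeConfiguration.model_validate(yaml.safe_load(f))

run_merge(
    merge_config,
    "./merged-model",
    options=MergeOptions(
        cuda=False,            # set True to merge on a GPU
        copy_tokenizer=True,   # materialize the union tokenizer defined below
        lazy_unpickle=True,    # lower peak memory while reading shards
    ),
)
```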
### Models Merged

The following models were included in the merge:
- [DavidAU/MN-GRAND-Gutenberg-Lyra4-Lyra-12B-DARKNESS](https://huggingface.co/DavidAU/MN-GRAND-Gutenberg-Lyra4-Lyra-12B-DARKNESS)
- [mistralai/Mistral-Nemo-Base-2407](https://huggingface.co/mistralai/Mistral-Nemo-Base-2407)
- [redrix/sororicide-12B-Farer-Mell-Unslop](https://huggingface.co/redrix/sororicide-12B-Farer-Mell-Unslop)
- [mergekit-community/MN-Chthonia-12B](https://huggingface.co/mergekit-community/MN-Chthonia-12B)
- [yamatazen/EtherealAurora-12B-v2](https://huggingface.co/yamatazen/EtherealAurora-12B-v2)
- [mergekit-community/MN-Anathema-12B](https://huggingface.co/mergekit-community/MN-Anathema-12B)
- [mergekit-community/MN-Ephemeros-12B](https://huggingface.co/mergekit-community/MN-Ephemeros-12B) + jtatman/mistral_nemo_12b_reasoning_psychology_lora
### Configuration

The following YAML configuration was used to produce this model:
```yaml
dtype: float32
out_dtype: bfloat16
merge_method: model_stock
base_model: mistralai/Mistral-Nemo-Instruct-2407
models:
  - model: DavidAU/MN-GRAND-Gutenberg-Lyra4-Lyra-12B-DARKNESS
  - model: mergekit-community/MN-Anathema-12B
  - model: mergekit-community/MN-Chthonia-12B
  - model: mergekit-community/MN-Ephemeros-12B+jtatman/mistral_nemo_12b_reasoning_psychology_lora
    parameters:
      weight: 0.7
  - model: mistralai/Mistral-Nemo-Base-2407
    parameters:
      weight: 0.5
  - model: redrix/sororicide-12B-Farer-Mell-Unslop
  - model: yamatazen/EtherealAurora-12B-v2
    parameters:
      weight: 1.4
tokenizer:
  source: union
  tokens:
    "<|im_start|>":
      source: yamatazen/EtherealAurora-12B-v2
    "<|im_end|>":
      source: yamatazen/EtherealAurora-12B-v2
    "[INST]":
      source: mistralai/Mistral-Nemo-Instruct-2407
    "[/INST]":
      source: mistralai/Mistral-Nemo-Instruct-2407
chat_template: chatml
```
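Because the configuration sets `chat_template: chatml`, the merged model expects ChatML-formatted prompts wrapped in the `<|im_start|>`/`<|im_end|>` tokens sourced from EtherealAurora. Below is a minimal inference sketch with 🤗 Transformers; the model path `./merged-model` is a placeholder for wherever the merged weights end up.

```python
# Minimal inference sketch; "./merged-model" is a placeholder path or repo id.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_path = "./merged-model"
tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(
    model_path,
    torch_dtype=torch.bfloat16,  # matches out_dtype in the config above
    device_map="auto",
)

# chat_template: chatml wraps each turn in <|im_start|> ... <|im_end|>.
messages = [{"role": "user", "content": "Explain model merging in one paragraph."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(input_ids, max_new_tokens=128)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```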