Qwen3-4B-NexusPrime開源AI模型 - 多領域高性能推理編程難題輕鬆搞定

首頁

Qwen3 4B NexusPrime

由ZeroXClem開發

高性能、多領域AI模型，採用MergeKit的Model Stock融合技術構建，整合了多個精調優化的Qwen3-4B模型，在結構化輸出和技術應用場景中展現出卓越的推理、編程及多步驟問題解決能力。

大型語言模型

Transformers

英語開源協議:Apache-2.0 #多步驟推理 #跨語言編程 #結構化輸出

下載量 24

發布時間 : 5/12/2025

模型概述

ZeroXClem-Qwen3-4B-NexusPrime是一款高性能、多領域AI模型，適配標準Qwen3對話模板，適合推理、編程及多步驟問題解決。

模型特點

高級符號推理

結合QWQ與iCoT技術實現多步驟數學求解

高效代碼生成

支持多編程語言的邏輯密集型任務

跨領域靈活性

無縫切換STEM/技術文檔/結構化推理場景

多語言理解

基於多樣化數據集訓練，支持技術文檔跨語言轉換

部署友好

適配中端GPU，兼顧小團隊與大規部署需求

模型能力

文本生成

符號推理

代碼生成

多語言理解

結構化輸出

多步驟問題解決

使用案例

教育

數學問題求解

解決複雜的數學問題，包括符號運算與邏輯任務

高精度推理，輸出LaTeX/JSON/Markdown格式結果

技術教育

支持多語言技術教育場景，如Python/C++等語言的邏輯任務

高效處理多語言邏輯任務

開發

代碼生成

生成Python/JavaScript/C++等語言的代碼

優化JSON/Markdown/YAML等結構化輸出

技術文檔生成

生成技術文檔，支持多種格式輸出

高效生成結構化技術文檔

🚀 ZeroXClem-Qwen3-4B-NexusPrime

ZeroXClem-Qwen3-4B-NexusPrime 是一個高性能、多領域的人工智能模型，它使用 Model Stock 方法，藉助 MergeKit 工具將多個模型融合而成。該模型融合了多個經過精細調優的 Qwen3-4B 模型，具備出色的推理、編碼以及多步驟問題解決能力，尤其適用於結構化輸出和技術應用場景。

✅ 此模型使用默認的 Qwen3 聊天模板效果最佳，具體使用方法可參考下面的 Ollama 模型卡說明。

🚀 快速開始

🦙 使用 Ollama 快速運行

若要使用 Ollama 快速運行此模型，可使用以下命令：

ollama run hf.co/ZeroXClem/Qwen3-4B-NexusPrime-Q4_K_M-GGUF

該命令會下載預量化的 GGUF 版本模型並在本地運行，無需大量配置即可輕鬆進行實驗。

🎯 最佳推理配置

若要實現最佳推理效果，可使用以下 Ollama 模型文件。將其保存為名為 Modelfile 的文件：

Ollama Modelfile

FROM hf.co/ZeroXClem/Qwen3-4B-NexusPrime-Q4_K_M-GGUF:latest
PARAMETER temperature 0.6
PARAMETER top_p 0.95
PARAMETER repeat_penalty 1.05
PARAMETER top_k 20
TEMPLATE """"{{- if .Messages }}
{{- if or .System .Tools }}<|im_start|>system
{{- if .System }}
{{ .System }}
{{- end }}
{{- if .Tools }}

# Tools

You may call one or more functions to assist with the user query.

You are provided with function signatures within <tools></tools> XML tags:
<tools>
{{- range .Tools }}
{"type": "function", "function": {{ .Function }}}
{{- end }}
</tools>

For each function call, return a json object with function name and arguments within <tool_call></tool_call> XML tags:
<tool_call>
{"name": <function-name>, "arguments": <args-json-object>}
</tool_call>
{{- end }}<|im_end|>
{{ end }}
{{- range $i, $_ := .Messages }}
{{- $last := eq (len (slice $.Messages $i)) 1 -}}
{{- if eq .Role "user" }}<|im_start|>user
{{ .Content }}<|im_end|>
{{ else if eq .Role "assistant" }}<|im_start|>assistant
{{ if .Content }}{{ .Content }}
{{- else if .ToolCalls }}<tool_call>
{{ range .ToolCalls }}{"name": "{{ .Function.Name }}", "arguments": {{ .Function.Arguments }}}
{{ end }}</tool_call>
{{- end }}{{ if not $last }}<|im_end|>
{{ end }}
{{- else if eq .Role "tool" }}<|im_start|>user
<tool_response>
{{ .Content }}
</tool_response><|im_end|>
{{ end }}
{{- if and (ne .Role "assistant") $last }}<|im_start|>assistant
{{ end }}
{{- end }}
{{- else }}
{{- if .System }}<|im_start|>system
{{ .System }}<|im_end|>
{{ end }}{{ if .Prompt }}<|im_start|>user
{{ .Prompt }}<|im_end|>
{{ end }}<|im_start|>assistant
{{ end }}{{ .Response }}{{ if .Response }}<|im_end|>{{ end }}"""
SYSTEM """# System Prompt: Universal Coder and DevOps Expert

You are an advanced AI assistant specializing in coding and DevOps. Your role is to provide expert guidance, code solutions, and best practices across a wide range of programming languages, frameworks, and DevOps tools. Your knowledge spans from low-level systems programming to high-level web development, cloud infrastructure, and everything in between.

## Key responsibilities:
1. Code analysis and optimization
2. Debugging and troubleshooting
3. Architecture design and system planning
4. Version Control best practices (Git)
5. Building from source, extracting binaries, and building packages & executeables including bash scripts.
6. Security and implementation and auditing
7. Performance review, and code analysis with practical suggestions in fully functioning syntax.

Be VERY selective on choosing how to respond based on the user query. If the above responsibilities don't apply then respond to the best of your ability with the given context to COMPLETELY satisfy the user query.

### Guidance
When assisting users:
- Provide clear, concise, and well-commented code examples
- Explain complrex concepts in simple terms
- Offer multiple solutions when applicable, highlighting pros and cons
- Prioritize security, efficiency, scalability, and maintainability in all suggestions
- Adapt your communication style for expert users.

### Helpful
Be EXTREMELY helpful, insightful, and lucid."""

你可以根據實際需求自定義 SYSTEM 以下的內容，該模型在處理技術任務方面表現出色。

保存好 Modelfile 後，在同一目錄下運行以下命令：

ollama create nexusprime -f ./Modelfile

💻 Python 使用示例

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "ZeroXClem-Qwen3-4B-NexusIntel"

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

prompt = "Explain the concept of entropy in thermodynamics in simple terms."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=200)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

✨ 主要特性

🔹 高級符號推理：結合了 QWQ 和 iCoT 的精度，可解決複雜的多步驟數學問題。
🔹 高效代碼生成：支持多種編程語言，能夠處理邏輯密集型任務。
🔹 多領域靈活性：可在 STEM、技術文檔和結構化推理等領域之間無縫切換。
🔹 多語言支持：基於多樣化的數據集進行訓練，具備跨語言理解和技術翻譯能力。
🔹 可擴展性優化：適用於中端 GPU，便於小團隊使用和大規模部署。

📦 安裝指南

文檔未提及安裝步驟，暫不提供相關內容。

📚 詳細文檔

🔧 合併配置

合併方法：model_stock
基礎模型：prithivMLmods/Cetus-Qwen3_4B-GeneralThought
數據類型：bfloat16
分詞器來源：prithivMLmods/Cetus-Qwen3_4B-GeneralThought

📝 配置文件

name: ZeroXClem-Qwen3-4B-NexusPrime
base_model: prithivMLmods/Cetus-Qwen3_4B-GeneralThought
dtype: bfloat16
merge_method: model_stock
models:
  - model: prithivMLmods/Tureis-Qwen3_QWQ-4B-Exp
  - model: prithivMLmods/Canum-Qwen3_R1-4B-iCoT
  - model: prithivMLmods/Bootes-Qwen3_Coder-Reasoning
  - model: prithivMLmods/Segue-Qwen3_DeepScaleR-Preview
tokenizer_source: prithivMLmods/Cetus-Qwen3_4B-GeneralThought