Qwen3-4B-NexusPrime开源AI模型 - 多领域高性能推理编程难题轻松搞定

首页

Qwen3 4B NexusPrime

由 ZeroXClem 开发

高性能、多领域AI模型，采用MergeKit的Model Stock融合技术构建，整合了多个精调优化的Qwen3-4B模型，在结构化输出和技术应用场景中展现出卓越的推理、编程及多步骤问题解决能力。

大型语言模型

Transformers

英语开源协议:Apache-2.0 #多步骤推理 #跨语言编程 #结构化输出

下载量 24

发布时间 : 5/12/2025

模型简介

ZeroXClem-Qwen3-4B-NexusPrime是一款高性能、多领域AI模型，适配标准Qwen3对话模板，适合推理、编程及多步骤问题解决。

模型特点

高级符号推理

结合QWQ与iCoT技术实现多步骤数学求解

高效代码生成

支持多编程语言的逻辑密集型任务

跨领域灵活性

无缝切换STEM/技术文档/结构化推理场景

多语言理解

基于多样化数据集训练，支持技术文档跨语言转换

部署友好

适配中端GPU，兼顾小团队与大规部署需求

模型能力

文本生成

符号推理

代码生成

多语言理解

结构化输出

多步骤问题解决

使用案例

教育

数学问题求解

解决复杂的数学问题，包括符号运算与逻辑任务

高精度推理，输出LaTeX/JSON/Markdown格式结果

技术教育

支持多语言技术教育场景，如Python/C++等语言的逻辑任务

高效处理多语言逻辑任务

开发

代码生成

生成Python/JavaScript/C++等语言的代码

优化JSON/Markdown/YAML等结构化输出

技术文档生成

生成技术文档，支持多种格式输出

高效生成结构化技术文档

🚀 ZeroXClem-Qwen3-4B-NexusPrime

ZeroXClem-Qwen3-4B-NexusPrime 是一个高性能、多领域的人工智能模型，它使用 Model Stock 方法，借助 MergeKit 工具将多个模型融合而成。该模型融合了多个经过精细调优的 Qwen3-4B 模型，具备出色的推理、编码以及多步骤问题解决能力，尤其适用于结构化输出和技术应用场景。

✅ 此模型使用默认的 Qwen3 聊天模板效果最佳，具体使用方法可参考下面的 Ollama 模型卡说明。

🚀 快速开始

🦙 使用 Ollama 快速运行

若要使用 Ollama 快速运行此模型，可使用以下命令：

ollama run hf.co/ZeroXClem/Qwen3-4B-NexusPrime-Q4_K_M-GGUF

该命令会下载预量化的 GGUF 版本模型并在本地运行，无需大量配置即可轻松进行实验。

🎯 最佳推理配置

若要实现最佳推理效果，可使用以下 Ollama 模型文件。将其保存为名为 Modelfile 的文件：

Ollama Modelfile

FROM hf.co/ZeroXClem/Qwen3-4B-NexusPrime-Q4_K_M-GGUF:latest
PARAMETER temperature 0.6
PARAMETER top_p 0.95
PARAMETER repeat_penalty 1.05
PARAMETER top_k 20
TEMPLATE """"{{- if .Messages }}
{{- if or .System .Tools }}<|im_start|>system
{{- if .System }}
{{ .System }}
{{- end }}
{{- if .Tools }}

# Tools

You may call one or more functions to assist with the user query.

You are provided with function signatures within <tools></tools> XML tags:
<tools>
{{- range .Tools }}
{"type": "function", "function": {{ .Function }}}
{{- end }}
</tools>

For each function call, return a json object with function name and arguments within <tool_call></tool_call> XML tags:
<tool_call>
{"name": <function-name>, "arguments": <args-json-object>}
</tool_call>
{{- end }}<|im_end|>
{{ end }}
{{- range $i, $_ := .Messages }}
{{- $last := eq (len (slice $.Messages $i)) 1 -}}
{{- if eq .Role "user" }}<|im_start|>user
{{ .Content }}<|im_end|>
{{ else if eq .Role "assistant" }}<|im_start|>assistant
{{ if .Content }}{{ .Content }}
{{- else if .ToolCalls }}<tool_call>
{{ range .ToolCalls }}{"name": "{{ .Function.Name }}", "arguments": {{ .Function.Arguments }}}
{{ end }}</tool_call>
{{- end }}{{ if not $last }}<|im_end|>
{{ end }}
{{- else if eq .Role "tool" }}<|im_start|>user
<tool_response>
{{ .Content }}
</tool_response><|im_end|>
{{ end }}
{{- if and (ne .Role "assistant") $last }}<|im_start|>assistant
{{ end }}
{{- end }}
{{- else }}
{{- if .System }}<|im_start|>system
{{ .System }}<|im_end|>
{{ end }}{{ if .Prompt }}<|im_start|>user
{{ .Prompt }}<|im_end|>
{{ end }}<|im_start|>assistant
{{ end }}{{ .Response }}{{ if .Response }}<|im_end|>{{ end }}"""
SYSTEM """# System Prompt: Universal Coder and DevOps Expert

You are an advanced AI assistant specializing in coding and DevOps. Your role is to provide expert guidance, code solutions, and best practices across a wide range of programming languages, frameworks, and DevOps tools. Your knowledge spans from low-level systems programming to high-level web development, cloud infrastructure, and everything in between.

## Key responsibilities:
1. Code analysis and optimization
2. Debugging and troubleshooting
3. Architecture design and system planning
4. Version Control best practices (Git)
5. Building from source, extracting binaries, and building packages & executeables including bash scripts.
6. Security and implementation and auditing
7. Performance review, and code analysis with practical suggestions in fully functioning syntax.

Be VERY selective on choosing how to respond based on the user query. If the above responsibilities don't apply then respond to the best of your ability with the given context to COMPLETELY satisfy the user query.

### Guidance
When assisting users:
- Provide clear, concise, and well-commented code examples
- Explain complrex concepts in simple terms
- Offer multiple solutions when applicable, highlighting pros and cons
- Prioritize security, efficiency, scalability, and maintainability in all suggestions
- Adapt your communication style for expert users.

### Helpful
Be EXTREMELY helpful, insightful, and lucid."""

你可以根据实际需求自定义 SYSTEM 以下的内容，该模型在处理技术任务方面表现出色。

保存好 Modelfile 后，在同一目录下运行以下命令：

ollama create nexusprime -f ./Modelfile

💻 Python 使用示例

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "ZeroXClem-Qwen3-4B-NexusIntel"

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

prompt = "Explain the concept of entropy in thermodynamics in simple terms."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=200)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

✨ 主要特性

🔹 高级符号推理：结合了 QWQ 和 iCoT 的精度，可解决复杂的多步骤数学问题。
🔹 高效代码生成：支持多种编程语言，能够处理逻辑密集型任务。
🔹 多领域灵活性：可在 STEM、技术文档和结构化推理等领域之间无缝切换。
🔹 多语言支持：基于多样化的数据集进行训练，具备跨语言理解和技术翻译能力。
🔹 可扩展性优化：适用于中端 GPU，便于小团队使用和大规模部署。

📦 安装指南

文档未提及安装步骤，暂不提供相关内容。

📚 详细文档

🔧 合并配置

合并方法：model_stock
基础模型：prithivMLmods/Cetus-Qwen3_4B-GeneralThought
数据类型：bfloat16
分词器来源：prithivMLmods/Cetus-Qwen3_4B-GeneralThought

📝 配置文件

name: ZeroXClem-Qwen3-4B-NexusPrime
base_model: prithivMLmods/Cetus-Qwen3_4B-GeneralThought
dtype: bfloat16
merge_method: model_stock
models:
  - model: prithivMLmods/Tureis-Qwen3_QWQ-4B-Exp
  - model: prithivMLmods/Canum-Qwen3_R1-4B-iCoT
  - model: prithivMLmods/Bootes-Qwen3_Coder-Reasoning
  - model: prithivMLmods/Segue-Qwen3_DeepScaleR-Preview
tokenizer_source: prithivMLmods/Cetus-Qwen3_4B-GeneralThought