開源Qwen3-8B-Esper3代碼專家模型 - 本地與服務器推理免費可用

首頁

Qwen3 8B Esper3

由ValiantLabs開發

埃斯佩爾3是基於千問3構建的代碼、架構和開發運維推理專家模型，適用於本地和服務器推理。

大型語言模型

Transformers

支持多種語言開源協議:Apache-2.0 #代碼推理專家 #雲架構設計 #多語言編程支持

下載量 83

發布時間 : 5/5/2025

模型概述

埃斯佩爾3是基於千問3構建的代碼、架構和開發運維推理專家模型，通過微調開發運維和架構推理數據增強其問題解決能力。

模型特點

代碼與開發運維推理

專注於代碼生成、架構設計和開發運維任務，支持多種編程語言和雲平臺。

通用推理增強

通過微調通用和創意推理數據，提升問題解決和一般聊天表現。

高效推理

小模型尺寸允許在本地桌面和移動設備上運行，以及超快的服務器推理。

模型能力

文本生成

代碼生成

架構設計

開發運維任務

問題解決

一般聊天

使用案例

開發運維

Terraform配置生成

生成使用aws_ami數據源查找最新Amazon Linux 2 AMI的Terraform配置。

動態確定AMI ID並配置EC2實例。

代碼生成

Python腳本編寫

生成Python腳本以自動化常見開發任務。

高效完成代碼編寫任務。

🚀 Esper 3：基於Qwen 3的編碼與推理專家模型

Esper 3是基於Qwen 3構建的模型，在編碼、架構設計和DevOps推理方面表現出色。它經過精心微調，能有效解決各類問題，無論是在本地桌面、移動設備，還是服務器上，都能提供出色的性能。

🚀 快速開始

支持開源項目：支持我們的開源數據集和模型發佈！
模型版本：Esper 3有不同的版本可供選擇，包括 [Qwen3 - 4B](https://huggingface.co/ValiantLabs/Qwen3 - 4B - Esper3)、[Qwen3 - 8B](https://huggingface.co/ValiantLabs/Qwen3 - 8B - Esper3) 和 [Qwen3 - 14B](https://huggingface.co/ValiantLabs/Qwen3 - 14B - Esper3)。

✨ 主要特性

精細微調：在使用Deepseek R1生成的 [DevOps和架構推理](https://huggingface.co/datasets/sequelbox/Titanium2.1 - DeepSeek - R1) 以及 [代碼推理](https://huggingface.co/datasets/sequelbox/Tachibana2 - DeepSeek - R1) 數據上進行了微調。
推理能力提升：改進了 [通用和創造性推理](https://huggingface.co/datasets/sequelbox/Raiden - DeepSeek - R1) 能力，增強了解決問題和日常對話的性能。
靈活部署：模型規模較小，支持在本地桌面和移動設備上運行，同時在服務器上推理速度極快。

📦 安裝指南

文檔未提及具體安裝步驟，故跳過此章節。

💻 使用示例

基礎用法

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "ValiantLabs/Qwen3-8B-Esper3"

# load the tokenizer and the model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)

# prepare the model input
prompt = "Write a Terraform configuration that uses the `aws_ami` data source to find the latest Amazon Linux 2 AMI. Then, provision an EC2 instance using this dynamically determined AMI ID."
messages = [
    {"role": "user", "content": prompt}
]
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True,
    enable_thinking=True # Switches between thinking and non-thinking modes. Default is True.
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

# conduct text completion
generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=32768
)
output_ids = generated_ids[0][len(model_inputs.input_ids[0]):].tolist() 

# parsing thinking content
try:
    # rindex finding 151668 (</think>)
    index = len(output_ids) - output_ids[::-1].index(151668)
except ValueError:
    index = 0

thinking_content = tokenizer.decode(output_ids[:index], skip_special_tokens=True).strip("\n")
content = tokenizer.decode(output_ids[index:], skip_special_tokens=True).strip("\n")

print("thinking content:", thinking_content)
print("content:", content)

📚 詳細文檔

Esper 3使用 [Qwen 3](https://huggingface.co/Qwen/Qwen3 - 8B) 的提示格式。作為推理微調模型，建議在所有對話中啟用 enable_thinking = True。

🔧 技術細節

文檔未提供具體技術實現細節，故跳過此章節。

📄 許可證

本項目採用 apache - 2.0 許可證。

其他信息

數據集：模型基於以下數據集進行訓練：
- [sequelbox/Titanium2.1 - DeepSeek - R1](https://huggingface.co/datasets/sequelbox/Titanium2.1 - DeepSeek - R1)
- [sequelbox/Tachibana2 - DeepSeek - R1](https://huggingface.co/datasets/sequelbox/Tachibana2 - DeepSeek - R1)
- [sequelbox/Raiden - DeepSeek - R1](https://huggingface.co/datasets/sequelbox/Raiden - DeepSeek - R1)
模型創建者：Esper 3由 Valiant Labs 創建。
更多模型：查看我們的HuggingFace頁面，瞭解所有模型！

![image/jpeg](https://cdn - uploads.huggingface.co/production/uploads/64f267a8a4f79a118e0fcc89/qdicXwrO_XOKRTjOu2yBF.jpeg) ![image/jpeg](https://cdn - uploads.huggingface.co/production/uploads/63444f2687964b331809eb55/VCJ8Fmefd8cdVhXSSxJiD.jpeg)