FastApply-1.5B-v1.0 Open-Source Code Model: Instant Code Application with Full-File Editing Support


Fastapply 1.5B V1.0

Developed by Kortix

FastApply-1.5B-v1.0 is a 1.5B-parameter model designed for instant code application. It generates full-file edits and powers SoftGen AI.

Large language model

Transformers

Language: English · License: Apache-2.0 · Tags: #instant-code-editing #high-throughput #full-file-generation

Downloads: 13.17k

Release date: 10/18/2024

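Model overview

The model is based on the Qwen2.5 Coder architecture and fine-tuned for code editing, targeting fast, accurate code modifications.

Model features

High throughput

Runs at approximately 340 tokens per second on fast inference providers while maintaining high edit accuracy.

Full-file editing

Generates complete file edits, which suits instant code application tasks.

Fast deployment

Designed for fast inference providers such as Fireworks, and suitable for local tools that reduce the cost of frontier model output.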

Model capabilities

Code generation

Code editing

File updates

Code merging

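Use cases

AI-powered code editors

Instant code application

Handles instant code application tasks inside AI-powered code editors such as Aider and PearAI.

Result: high edit accuracy and high throughput.

Local tools

Cost optimization

Runs as a local tool to reduce the cost of frontier model output.

Result: an efficient, economical code editing solution.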

🚀 FastApply-1.5B-v1.0

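FastApply-1.5B-v1.0 is a 1.5B-parameter model designed for instant code application. It produces full-file edits to power SoftGen AI. It is part of the Fast Apply pipeline for data generation and fine-tuning of Qwen2.5 Coder models.

GitHub: kortix-ai/fast-apply
Dataset: Kortix/FastApply-dataset-v1.0
Try it now on 👉 Google Colab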

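📚 Documentation

Basic information

Property	Details
Developer	Kortix
License	apache-2.0
Fine-tuned from	unsloth/Qwen2.5-Coder-1.5B-Instruct-bnb-4bit

Model description

FastApply-1.5B-v1.0 is a 1.5B-parameter model designed for instant code application. It produces full-file edits to power SoftGen AI. It is part of the Fast Apply pipeline for data generation and fine-tuning of Qwen2.5 Coder models.

When deployed on fast inference providers such as Fireworks, the model achieves high throughput, approximately 340 tokens per second, while maintaining high edit accuracy.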
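To put the throughput figure in perspective, the sketch below estimates how long a full-file rewrite would take at roughly 340 tokens per second. The file sizes are hypothetical, and the estimate ignores prompt processing and network overhead, so treat it as a lower bound rather than a benchmark.

```python
# Rough decode-time estimate for a full-file rewrite at a fixed generation speed.
TOKENS_PER_SECOND = 340  # approximate figure quoted in the model description


def estimated_seconds(output_tokens: int, tokens_per_second: float = TOKENS_PER_SECOND) -> float:
    """Seconds needed to generate `output_tokens` tokens (decode time only)."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return output_tokens / tokens_per_second


# Hypothetical file sizes, measured in output tokens.
for n in (500, 2000, 8000):
    print(f"{n} tokens: about {estimated_seconds(n):.1f} s")
```

Because the model emits the whole file rather than a patch, generation time grows linearly with file length.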

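🎯 Intended use

FastApply-1.5B-v1.0 is intended for AI-powered code editors and tools that need fast, accurate code modifications. It is particularly well suited to:

Instant code application tasks
Full-file edits
Integration with AI-powered code editors such as Aider and PearAI
Local tools that reduce the cost of frontier model output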

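💻 Inference template

FastApply-1.5B-v1.0 is based on the Qwen2.5 Coder architecture and fine-tuned for code editing tasks. It expects the following prompt structure at inference time: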

<|im_start|>system
You are a coding assistant that helps merge code updates, ensuring every modification is fully integrated.<|im_end|>
<|im_start|>user
Merge all changes from the <update> snippet into the <code> below.
- Preserve the code's structure, order, comments, and indentation exactly.
- Output only the updated code, enclosed within <updated-code> and </updated-code> tags.
- Do not include any additional text, explanations, placeholders, ellipses, or code fences.

<code>{original_code}</code>

<update>{update_snippet}</update>

Provide the complete updated code.<|im_end|>
<|im_start|>assistant

The model's output is structured as follows:

<updated-code>[Full-complete updated file]</updated-code>
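Since generation can be truncated or the tags can be missing, it may help to parse the output defensively. The following is a minimal sketch, not part of the Fast Apply repository; the helper name `extract_updated_code` is hypothetical.

```python
import re
from typing import Optional

# Non-greedy match across newlines between the first pair of tags.
_UPDATED_CODE_RE = re.compile(r"<updated-code>(.*?)</updated-code>", re.DOTALL)


def extract_updated_code(response: str) -> Optional[str]:
    """Return the text inside the first <updated-code> pair, or None if the pair is absent."""
    match = _UPDATED_CODE_RE.search(response)
    return match.group(1) if match else None


sample = "<updated-code>print('hi')\n</updated-code><|im_end|>"
print(extract_updated_code(sample))
```

Returning `None` lets the caller keep the original file untouched when the model output is incomplete, for example when the closing tag was cut off by the generation length limit.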

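📖 Additional information

For more details on the Fast Apply pipeline, the data generation process, and deployment instructions, see the GitHub repository.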

💻 Usage example

Basic usage

To use the model, load it with the Hugging Face Transformers library:

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("Kortix/FastApply-1.5B-v1.0", device_map="auto")
tokenizer = AutoTokenizer.from_pretrained("Kortix/FastApply-1.5B-v1.0")

# Prepare your input following the prompt structure mentioned above
input_text = """<|im_start|>system
You are a coding assistant that helps merge code updates, ensuring every modification is fully integrated.<|im_end|>
<|im_start|>user
Merge all changes from the <update> snippet into the <code> below.
- Preserve the code's structure, order, comments, and indentation exactly.
- Output only the updated code, enclosed within <updated-code> and </updated-code> tags.
- Do not include any additional text, explanations, placeholders, ellipses, or code fences.

<code>{original_code}</code>

<update>{update_snippet}</update>

Provide the complete updated code.<|im_end|>
<|im_start|>assistant
"""

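# Example inputs (hypothetical placeholders; substitute your own file contents and edit)
original_code = "def add(a, b):\n    return a + b\n"
update_snippet = "def add(a: int, b: int) -> int:\n    return a + b\n"

input_text = input_text.format(
    original_code=original_code,
    update_snippet=update_snippet,
).strip()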

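# Generate the response (move the inputs to the device the model was placed on)
input_ids = tokenizer.encode(input_text, return_tensors="pt").to(model.device)
# max_length counts the prompt tokens plus the generated tokens
output = model.generate(input_ids, max_length=8192)

# Decode only the newly generated tokens, skipping the prompt
response = tokenizer.decode(output[0][len(input_ids[0]):])
print(response)

# Extract the updated code from the response
updated_code = response.split("<updated-code>")[1].split("</updated-code>")[0]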
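Because the model rewrites the entire file, it can be useful to review what actually changed before writing the result to disk. The sketch below uses Python's standard `difflib` for that; the helper name `review_diff`, the file name, and the sample strings are illustrative assumptions, not part of the model card.

```python
import difflib


def review_diff(original_code: str, updated_code: str, filename: str = "file") -> str:
    """Return a unified diff between the original and the merged file (empty if identical)."""
    diff = difflib.unified_diff(
        original_code.splitlines(keepends=True),
        updated_code.splitlines(keepends=True),
        fromfile=f"a/{filename}",
        tofile=f"b/{filename}",
    )
    return "".join(diff)


# Hypothetical before/after contents standing in for real model output.
before = "def add(a, b):\n    return a + b\n"
after = "def add(a: int, b: int) -> int:\n    return a + b\n"
print(review_diff(before, after, "math_utils.py"))
```

An empty diff means the model returned the file unchanged, which may indicate that the update was not merged.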