FastApply-7B-v1.0开源代码生成模型 - 免费部署支持完整文件编辑

首页

Fastapply 7B V1.0

由 Kortix 开发

FastApply-7B-v1.0是一个专为即时代码应用设计的70亿参数模型，能够生成完整文件编辑，为SoftGen AI提供核心支持。

大型语言模型

Transformers

英语开源协议:Apache-2.0 #即时代码编辑 #文件级修改 #高速推理

下载量 623

发布时间 : 10/18/2024

模型简介

该模型基于Qwen2.5 Coder架构，经过unsloth优化和SFT微调，专注于快速精准的代码修改任务，特别适合AI编程工具集成。

模型特点

高效代码编辑

专为即时代码应用设计，能生成完整文件编辑，保持代码结构、顺序、注释和缩进。

高速推理

在Fireworks等高速推理平台上部署时，模型能保持约150 tokens/秒的处理速度。

unsloth优化

基于unsloth优化的4bit量化版本，提高推理效率。

严格输出控制

使用特定模板确保输出格式规范，便于集成到自动化流程中。

模型能力

代码生成

代码编辑

文件级修改

保持代码结构

快速推理

使用案例

AI编程工具

即时代码应用

与Aider、PearAI等AI编程工具集成，提供实时代码修改功能。

高精度编辑，保持代码完整性

完整文件编辑

生成完整的文件级修改，而不仅仅是片段。

完整的文件更新内容

本地化开发

降低前沿模型使用成本

作为本地化工具开发的基础模型，减少对云端大模型的依赖。

降低使用成本，提高响应速度

🚀 FastApply-7B-v1.0

FastApply-7B-v1.0是一款专为即时代码应用设计的70亿参数模型，能够生成完整的文件编辑内容，为 SoftGen AI 提供强大支持。

Github: kortix-ai/fast-apply
数据集: Kortix/FastApply-dataset-v1.0
立即在 🖐️ Google Colab 中试用

🚀 快速开始

你可以使用Hugging Face Transformers库加载该模型：

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("Kortix/FastApply-7B-v1.0")
tokenizer = AutoTokenizer.from_pretrained("Kortix/FastApply-7B-v1.0")

# 按照上述提示结构准备输入
input_text = """<|im_start|>system
You are a coding assistant that helps merge code updates, ensuring every modification is fully integrated.<|im_end|>
<|im_start|>user
Merge all changes from the <update> snippet into the <code> below.
- Preserve the code's structure, order, comments, and indentation exactly.
- Output only the updated code, enclosed within <updated-code> and </updated-code> tags.
- Do not include any additional text, explanations, placeholders, ellipses, or code fences.

<code>{original_code}</code>

<update>{update_snippet}</update>

Provide the complete updated code.<|im_end|>
<|im_start|>assistant
"""

input_text = input_text.format(
    original_code=original_code,
    update_snippet=update_snippet,
).strip() 

# 生成响应
input_ids = tokenizer.encode(input_text, return_tensors="pt")
output = model.generate(input_ids, max_length=8192,)

response = tokenizer.decode(output[0][len(input_ids[0]):])
print(response)

# 从响应中提取更新后的代码
updated_code = response.split("<updated-code>")[1].split("</updated-code>")[0]

✨ 主要特性

专为即时代码应用设计：能够生成完整的文件编辑内容，为AI代码编辑提供强大支持。
高吞吐量与准确性：在像Fireworks这样的快速服务提供商上部署时，能实现高吞吐量，同时保持较高的编辑准确性，速度约为150个令牌/秒。
广泛适用性：适用于需要快速、准确代码修改的AI代码编辑器和工具，尤其适合即时代码应用任务、完整文件编辑等。

📦 安装指南

文档未提及安装步骤，暂不提供。

💻 使用示例

基础用法

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("Kortix/FastApply-7B-v1.0")
tokenizer = AutoTokenizer.from_pretrained("Kortix/FastApply-7B-v1.0")

# 按照上述提示结构准备输入
input_text = """<|im_start|>system
You are a coding assistant that helps merge code updates, ensuring every modification is fully integrated.<|im_end|>
<|im_start|>user
Merge all changes from the <update> snippet into the <code> below.
- Preserve the code's structure, order, comments, and indentation exactly.
- Output only the updated code, enclosed within <updated-code> and </updated-code> tags.
- Do not include any additional text, explanations, placeholders, ellipses, or code fences.

<code>{original_code}</code>

<update>{update_snippet}</update>

Provide the complete updated code.<|im_end|>
<|im_start|>assistant
"""

input_text = input_text.format(
    original_code=original_code,
    update_snippet=update_snippet,
).strip() 

# 生成响应
input_ids = tokenizer.encode(input_text, return_tensors="pt")
output = model.generate(input_ids, max_length=8192,)

response = tokenizer.decode(output[0][len(input_ids[0]):])
print(response)

# 从响应中提取更新后的代码
updated_code = response.split("<updated-code>")[1].split("</updated-code>")[0]

📚 详细文档

模型详情

基本信息

属性	详情
开发者	Kortix
许可证	apache-2.0
微调基础模型	unsloth/Qwen2.5-Coder-7B-Instruct-bnb-4bit

模型描述

FastApply-7B-v1.0是一个70亿参数的模型，专为即时代码应用而设计，可生成完整的文件编辑内容，为 SoftGen AI 提供支持。它是Fast Apply管道的一部分，用于数据生成和微调Qwen2.5 Coder模型。

该模型在像Fireworks这样的快速服务提供商上部署时，能实现高吞吐量，同时保持较高的编辑准确性，速度约为150个令牌/秒。

预期用途

FastApply-7B-v1.0旨在用于需要快速、准确代码修改的AI代码编辑器和工具。它特别适合以下场景：

即时代码应用任务
完整文件编辑
与Aider和PearAI等AI代码编辑器集成
本地工具，以降低前沿模型输出的成本

推理模板

FastApply-7B-v1.0基于Qwen2.5 Coder架构，并针对代码编辑任务进行了微调。它使用特定的提示结构进行推理：

<|im_start|>system
You are a coding assistant that helps merge code updates, ensuring every modification is fully integrated.<|im_end|>
<|im_start|>user
Merge all changes from the <update> snippet into the <code> below.
- Preserve the code's structure, order, comments, and indentation exactly.
- Output only the updated code, enclosed within <updated-code> and </updated-code> tags.
- Do not include any additional text, explanations, placeholders, ellipses, or code fences.

<code>{original_code}</code>

<update>{update_snippet}</update>

Provide the complete updated code.<|im_end|>
<|im_start|>assistant

模型的输出结构为：

<updated-code>[完整的更新文件]</updated-code>

额外信息

有关Fast Apply管道、数据生成过程和部署说明的更多详细信息，请参考 GitHub仓库。

🔧 技术细节

文档未提及技术实现细节，暂不提供。

📄 许可证

该模型使用的许可证为apache-2.0。

评估

image/png

精选推荐AI模型

Llama 3 Typhoon V1.5x 8b Instruct

专为泰语设计的80亿参数指令模型，性能媲美GPT-3.5-turbo，优化了应用场景、检索增强生成、受限生成和推理任务

Cadet-Tiny是一个基于SODA数据集训练的超小型对话模型，专为边缘设备推理设计，体积仅为Cosmo-3B模型的2%左右。

Roberta Base Chinese Extractive Qa

基于RoBERTa架构的中文抽取式问答模型，适用于从给定文本中提取答案的任务。

智启未来，您的人工智能解决方案智库