gemma-2-2b-it-tool-think开源文本生成模型 - 支持工具调用与思考过程

首页

Gemma 2 2b It Tool Think

由 langdai 开发

基于google/gemma-2b-it微调的文本生成模型，支持工具调用思考过程

大型语言模型

Transformers

开源协议:MIT #工具调用优化 #轻量化微调 #多轮对话支持

下载量 36

发布时间 : 3/31/2025

模型简介

这是一个基于gemma-2b-it微调的独立模型，专注于文本生成任务，特别设计了工具调用时的思考过程输出能力。

模型特点

工具调用思考过程

在工具调用前会输出<think>思考过程</think>标签内容，增强可解释性

轻量化微调

仅经过1个周期的微调，保持基础模型能力的同时适应特定任务

函数调用支持

支持按照指定格式进行函数调用和参数传递

模型能力

文本生成

工具调用思考

函数调用响应

使用案例

金融工具

货币转换

根据用户请求调用货币转换函数

生成符合规范的函数调用JSON

地理服务

距离计算

计算两个地点之间的地理距离

生成包含位置参数的函数调用

🚀 语言模型微调项目

本项目是一个基于transformers库的文本生成模型微调项目，将基础模型与PEFT微调模型合并，形成独立可用的模型。该模型可用于文本生成任务，尤其在特定领域的文本输出上表现出色。

🚀 快速开始

使用以下代码开始使用该模型：

基础用法

from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from transformers import pipeline
import torch
model_id = "langdai/gemma-2-2b-it-tool-think"
model = AutoModelForCausalLM.from_pretrained(model_id,
                                             device_map="cuda:0",
                                            ) # For GPU
tokenizer = AutoTokenizer.from_pretrained(model_id)
# model.to(torch.bfloat16)
model.eval()
generator = pipeline("text-generation", model= model, tokenizer= tokenizer)

高级用法

prompt="""<bos><start_of_turn>human
You are a function calling AI model. You are provided with function signatures within <tools></tools> XML tags.You may call one or more functions to assist with the user query. Don't make assumptions about what values to plug into functions.Here are the available tools:<tools> [{'type': 'function', 'function': {'name': 'convert_currency', 'description': 'Convert from one currency to another', 'parameters': {'type': 'object', 'properties': {'amount': {'type': 'number', 'description': 'The amount to convert'}, 'from_currency': {'type': 'string', 'description': 'The currency to convert from'}, 'to_currency': {'type': 'string', 'description': 'The currency to convert to'}}, 'required': ['amount', 'from_currency', 'to_currency']}}}, {'type': 'function', 'function': {'name': 'calculate_distance', 'description': 'Calculate the distance between two locations', 'parameters': {'type': 'object', 'properties': {'start_location': {'type': 'string', 'description': 'The starting location'}, 'end_location': {'type': 'string', 'description': 'The ending location'}}, 'required': ['start_location', 'end_location']}}}] </tools>Use the following pydantic model json schema for each tool call you will make: {'title': 'FunctionCall', 'type': 'object', 'properties': {'arguments': {'title': 'Arguments', 'type': 'object'}, 'name': {'title': 'Name', 'type': 'string'}}, 'required': ['arguments', 'name']}For each function call return a json object with function name and arguments within <tool_call></tool_call> XML tags as follows:
<tool_call>
{tool_call}
</tool_call>Also, before making a call to a function take the time to plan the function to take. Make that thinking process between <think>{your thoughts}</think>

Hi, I need to convert 500 INR to Euros. Can you help me with that?<end_of_turn><eos>
<start_of_turn>model
<think>"""

output = generator([{"role": "user", "content": prompt}], max_new_tokens=512, return_full_text=False)[0]

print(output)