gemma-2-2b-it-tool-think開源文本生成模型 - 支持工具調用與思考過程

首頁

Gemma 2 2b It Tool Think

由langdai開發

基於google/gemma-2b-it微調的文本生成模型，支持工具調用思考過程

大型語言模型

Transformers

開源協議:MIT #工具調用優化 #輕量化微調 #多輪對話支持

下載量 36

發布時間 : 3/31/2025

模型概述

這是一個基於gemma-2b-it微調的獨立模型，專注於文本生成任務，特別設計了工具調用時的思考過程輸出能力。

模型特點

工具調用思考過程

在工具調用前會輸出<think>思考過程</think>標籤內容，增強可解釋性

輕量化微調

僅經過1個週期的微調，保持基礎模型能力的同時適應特定任務

函數調用支持

支持按照指定格式進行函數調用和參數傳遞

模型能力

文本生成

工具調用思考

函數調用響應

使用案例

金融工具

貨幣轉換

根據用戶請求調用貨幣轉換函數

生成符合規範的函數調用JSON

地理服務

距離計算

計算兩個地點之間的地理距離

生成包含位置參數的函數調用

🚀 語言模型微調項目

本項目是一個基於transformers庫的文本生成模型微調項目，將基礎模型與PEFT微調模型合併，形成獨立可用的模型。該模型可用於文本生成任務，尤其在特定領域的文本輸出上表現出色。

🚀 快速開始

使用以下代碼開始使用該模型：

基礎用法

from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from transformers import pipeline
import torch
model_id = "langdai/gemma-2-2b-it-tool-think"
model = AutoModelForCausalLM.from_pretrained(model_id,
                                             device_map="cuda:0",
                                            ) # For GPU
tokenizer = AutoTokenizer.from_pretrained(model_id)
# model.to(torch.bfloat16)
model.eval()
generator = pipeline("text-generation", model= model, tokenizer= tokenizer)

高級用法

prompt="""<bos><start_of_turn>human
You are a function calling AI model. You are provided with function signatures within <tools></tools> XML tags.You may call one or more functions to assist with the user query. Don't make assumptions about what values to plug into functions.Here are the available tools:<tools> [{'type': 'function', 'function': {'name': 'convert_currency', 'description': 'Convert from one currency to another', 'parameters': {'type': 'object', 'properties': {'amount': {'type': 'number', 'description': 'The amount to convert'}, 'from_currency': {'type': 'string', 'description': 'The currency to convert from'}, 'to_currency': {'type': 'string', 'description': 'The currency to convert to'}}, 'required': ['amount', 'from_currency', 'to_currency']}}}, {'type': 'function', 'function': {'name': 'calculate_distance', 'description': 'Calculate the distance between two locations', 'parameters': {'type': 'object', 'properties': {'start_location': {'type': 'string', 'description': 'The starting location'}, 'end_location': {'type': 'string', 'description': 'The ending location'}}, 'required': ['start_location', 'end_location']}}}] </tools>Use the following pydantic model json schema for each tool call you will make: {'title': 'FunctionCall', 'type': 'object', 'properties': {'arguments': {'title': 'Arguments', 'type': 'object'}, 'name': {'title': 'Name', 'type': 'string'}}, 'required': ['arguments', 'name']}For each function call return a json object with function name and arguments within <tool_call></tool_call> XML tags as follows:
<tool_call>
{tool_call}
</tool_call>Also, before making a call to a function take the time to plan the function to take. Make that thinking process between <think>{your thoughts}</think>

Hi, I need to convert 500 INR to Euros. Can you help me with that?<end_of_turn><eos>
<start_of_turn>model
<think>"""

output = generator([{"role": "user", "content": prompt}], max_new_tokens=512, return_full_text=False)[0]

print(output)