Granite-4-Tiny-Preview開源模型 - 支持通用指令跟隨任務免費部署

首頁

Granite 4.0 Tiny Preview

由ibm-granite開發

Granite-4-Tiny-Preview 是一個擁有70億參數的細粒度混合專家（MoE）指令微調模型，基於 Granite-4.0-Tiny-Base-Preview 開發，適用於通用指令跟隨任務。

大型語言模型

Transformers

開源協議:Apache-2.0 #混合專家指令模型 #多語言長文本處理 #邏輯推理增強

下載量 7,906

發布時間 : 4/30/2025

模型概述

該模型結合了開源指令數據集和內部合成的長上下文問題解決數據集，採用多種技術開發，包括監督微調和強化學習對齊，並採用結構化對話格式。

模型特點

混合專家架構

採用細粒度混合專家（MoE）架構，提高模型效率和性能。

多語言支持

支持12種語言，包括英語、中文、日語等，並可針對其他語言進一步微調。

長上下文處理

特別優化了長上下文任務處理能力，如長文檔摘要和問答。

指令跟隨

經過指令微調，能夠準確理解和執行復雜指令。

模型能力

思考推理

摘要生成

文本分類

文本提取

問答系統

檢索增強生成（RAG）

代碼相關任務

函數調用任務

多語言對話

長上下文任務處理

使用案例

商業應用

AI助手

集成到商業AI助手中，提供智能對話和任務支持。

教育

數學問題解答

解決複雜的數學問題，如濃度計算等。

內容處理

長文檔摘要

對長文檔或會議記錄進行高效摘要。

🚀 Granite-4.0-Tiny-Preview

Granite-4.0-Tiny-Preview是一個擁有70億參數的細粒度混合專家模型（MoE）指令模型。它基於Granite-4.0-Tiny-Base-Preview進行微調，結合了具有寬鬆許可的開源指令數據集和針對解決長上下文問題定製的內部收集合成數據集。該模型採用了多種技術進行開發，具備結構化的對話格式，包括監督微調以及使用強化學習進行模型對齊。

🚀 快速開始

要使用此檢查點，你需要從源代碼安裝transformers庫。

HuggingFace PR：https://github.com/huggingface/transformers/pull/37658
從源代碼安裝transformers：https://huggingface.co/docs/transformers/en/installation#install-from-source

安裝完成後，複製以下代碼片段來運行示例：

from transformers import AutoModelForCausalLM, AutoTokenizer, set_seed
import torch

model_path="ibm-granite/granite-4.0-tiny-preview"
device="cuda"
model = AutoModelForCausalLM.from_pretrained(
        model_path,
        device_map=device,
        torch_dtype=torch.bfloat16,
    )
tokenizer = AutoTokenizer.from_pretrained(
        model_path
)

conv = [{"role": "user", "content":"You have 10 liters of a 30% acid solution. How many liters of a 70% acid solution must be added to achieve a 50% acid mixture?"}]

input_ids = tokenizer.apply_chat_template(conv, return_tensors="pt", thinking=True, return_dict=True, add_generation_prompt=True).to(device)

set_seed(42)
output = model.generate(
    **input_ids,
    max_new_tokens=8192,
)

prediction = tokenizer.decode(output[0, input_ids["input_ids"].shape[1]:], skip_special_tokens=True)
print(prediction)

✨ 主要特性

多語言支持：支持英語、德語、西班牙語、法語、日語、葡萄牙語、阿拉伯語、捷克語、意大利語、韓語、荷蘭語和中文。用戶還可以針對這12種語言之外的語言對該模型進行微調。
廣泛的任務處理能力：能夠處理一般的指令跟隨任務，可集成到各個領域的AI助手，包括商業應用。具備思考、總結、文本分類、文本提取、問答、檢索增強生成（RAG）、代碼相關任務、函數調用任務、多語言對話用例以及長上下文任務（如長文檔/會議總結、長文檔問答等）能力。

📦 安裝指南

你需要從源代碼安裝transformers庫來使用此檢查點。

HuggingFace PR：https://github.com/huggingface/transformers/pull/37658
從源代碼安裝transformers：https://huggingface.co/docs/transformers/en/installation#install-from-source

💻 使用示例

基礎用法

from transformers import AutoModelForCausalLM, AutoTokenizer, set_seed
import torch

model_path="ibm-granite/granite-4.0-tiny-preview"
device="cuda"
model = AutoModelForCausalLM.from_pretrained(
        model_path,
        device_map=device,
        torch_dtype=torch.bfloat16,
    )
tokenizer = AutoTokenizer.from_pretrained(
        model_path
)

conv = [{"role": "user", "content":"You have 10 liters of a 30% acid solution. How many liters of a 70% acid solution must be added to achieve a 50% acid mixture?"}]

input_ids = tokenizer.apply_chat_template(conv, return_tensors="pt", thinking=True, return_dict=True, add_generation_prompt=True).to(device)

set_seed(42)
output = model.generate(
    **input_ids,
    max_new_tokens=8192,
)

prediction = tokenizer.decode(output[0, input_ids["input_ids"].shape[1]:], skip_special_tokens=True)
print(prediction)

📚 詳細文檔

評估結果

模型	Arena-Hard	AlpacaEval-2.0	MMLU	PopQA	TruthfulQA	BigBenchHard	DROP	GSM8K	HumanEval	HumanEval+	IFEval	AttaQ
Granite-3.3-2B-Instruct	28.86	43.45	55.88	18.4	58.97	52.51	35.98	72.48	80.51	75.68	65.8	87.47
Granite-3.3-8B-Instruct	57.56	62.68	65.54	26.17	66.86	59.01	41.53	80.89	89.73	86.09	74.82	88.5
Granite-4.0-Tiny-Preview	26.70	35.16	60.40	22.93	58.07	55.71	46.22	70.05	82.41	78.33	63.03	86.10