Granite-4-Tiny-Preview开源模型 - 支持通用指令跟随任务免费部署

首页

Granite 4.0 Tiny Preview

由 ibm-granite 开发

Granite-4-Tiny-Preview 是一个拥有70亿参数的细粒度混合专家（MoE）指令微调模型，基于 Granite-4.0-Tiny-Base-Preview 开发，适用于通用指令跟随任务。

大型语言模型

Transformers

开源协议:Apache-2.0 #混合专家指令模型 #多语言长文本处理 #逻辑推理增强

下载量 7,906

发布时间 : 4/30/2025

模型简介

该模型结合了开源指令数据集和内部合成的长上下文问题解决数据集，采用多种技术开发，包括监督微调和强化学习对齐，并采用结构化对话格式。

模型特点

混合专家架构

采用细粒度混合专家（MoE）架构，提高模型效率和性能。

多语言支持

支持12种语言，包括英语、中文、日语等，并可针对其他语言进一步微调。

长上下文处理

特别优化了长上下文任务处理能力，如长文档摘要和问答。

指令跟随

经过指令微调，能够准确理解和执行复杂指令。

模型能力

思考推理

摘要生成

文本分类

文本提取

问答系统

检索增强生成（RAG）

代码相关任务

函数调用任务

多语言对话

长上下文任务处理

使用案例

商业应用

AI助手

集成到商业AI助手中，提供智能对话和任务支持。

教育

数学问题解答

解决复杂的数学问题，如浓度计算等。

内容处理

长文档摘要

对长文档或会议记录进行高效摘要。

🚀 Granite-4.0-Tiny-Preview

Granite-4.0-Tiny-Preview是一个拥有70亿参数的细粒度混合专家模型（MoE）指令模型。它基于Granite-4.0-Tiny-Base-Preview进行微调，结合了具有宽松许可的开源指令数据集和针对解决长上下文问题定制的内部收集合成数据集。该模型采用了多种技术进行开发，具备结构化的对话格式，包括监督微调以及使用强化学习进行模型对齐。

🚀 快速开始

要使用此检查点，你需要从源代码安装transformers库。

HuggingFace PR：https://github.com/huggingface/transformers/pull/37658
从源代码安装transformers：https://huggingface.co/docs/transformers/en/installation#install-from-source

安装完成后，复制以下代码片段来运行示例：

from transformers import AutoModelForCausalLM, AutoTokenizer, set_seed
import torch

model_path="ibm-granite/granite-4.0-tiny-preview"
device="cuda"
model = AutoModelForCausalLM.from_pretrained(
        model_path,
        device_map=device,
        torch_dtype=torch.bfloat16,
    )
tokenizer = AutoTokenizer.from_pretrained(
        model_path
)

conv = [{"role": "user", "content":"You have 10 liters of a 30% acid solution. How many liters of a 70% acid solution must be added to achieve a 50% acid mixture?"}]

input_ids = tokenizer.apply_chat_template(conv, return_tensors="pt", thinking=True, return_dict=True, add_generation_prompt=True).to(device)

set_seed(42)
output = model.generate(
    **input_ids,
    max_new_tokens=8192,
)

prediction = tokenizer.decode(output[0, input_ids["input_ids"].shape[1]:], skip_special_tokens=True)
print(prediction)

✨ 主要特性

多语言支持：支持英语、德语、西班牙语、法语、日语、葡萄牙语、阿拉伯语、捷克语、意大利语、韩语、荷兰语和中文。用户还可以针对这12种语言之外的语言对该模型进行微调。
广泛的任务处理能力：能够处理一般的指令跟随任务，可集成到各个领域的AI助手，包括商业应用。具备思考、总结、文本分类、文本提取、问答、检索增强生成（RAG）、代码相关任务、函数调用任务、多语言对话用例以及长上下文任务（如长文档/会议总结、长文档问答等）能力。

📦 安装指南

你需要从源代码安装transformers库来使用此检查点。

HuggingFace PR：https://github.com/huggingface/transformers/pull/37658
从源代码安装transformers：https://huggingface.co/docs/transformers/en/installation#install-from-source

💻 使用示例

基础用法

from transformers import AutoModelForCausalLM, AutoTokenizer, set_seed
import torch

model_path="ibm-granite/granite-4.0-tiny-preview"
device="cuda"
model = AutoModelForCausalLM.from_pretrained(
        model_path,
        device_map=device,
        torch_dtype=torch.bfloat16,
    )
tokenizer = AutoTokenizer.from_pretrained(
        model_path
)

conv = [{"role": "user", "content":"You have 10 liters of a 30% acid solution. How many liters of a 70% acid solution must be added to achieve a 50% acid mixture?"}]

input_ids = tokenizer.apply_chat_template(conv, return_tensors="pt", thinking=True, return_dict=True, add_generation_prompt=True).to(device)

set_seed(42)
output = model.generate(
    **input_ids,
    max_new_tokens=8192,
)

prediction = tokenizer.decode(output[0, input_ids["input_ids"].shape[1]:], skip_special_tokens=True)
print(prediction)

📚 详细文档

评估结果

模型	Arena-Hard	AlpacaEval-2.0	MMLU	PopQA	TruthfulQA	BigBenchHard	DROP	GSM8K	HumanEval	HumanEval+	IFEval	AttaQ
Granite-3.3-2B-Instruct	28.86	43.45	55.88	18.4	58.97	52.51	35.98	72.48	80.51	75.68	65.8	87.47
Granite-3.3-8B-Instruct	57.56	62.68	65.54	26.17	66.86	59.01	41.53	80.89	89.73	86.09	74.82	88.5
Granite-4.0-Tiny-Preview	26.70	35.16	60.40	22.93	58.07	55.71	46.22	70.05	82.41	78.33	63.03	86.10