Phi-4-mini-instruct open-source lightweight model - multilingual support, built for long-context reasoning

Phi 4 Mini Instruct

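Developed by Microsoft

Phi-4-mini-instruct is a lightweight open model built on synthetic data and filtered public website data, with a focus on high-quality, reasoning-dense data. It supports a 128K-token context length and multilingual input.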

Large language model

Transformers

Multilingual | License: MIT | #LightweightInference #MultilingualInstruction #128KLongContext

Downloads: 346.30k

Released: 2/19/2025

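Model overview

The model was further trained with supervised fine-tuning and direct preference optimization (DPO) so that it follows instructions precisely and has robust safety measures. It is intended for commercial and research use, particularly in memory- or compute-constrained environments and in applications that need strong reasoning.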

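Model features

Lightweight and efficient

A lightweight design with 3.8B parameters, suited to environments with limited memory and compute.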
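
To make the "lightweight" claim concrete, the following back-of-the-envelope sketch estimates the memory needed just to hold 3.8B weights at a few common precisions. The bytes-per-parameter values are standard dtype sizes, not figures from the model card, and the estimate ignores activations, the KV cache, and framework overhead, so real usage will be higher.

```python
# Rough weight-memory estimate for a 3.8B-parameter model.
# Assumption: weights only; activations, KV cache, and runtime overhead are excluded.

PARAMS = 3.8e9  # parameter count stated above

# Bytes per parameter for common storage precisions (illustrative choices).
BYTES_PER_PARAM = {"float32": 4, "bfloat16": 2, "int8": 1, "int4": 0.5}


def weight_memory_gb(params: float, bytes_per_param: float) -> float:
    """Return the weight footprint in decimal gigabytes (1 GB = 1e9 bytes)."""
    return params * bytes_per_param / 1e9


for dtype, size in BYTES_PER_PARAM.items():
    print(f"{dtype:>8}: {weight_memory_gb(PARAMS, size):.1f} GB")
```

At bfloat16 this comes to about 7.6 GB for the weights alone.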

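Strong reasoning

Focused on mathematical and logical reasoning, with strong results on several benchmarks.

Multilingual support

Supports processing and understanding text in 23 languages.

Long-context handling

Supports a 128K-token context length, suitable for long documents and complex conversations.

Safety measures

Trained with direct preference optimization, which gives it robust safety protections.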

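Model capabilities

Text generation

Multilingual processing

Mathematical reasoning

Logical reasoning

Instruction following

Code generation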

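Use cases

Commercial applications

Customer service assistant

Handles multilingual customer inquiries and provides fast, accurate responses.

Improves customer satisfaction and reduces response time

Data analysis report generation

Automatically generates analysis reports from structured data.

Saves time spent writing reports by hand

Research applications

Mathematical problem solving

Used to solve mathematical problems and to verify mathematical conjectures.

Performs strongly on math benchmarks such as GSM8K

Code generation and completion

Helps programmers write and optimize code.

Performs well on code benchmarks such as HumanEval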

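🚀 Phi-4

Phi-4 is a family of models with several variants, including reasoning, multimodal-instruct, and mini-instruct versions, and it is also available in ONNX format. It is built on synthetic data and high-quality public website data, targets reasoning-intensive tasks, supports a 128K-token context length, and is intended for broad multilingual commercial and research use.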

🚀 Quick start

You can try the Phi-4 models through the following links:

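✨ Key features

Multilingual support: supports Arabic, Chinese, Czech, and many other languages.
Lightweight design: suited to memory- or compute-constrained environments and latency-sensitive scenarios.
Strong reasoning: performs well on mathematical and logical reasoning.
Long-context support: supports a 128K-token context length.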
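
As a small illustration of working within the context window, the sketch below computes how many tokens remain for generation once the prompt is counted. It assumes "128K" means 128 × 1024 = 131,072 tokens and that prompt and generated tokens share the same window. Both are assumptions, and `remaining_generation_budget` is a hypothetical helper, not part of any library.

```python
# Assumption: "128K" is read as 128 * 1024 tokens; confirm against the model config.
CONTEXT_WINDOW = 128 * 1024


def remaining_generation_budget(prompt_tokens: int, context_window: int = CONTEXT_WINDOW) -> int:
    """Return how many tokens can still be generated after the prompt.

    Raises ValueError if the prompt alone does not fit in the window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    if prompt_tokens > context_window:
        raise ValueError(
            f"prompt has {prompt_tokens} tokens but the window holds {context_window}"
        )
    return context_window - prompt_tokens


# Example: a 120,000-token document leaves 11,072 tokens for the answer.
print(remaining_generation_budget(120_000))
```

A value like this could be passed as the generation limit (for example `max_tokens` or `max_new_tokens`) so that a long prompt never overflows the window.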

📦 Installation guide

Inference with vLLM

Required packages

flash_attn==2.7.4.post1
torch==2.5.1
vllm>=0.7.3

Example code

from vllm import LLM, SamplingParams

llm = LLM(model="microsoft/Phi-4-mini-instruct", trust_remote_code=True)

messages = [
    {"role": "system", "content": "You are a helpful AI assistant."},
    {"role": "user", "content": "Can you provide ways to eat combinations of bananas and dragonfruits?"},
    {"role": "assistant", "content": "Sure! Here are some ways to eat bananas and dragonfruits together: 1. Banana and dragonfruit smoothie: Blend bananas and dragonfruits together with some milk and honey. 2. Banana and dragonfruit salad: Mix sliced bananas and dragonfruits together with some lemon juice and honey."},
    {"role": "user", "content": "What about solving a 2x + 3 = 7 equation?"},
]

sampling_params = SamplingParams(
  max_tokens=500,
  temperature=0.0,
)

output = llm.chat(messages=messages, sampling_params=sampling_params)
print(output[0].outputs[0].text)

Inference with Transformers

Required packages

flash_attn==2.7.4.post1
torch==2.5.1
transformers==4.49.0
accelerate==1.3.0
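
Because the versions above are pinned exactly, it can help to confirm what is installed before loading the model. The following is a minimal sketch using the standard-library `importlib.metadata`. `check_pins` is a hypothetical helper, and the distribution names in `PINS` are assumed to match the package names listed above.

```python
from importlib.metadata import PackageNotFoundError, version

# Pins copied from the list above; names are assumed to be the installed distribution names.
PINS = {
    "flash_attn": "2.7.4.post1",
    "torch": "2.5.1",
    "transformers": "4.49.0",
    "accelerate": "1.3.0",
}


def check_pins(pins, get_version=version):
    """Map each package to (wanted, installed_or_None, matches)."""
    report = {}
    for name, wanted in pins.items():
        try:
            installed = get_version(name)
        except PackageNotFoundError:
            installed = None  # package is not installed
        report[name] = (wanted, installed, installed == wanted)
    return report


for name, (wanted, installed, ok) in check_pins(PINS).items():
    status = "ok" if ok else "MISMATCH"
    print(f"{name}: wanted {wanted}, found {installed} [{status}]")
```

The comparison is an exact string match, so a build that reports a local version suffix (such as a CUDA tag) may show up as a mismatch even when the base version is the pinned one.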

Example code

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

torch.random.manual_seed(0)

model_path = "microsoft/Phi-4-mini-instruct"

model = AutoModelForCausalLM.from_pretrained(
    model_path,
    device_map="auto",
    torch_dtype="auto",
    trust_remote_code=True,
)
tokenizer = AutoTokenizer.from_pretrained(model_path)

messages = [
    {"role": "system", "content": "You are a helpful AI assistant."},
    {"role": "user", "content": "Can you provide ways to eat combinations of bananas and dragonfruits?"},
    {"role": "assistant", "content": "Sure! Here are some ways to eat bananas and dragonfruits together: 1. Banana and dragonfruit smoothie: Blend bananas and dragonfruits together with some milk and honey. 2. Banana and dragonfruit salad: Mix sliced bananas and dragonfruits together with some lemon juice and honey."},
    {"role": "user", "content": "What about solving a 2x + 3 = 7 equation?"},
]

pipe = pipeline(
    "text-generation",
    model=model,
    tokenizer=tokenizer,
)

generation_args = {
    "max_new_tokens": 500,
    "return_full_text": False,
    "temperature": 0.0,
    "do_sample": False,
}

output = pipe(messages, **generation_args)
print(output[0]['generated_text'])

💻 Usage examples

Input format

Chat format

<|system|>Insert System Message<|end|><|user|>Insert User Message<|end|><|assistant|>
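
The chat format above can be reproduced with plain string formatting, which is useful for inspecting exactly what the model receives. This is a minimal sketch that follows only the template shown above. `format_chat` is a hypothetical helper, and in practice the tokenizer's own chat template (for example via `tokenizer.apply_chat_template`) should be treated as authoritative.

```python
def format_chat(messages, add_generation_prompt=True):
    """Render role/content messages into the <|role|>content<|end|> layout."""
    parts = [f"<|{m['role']}|>{m['content']}<|end|>" for m in messages]
    if add_generation_prompt:
        parts.append("<|assistant|>")  # cue the model to answer next
    return "".join(parts)


messages = [
    {"role": "system", "content": "Insert System Message"},
    {"role": "user", "content": "Insert User Message"},
]
print(format_chat(messages))
```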

Tool-enabled function-calling format

<|system|>You are a helpful assistant with some tools.<|tool|>[{"name": "get_weather_updates", "description": "Fetches weather updates for a given city using the RapidAPI Weather API.", "parameters": {"city": {"description": "The name of the city for which to retrieve weather information.", "type": "str", "default": "London"}}}]<|/tool|><|end|><|user|>What is the weather like in Paris today?<|end|><|assistant|>
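
The tool-enabled format places a JSON list of tool definitions between `<|tool|>` and `<|/tool|>` inside the system turn. The sketch below assembles such a prompt with `json.dumps`. `build_tool_prompt` is a hypothetical helper, and the exact whitespace the model expects inside the JSON is an assumption.

```python
import json


def build_tool_prompt(system_message, tools, user_message):
    """Build a prompt with tool definitions embedded in the system turn."""
    tools_json = json.dumps(tools)  # serialize the tool list as JSON
    return (
        f"<|system|>{system_message}<|tool|>{tools_json}<|/tool|><|end|>"
        f"<|user|>{user_message}<|end|><|assistant|>"
    )


tools = [
    {
        "name": "get_weather_updates",
        "description": "Fetches weather updates for a given city using the RapidAPI Weather API.",
        "parameters": {
            "city": {
                "description": "The name of the city for which to retrieve weather information.",
                "type": "str",
                "default": "London",
            }
        },
    }
]

prompt = build_tool_prompt(
    "You are a helpful assistant with some tools.",
    tools,
    "What is the weather like in Paris today?",
)
print(prompt)
```

Keeping the tool list as a Python structure and serializing it at the last step avoids hand-editing JSON inside a long prompt string.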

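📚 Detailed documentation

Intended use

Primary use cases

The model is intended for broad multilingual commercial and research use, in scenarios such as:

Memory- or compute-constrained environments.
Latency-sensitive scenarios.
General-purpose AI systems and applications that need strong reasoning, especially mathematical and logical reasoning.

Use case considerations

When choosing a use case, developers should take into account the common limitations of language models and the performance differences across languages. Before using the model in a specific downstream use case, particularly a high-risk one, they should evaluate and mitigate accuracy, safety, and fairness issues. Developers should also comply with applicable laws and regulations.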

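Release notes

This release of Phi-4-mini-instruct builds on user feedback from the Phi-3 series. It adopts a new architecture, a larger vocabulary, and improved post-training techniques, and it improves substantially on key capabilities. Users are encouraged to test it in their own AI applications.

Model quality

Phi-4-mini-instruct was compared against a set of models on a range of benchmarks using an internal benchmarking platform. The results indicate that it reaches a level of multilingual understanding and reasoning similar to that of larger models, although its size still limits it on some tasks.

Responsible AI considerations

Developers should apply responsible AI best practices, including evaluating and mitigating risks tied to their specific use case and to its cultural and linguistic context. When deploying the model, they should consider whether it is suitable with respect to resource allocation, high-risk scenarios, misinformation, harmful content generation, and misuse.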

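Training

Model information

Attribute	Details
Model type	Lightweight open model based on the Transformer architecture
Input	Text, best suited to prompts in chat format
Context length	128K tokens
GPUs	512 A100-80G
Training time	21 days
Training data	5T tokens
Output	Generated text
Training dates	November to December 2024
Status	Static model trained on an offline dataset with a cutoff of June 2024
Supported languages	Arabic, Chinese, Czech, Danish, Dutch, English, Finnish, French, German, Hebrew, Hungarian, Italian, Japanese, Korean, Norwegian, Polish, Portuguese, Russian, Spanish, Swedish, Thai, Turkish, Ukrainian
Release date	February 2025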
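
The figures in the table allow a rough throughput estimate. The arithmetic below assumes all 512 GPUs ran for the full 21 days and that the 5T tokens were processed once in that time. Neither assumption is stated in the table, so treat the results as order-of-magnitude values.

```python
# Back-of-the-envelope training arithmetic from the table above.
GPUS = 512
DAYS = 21
TOKENS = 5e12  # 5T tokens

gpu_hours = GPUS * DAYS * 24            # total GPU-hours
seconds = DAYS * 24 * 3600              # wall-clock seconds
tokens_per_second = TOKENS / seconds    # aggregate throughput
tokens_per_gpu_second = tokens_per_second / GPUS

print(f"GPU-hours: {gpu_hours:,}")
print(f"Aggregate throughput: {tokens_per_second:,.0f} tokens/s")
print(f"Per-GPU throughput: {tokens_per_gpu_second:,.0f} tokens/s")
```

Under these assumptions the run amounts to 258,048 GPU-hours, with per-GPU throughput on the order of 5,000 tokens per second.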