MiniLM-L6-mnli open-source text classification model: fast inference for efficient text classification

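MiniLM-L6-mnli

Developed by MoritzLaurer

A fast-inference text classification model trained on the MultiNLI dataset, using the MiniLM-L6 architecture

Text classification

Transformers

English #zero-shot-inference #fast-text-classification #MultiNLI

Downloads: 29

Released: 3/2/2022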

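Model overview

The model performs natural language inference (NLI): given two sentences, a premise and a hypothesis, it predicts the logical relation between them (entailment, neutral, or contradiction)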

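Model features

Efficient inference

Uses the lightweight MiniLM-L6 architecture, so inference is faster than with large models

Three-way relation classification

Identifies three logical relations between texts: entailment, neutral, and contradiction

Zero-shot classification

Can be applied to classification directly, without task-specific fine-tuning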
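
The zero-shot use of an NLI model can be sketched as follows: each candidate label is turned into a hypothesis, and the label whose hypothesis is most strongly entailed by the text wins. This is a minimal sketch, not the library's implementation. The names `zero_shot_classify`, `EntailmentScorer`, `toy_scorer`, and the template string are assumptions made for illustration, and the simple normalization over entailment scores is a simplification. In practice the scorer would wrap the NLI model.

```python
from typing import Callable, Dict, List

# Hypothetical scorer type: takes (premise, hypothesis) and returns the
# entailment probability in [0, 1]. In practice this would wrap the NLI model.
EntailmentScorer = Callable[[str, str], float]


def zero_shot_classify(
    text: str,
    candidate_labels: List[str],
    entailment_score: EntailmentScorer,
    template: str = "This text is about {}.",
) -> Dict[str, float]:
    """Score each label by how strongly `text` entails its hypothesis."""
    if not candidate_labels:
        raise ValueError("candidate_labels must not be empty")
    raw = {
        label: entailment_score(text, template.format(label))
        for label in candidate_labels
    }
    total = sum(raw.values())
    if total == 0:
        # No label received any entailment mass; fall back to a uniform split.
        return {label: 1.0 / len(raw) for label in raw}
    # Normalize so the scores over the candidate labels sum to 1.
    return {label: score / total for label, score in raw.items()}


def toy_scorer(premise: str, hypothesis: str) -> float:
    """Stand-in for a real model: crude keyword overlap, for illustration only."""
    template_words = {"this", "text", "is", "about"}
    premise_words = set(premise.lower().split())
    hypothesis_words = set(hypothesis.lower().rstrip(".").split()) - template_words
    return 0.9 if premise_words & hypothesis_words else 0.1


scores = zero_shot_classify(
    "The new film has a great cast",
    ["film", "politics"],
    toy_scorer,
)
best_label = max(scores, key=scores.get)
print(best_label, scores)
```

Passing the scorer in as a function keeps the label-to-hypothesis logic separate from the model, so the same sketch works with any NLI model that exposes an entailment probability.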

Model capabilities

Text classification

Natural language inference

Zero-shot learning

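Use cases

Text analysis

Movie review sentiment analysis

Infers sentiment by analyzing the relation between a user review and a reference evaluation statement

Content consistency checking

Detects logical contradictions between earlier and later parts of a document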
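
The consistency check could be sketched as a pairwise scan over a document's sentences that flags every pair whose contradiction probability is high. This is a sketch under stated assumptions. `find_contradictions`, `NliScorer`, `toy_nli`, and the 0.8 threshold are hypothetical names and values, and the toy scorer only stands in for the real model so the example runs without downloading anything.

```python
from itertools import combinations
from typing import Callable, Dict, List, Tuple

# Hypothetical scorer type: maps (premise, hypothesis) to probabilities for
# the three NLI labels. In practice this would wrap the NLI model.
NliScorer = Callable[[str, str], Dict[str, float]]


def find_contradictions(
    sentences: List[str],
    nli_probs: NliScorer,
    threshold: float = 0.8,
) -> List[Tuple[str, str, float]]:
    """Return sentence pairs whose contradiction probability exceeds `threshold`."""
    flagged = []
    for first, second in combinations(sentences, 2):
        # NLI is directional, so check both orders and keep the stronger signal.
        score = max(
            nli_probs(first, second)["contradiction"],
            nli_probs(second, first)["contradiction"],
        )
        if score > threshold:
            flagged.append((first, second, score))
    return flagged


def toy_nli(premise: str, hypothesis: str) -> Dict[str, float]:
    """Stand-in for a real model: flags 'X' vs 'not X' pairs, illustration only."""
    a = premise.lower().replace(" not ", " ")
    b = hypothesis.lower().replace(" not ", " ")
    negation_differs = (" not " in premise.lower()) != (" not " in hypothesis.lower())
    if a == b and negation_differs:
        return {"entailment": 0.02, "neutral": 0.03, "contradiction": 0.95}
    return {"entailment": 0.1, "neutral": 0.8, "contradiction": 0.1}


document = [
    "The service is available on weekends.",
    "Payment is due within 30 days.",
    "The service is not available on weekends.",
]
issues = find_contradictions(document, toy_nli)
print(issues)
```

The scan compares every pair, so its cost grows quadratically with the number of sentences. For long documents it may be worth restricting comparisons to sentences that share a topic.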

🚀 MiniLM-L6-mnli

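A model for text classification and zero-shot classification. It is based on the MiniLM-L6 architecture and trained on the MultiNLI dataset. It is fast, but somewhat less accurate than other models.

🚀 Quick start

The model can be used for text classification and zero-shot classification. A usage example follows: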

from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch

# Use the GPU when one is available, otherwise fall back to the CPU.
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")

model_name = "MoritzLaurer/MiniLM-L6-mnli"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name).to(device)

premise = "I liked the movie"
hypothesis = "The movie was good."

# Encode the premise/hypothesis pair and move the tensors to the model's device.
inputs = tokenizer(premise, hypothesis, truncation=True, return_tensors="pt").to(device)
with torch.no_grad():  # inference only, no gradients needed
    output = model(**inputs)
# Convert the logits into per-label percentages.
prediction = torch.softmax(output["logits"][0], -1).tolist()
label_names = ["entailment", "neutral", "contradiction"]
prediction = {name: round(float(pred) * 100, 1) for pred, name in zip(prediction, label_names)}
print(prediction)
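
The last steps of the example turn the model's logits into percentages per label. The following stdlib-only sketch shows that step in isolation. The function name `logits_to_percentages` and the logits are made up for illustration; real values come from the model output.

```python
import math
from typing import Dict, List

LABEL_NAMES = ["entailment", "neutral", "contradiction"]  # order used in the example above


def logits_to_percentages(logits: List[float]) -> Dict[str, float]:
    """Softmax over the logits, reported as percentages rounded to one decimal."""
    if len(logits) != len(LABEL_NAMES):
        raise ValueError("expected one logit per label")
    # Subtract the max before exponentiating for numerical stability.
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return {name: round(e / total * 100, 1) for name, e in zip(LABEL_NAMES, exps)}


# Made-up logits for illustration; real values come from the model output.
example = logits_to_percentages([2.0, 0.5, -1.0])
print(example)
```

Because each value is rounded independently, the three percentages may not add up to exactly 100.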

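✨ Key features

Intended tasks: text classification and zero-shot classification.
Base model: Microsoft's MiniLM-L6, which is fast but somewhat less accurate than other models.

📦 Installation

The source documentation gives no installation steps. The example above imports transformers and torch, so both packages need to be installed, for instance by following the standard Hugging Face installation instructions.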

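📚 Detailed documentation

Training data

The model was trained on the MultiNLI dataset.

Training procedure

MiniLM-L6-mnli-binary was trained with the Hugging Face Trainer using the following hyperparameters: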

from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="./results",          # placeholder path, not part of the reported hyperparameters
    num_train_epochs=5,              # total number of training epochs
    learning_rate=2e-05,             # peak learning rate
    per_device_train_batch_size=32,  # batch size per device during training
    per_device_eval_batch_size=32,   # batch size for evaluation
    warmup_ratio=0.1,                # fraction of training steps used for learning rate warmup
    weight_decay=0.06,               # strength of weight decay
    fp16=True                        # mixed precision training
)
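
To make the effect of `warmup_ratio` concrete, here is a rough sketch of how a ratio translates into warmup steps and a learning rate curve. It assumes the Trainer's default linear schedule, a single device, and no gradient accumulation. The helper names `schedule_summary` and `linear_lr` and the dataset size are hypothetical; the batch size, epochs, warmup ratio, and learning rate match the arguments above.

```python
import math


def schedule_summary(num_examples, batch_size, epochs, warmup_ratio):
    """Return (total_steps, warmup_steps) for one device, no gradient accumulation."""
    steps_per_epoch = math.ceil(num_examples / batch_size)
    total_steps = steps_per_epoch * epochs
    warmup_steps = math.ceil(total_steps * warmup_ratio)
    return total_steps, warmup_steps


def linear_lr(step, base_lr, total_steps, warmup_steps):
    """Learning rate at `step`: linear warmup from 0, then linear decay to 0."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Illustrative dataset size (not from the source); other values match the arguments above.
total, warmup = schedule_summary(num_examples=3200, batch_size=32, epochs=5, warmup_ratio=0.1)
peak_lr = linear_lr(warmup, base_lr=2e-05, total_steps=total, warmup_steps=warmup)
print(total, warmup, peak_lr)
```

With these illustrative numbers, the learning rate climbs during the first tenth of training, reaches its peak at the end of warmup, and then decays linearly to zero.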