DistilBart-MNLI開源模型 - 輕量級且準確率高，降低模型複雜應用

首頁

Distilbart Mnli 12 9

由valhalla開發

DistilBart-MNLI 是通過無教師蒸餾技術從 bart-large-mnli 蒸餾得到的輕量級版本，保持了較高的準確率同時減少了模型複雜度。

文本分類 #零樣本分類 #文本推理 #模型蒸餾

下載量 8,343

發布時間 : 3/2/2022

模型概述

該模型主要用於零樣本分類任務，特別適用於自然語言推理（NLI）場景。它是 bart-large-mnli 的輕量級蒸餾版本，通過交替複製層結構並在相同數據上微調得到。

模型特點

高效蒸餾

採用無教師蒸餾技術，從 bart-large-mnli 中交替複製層結構，顯著減小模型規模

高性能保持

在 MNLI 數據集上保持了接近原始模型的準確率，性能下降極小

多版本選擇

提供不同層數的多個版本（12-1,12-3,12-6,12-9），可根據需求平衡性能與效率

模型能力

自然語言推理

零樣本分類

文本分類

使用案例

文本分析

情感分析

無需特定訓練即可對文本進行情感傾向分類

主題分類

對文本內容進行多類別主題分類

問答系統

問題理解

分析問題與候選答案的語義關係

🚀 DistilBart - MNLI

DistilBart - MNLI 是 bart - large - MNLI 的蒸餾版本，它採用了 Huggingface 為 BART 摘要任務提出的無教師蒸餾技術創建而成，相關內容可查看此處。該模型通過從 bart - large - MNLI 中複製交替層，並在相同數據上進行更多微調得到。

✨ 主要特性

運用無教師蒸餾技術，在性能損失極小的情況下實現模型蒸餾。
提供了不同配置的蒸餾模型，可根據需求選擇。

📦 安裝指南

若你想自行訓練這些模型，可克隆 [distillbart - MNLI 倉庫](https://github.com/patil - suraj/distillbart - mnli)，並按以下步驟操作：

從源碼克隆並安裝 transformers：

git clone https://github.com/huggingface/transformers.git
pip install -qqq -U ./transformers

下載 MNLI 數據：

python transformers/utils/download_glue_data.py --data_dir glue_data --tasks MNLI

創建學生模型：

python create_student.py \
  --teacher_model_name_or_path facebook/bart-large-mnli \
  --student_encoder_layers 12 \
  --student_decoder_layers 6 \
  --save_path student-bart-mnli-12-6 \

開始微調：

python run_glue.py args.json

📚 詳細文檔

模型性能對比

模型	匹配準確率	不匹配準確率
[bart - large - MNLI](https://huggingface.co/facebook/bart - large - mnli) (基線模型, 12 - 12)	89.9	90.01
[distilbart - mnli - 12 - 1](https://huggingface.co/valhalla/distilbart - mnli - 12 - 1)	87.08	87.5
[distilbart - mnli - 12 - 3](https://huggingface.co/valhalla/distilbart - mnli - 12 - 3)	88.1	88.19
[distilbart - mnli - 12 - 6](https://huggingface.co/valhalla/distilbart - mnli - 12 - 6)	89.19	89.01
[distilbart - mnli - 12 - 9](https://huggingface.co/valhalla/distilbart - mnli - 12 - 9)	89.56	89.52