DistilBart-MNLI开源模型 - 轻量级且准确率高，降低模型复杂应用

首页

Distilbart Mnli 12 9

由 valhalla 开发

DistilBart-MNLI 是通过无教师蒸馏技术从 bart-large-mnli 蒸馏得到的轻量级版本，保持了较高的准确率同时减少了模型复杂度。

文本分类 #零样本分类 #文本推理 #模型蒸馏

下载量 8,343

发布时间 : 3/2/2022

模型简介

该模型主要用于零样本分类任务，特别适用于自然语言推理（NLI）场景。它是 bart-large-mnli 的轻量级蒸馏版本，通过交替复制层结构并在相同数据上微调得到。

模型特点

高效蒸馏

采用无教师蒸馏技术，从 bart-large-mnli 中交替复制层结构，显著减小模型规模

高性能保持

在 MNLI 数据集上保持了接近原始模型的准确率，性能下降极小

多版本选择

提供不同层数的多个版本（12-1,12-3,12-6,12-9），可根据需求平衡性能与效率

模型能力

自然语言推理

零样本分类

文本分类

使用案例

文本分析

情感分析

无需特定训练即可对文本进行情感倾向分类

主题分类

对文本内容进行多类别主题分类

问答系统

问题理解

分析问题与候选答案的语义关系

🚀 DistilBart - MNLI

DistilBart - MNLI 是 bart - large - MNLI 的蒸馏版本，它采用了 Huggingface 为 BART 摘要任务提出的无教师蒸馏技术创建而成，相关内容可查看此处。该模型通过从 bart - large - MNLI 中复制交替层，并在相同数据上进行更多微调得到。

✨ 主要特性

运用无教师蒸馏技术，在性能损失极小的情况下实现模型蒸馏。
提供了不同配置的蒸馏模型，可根据需求选择。

📦 安装指南

若你想自行训练这些模型，可克隆 [distillbart - MNLI 仓库](https://github.com/patil - suraj/distillbart - mnli)，并按以下步骤操作：

从源码克隆并安装 transformers：

git clone https://github.com/huggingface/transformers.git
pip install -qqq -U ./transformers

下载 MNLI 数据：

python transformers/utils/download_glue_data.py --data_dir glue_data --tasks MNLI

创建学生模型：

python create_student.py \
  --teacher_model_name_or_path facebook/bart-large-mnli \
  --student_encoder_layers 12 \
  --student_decoder_layers 6 \
  --save_path student-bart-mnli-12-6 \

开始微调：

python run_glue.py args.json

📚 详细文档

模型性能对比

模型	匹配准确率	不匹配准确率
[bart - large - MNLI](https://huggingface.co/facebook/bart - large - mnli) (基线模型, 12 - 12)	89.9	90.01
[distilbart - mnli - 12 - 1](https://huggingface.co/valhalla/distilbart - mnli - 12 - 1)	87.08	87.5
[distilbart - mnli - 12 - 3](https://huggingface.co/valhalla/distilbart - mnli - 12 - 3)	88.1	88.19
[distilbart - mnli - 12 - 6](https://huggingface.co/valhalla/distilbart - mnli - 12 - 6)	89.19	89.01
[distilbart - mnli - 12 - 9](https://huggingface.co/valhalla/distilbart - mnli - 12 - 9)	89.56	89.52