gliclass-modern-base-v3.0: an open-source zero-shot classifier that classifies in a single forward pass

Gliclass Modern Base V3.0

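Developed by knowledgator

GLiClass is an efficient zero-shot classifier inspired by GLiNER. It completes classification in a single forward pass, with performance comparable to cross-encoders and higher computational efficiency.

Text classification

Safetensors

License: Apache-2.0 #zero-shot-classification #efficient-inference #multi-label-classification

Downloads: 105

Released: 7/14/2025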

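Model overview

A general-purpose, lightweight sequence classification model. It supports zero-shot classification, topic classification, sentiment analysis, and reranking in RAG pipelines, and it has been trained for logical reasoning.

Model features

Efficient zero-shot classification

Classification completes in a single forward pass, which is cheaper to compute than a traditional cross-encoder.

Multi-task use

Supports topic classification, sentiment analysis, RAG reranking, and other text-processing tasks.

Logical reasoning

Trained specifically on logic tasks, so it can handle NLI-style tasks.

LoRA fine-tuning

Fine-tuned with LoRA adapters, which lets it adapt to new tasks while retaining pretrained knowledge.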

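Model capabilities

Zero-shot text classification

Multi-label classification

Natural language inference (NLI)

Sentiment analysis

Topic classification

Reranking for retrieval-augmented generation (RAG)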

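Use cases

Text analysis

News topic classification

Multi-label classification of news content (for example politics, sports, technology)

F1 of 0.3433 on the 20_news_groups dataset (the larger gliclass-large-v3.0 reaches 0.5958)

Sentiment analysis

Determine the sentiment of user reviews

F1 of 0.8959 on the sst2 dataset

Logical reasoning

Natural language inference

Determine the logical relationship between a premise and a hypothesis

Pass the premise as the text and the hypothesis as the label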

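🚀 GLiClass: a general-purpose lightweight model for sequence classification

GLiClass is an efficient zero-shot classifier inspired by the GLiNER work. It performs classification in a single forward pass, with performance comparable to cross-encoders and higher computational efficiency. It can be used for topic classification and sentiment analysis, and as a reranker in RAG pipelines.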

🚀 Quick start

Installation

First, install the GLiClass library:

pip install gliclass
pip install -U "transformers>=4.48.0"

Initialize the model and pipeline

import torch
from gliclass import GLiClassModel, ZeroShotClassificationPipeline
from transformers import AutoTokenizer

model = GLiClassModel.from_pretrained("knowledgator/gliclass-modern-base-v3.0")
tokenizer = AutoTokenizer.from_pretrained("knowledgator/gliclass-modern-base-v3.0", add_prefix_space=True)
# Use the first GPU when available, otherwise fall back to CPU.
device = 'cuda:0' if torch.cuda.is_available() else 'cpu'
pipeline = ZeroShotClassificationPipeline(model, tokenizer, classification_type='multi-label', device=device)

text = "One day I will see the world!"
labels = ["travel", "dreams", "sport", "science", "politics"]
# The pipeline returns one result list per input text; [0] selects the list for our single text.
results = pipeline(text, labels, threshold=0.5)[0]
for result in results:
    print(result["label"], "=>", result["score"])
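The loop above reads each result as a dictionary with `label` and `score` keys. A minimal sketch of post-processing that output follows. `top_labels` is a hypothetical helper, not part of gliclass, and the sample scores are invented for illustration rather than taken from the model.

```python
from typing import Dict, List, Union

Result = Dict[str, Union[str, float]]


def top_labels(results: List[Result], k: int = 3, min_score: float = 0.0) -> List[str]:
    """Return up to k labels whose score is at least min_score, highest first."""
    kept = [r for r in results if r["score"] >= min_score]
    kept.sort(key=lambda r: r["score"], reverse=True)
    return [r["label"] for r in kept[:k]]


# Illustrative values only, shaped like the result list for one text.
sample = [
    {"label": "travel", "score": 0.91},
    {"label": "sport", "score": 0.12},
    {"label": "dreams", "score": 0.77},
]
print(top_labels(sample, k=2, min_score=0.5))
```

Keeping the cut-off in a helper like this makes it possible to run the pipeline once with a low threshold and try different cut-offs afterwards without another forward pass.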

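Using the model for NLI tasks

For NLI-style tasks, represent the premise as the text and the hypothesis as a label. You can pass several hypotheses at once, but the model works best with a single hypothesis per input.

# Reuses the multi-label `pipeline` initialized above.
text = "The cat slept on the windowsill all afternoon"  # premise
labels = ["The cat was awake and playing outside."]  # hypothesis
# threshold=0.0 so that even a low-scoring hypothesis is reported.
results = pipeline(text, labels, threshold=0.0)[0]
print(results)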
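With a single hypothesis, the result list should hold one score, which can be turned into a yes/no decision. The sketch below assumes the same `label`/`score` dictionary shape used in the quick start. `is_supported` and its 0.5 cut-off are hypothetical choices, not values documented for this model, and the example score is invented.

```python
from typing import Dict, List, Union


def is_supported(results: List[Dict[str, Union[str, float]]], cutoff: float = 0.5) -> bool:
    """Treat the hypothesis as supported when its score reaches the cutoff.

    Expects the result list for one premise and exactly one hypothesis label.
    """
    if len(results) != 1:
        raise ValueError("expected exactly one hypothesis per premise")
    return results[0]["score"] >= cutoff


# Invented score for illustration; a contradicted hypothesis should score low.
example = [{"label": "The cat was awake and playing outside.", "score": 0.03}]
print(is_supported(example))
```

A cut-off suited to a real task is best chosen on a small labelled validation set rather than fixed in advance.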

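✨ Key features

Efficient zero-shot classification: inspired by the GLiNER work, classification completes in a single forward pass, which keeps compute cost low.
Multi-task use: topic classification, sentiment analysis, and reranking in RAG pipelines.
Logical reasoning: the model was trained on logic tasks to induce reasoning capabilities.
LoRA fine-tuning: the model was fine-tuned with LoRA adapters to preserve previously learned knowledge.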
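The source does not show how reranking is wired up. One plausible arrangement, sketched here as an assumption, treats each retrieved passage as the text and the user query as the only label, then sorts passages by score. `rerank` and `fake_classify` are hypothetical names; `fake_classify` is a word-overlap stand-in so the example runs without the model, and the assumed call shape `classify(texts, labels, threshold=...)` mirrors the pipeline call in the quick start.

```python
from typing import Callable, Dict, List, Tuple

# Assumed call shape: classify(texts, labels, threshold=...) returns
# one list of {"label", "score"} dictionaries per input text.
Classifier = Callable[..., List[List[Dict[str, object]]]]


def rerank(query: str, passages: List[str], classify: Classifier,
           top_k: int = 3) -> List[Tuple[str, float]]:
    """Score every passage against the query (used as the only label) and sort."""
    if not passages:
        return []
    per_passage = classify(passages, [query], threshold=0.0)
    scored = []
    for passage, results in zip(passages, per_passage):
        # Fall back to 0.0 if the classifier returned nothing for this passage.
        score = float(results[0]["score"]) if results else 0.0
        scored.append((passage, score))
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


def fake_classify(texts, labels, threshold=0.0):
    """Stand-in scorer for demonstration: fraction of query words found in the text."""
    words = labels[0].lower().split()
    out = []
    for t in texts:
        hits = sum(w in t.lower() for w in words)
        out.append([{"label": labels[0], "score": hits / len(words)}])
    return out


docs = ["Paris is the capital of France.", "Bananas are yellow.", "France borders Spain."]
print(rerank("capital of France", docs, fake_classify, top_k=2))
```

To try this with the real model, pass the `pipeline` object from the quick start in place of `fake_classify`, after checking that it accepts a list of texts in your installed version.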

📚 Detailed documentation

LoRA parameters

Parameter	gliclass-modern-base-v3.0	gliclass-modern-large-v3.0	gliclass-base-v3.0	gliclass-large-v3.0
LoRA r	512	768	384	384
LoRA α	1024	1536	768	768
focal loss α	0.7	0.7	0.7	0.7
Target modules	"Wqkv", "Wo", "Wi", "linear_1", "linear_2"	"Wqkv", "Wo", "Wi", "linear_1", "linear_2"	"query_proj", "key_proj", "value_proj", "dense", "linear_1", "linear_2", "mlp.0", "mlp.2", "mlp.4"	"query_proj", "key_proj", "value_proj", "dense", "linear_1", "linear_2", "mlp.0", "mlp.2", "mlp.4"
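In the standard LoRA formulation, the low-rank update is scaled by α / r. The sketch below computes that ratio for the rank and α values in the table above. It assumes the standard scaling applies; the source does not say whether a variant such as rank-stabilized LoRA was used. The dictionary keys `r` and `lora_alpha` follow common LoRA naming and are illustrative, not a confirmed training configuration.

```python
# Rank and alpha values copied from the LoRA parameters table above.
LORA_SETTINGS = {
    "gliclass-modern-base-v3.0": {"r": 512, "lora_alpha": 1024},
    "gliclass-modern-large-v3.0": {"r": 768, "lora_alpha": 1536},
    "gliclass-base-v3.0": {"r": 384, "lora_alpha": 768},
    "gliclass-large-v3.0": {"r": 384, "lora_alpha": 768},
}


def lora_scaling(r: int, lora_alpha: int) -> float:
    """Scaling factor applied to the low-rank update in standard LoRA (alpha / r)."""
    return lora_alpha / r


for name, cfg in LORA_SETTINGS.items():
    print(name, lora_scaling(**cfg))
```

Under that assumption every model uses the same ratio of 2.0, so the variants differ in adapter capacity (rank) rather than in how strongly the update is weighted.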

GLiClass-V3 model information

Model name	Size	Parameters	Average benchmark F1	Average inference speed (batch size = 1, A6000, examples/s)
gliclass-edge-v3.0	131 MB	32.7M	0.4873	97.29
gliclass-modern-base-v3.0	606 MB	151M	0.5571	54.46
gliclass-modern-large-v3.0	1.6 GB	399M	0.6082	43.80
gliclass-base-v3.0	746 MB	187M	0.6556	51.61
gliclass-large-v3.0	1.75 GB	439M	0.7001	25.22

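Benchmarks

F1 scores on several text classification datasets are listed below. None of the tested models was fine-tuned on these datasets; all were evaluated in a zero-shot setting.

GLiClass-V3

Dataset	gliclass-large-v3.0	gliclass-base-v3.0	gliclass-modern-large-v3.0	gliclass-modern-base-v3.0	gliclass-edge-v3.0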
CR	0.9398	0.9127	0.8952	0.8902	0.8215
sst2	0.9192	0.8959	0.9330	0.8959	0.8199
sst5	0.4606	0.3376	0.4619	0.2756	0.2823
20_news_groups	0.5958	0.4759	0.3905	0.3433	0.2217
spam	0.7584	0.6760	0.5813	0.6398	0.5623
financial_phrasebank	0.9000	0.8971	0.5929	0.4200	0.5004
imdb	0.9366	0.9251	0.9402	0.9158	0.8485
ag_news	0.7181	0.7279	0.7269	0.6663	0.6645
emotion	0.4506	0.4447	0.4517	0.4254	0.3851
cap_sotu	0.4589	0.4614	0.4072	0.3625	0.2583
rotten_tomatoes	0.8411	0.7943	0.7664	0.7070	0.7024
massive	0.5649	0.5040	0.3905	0.3442	0.2414
banking	0.5574	0.4698	0.3683	0.3561	0.0272
Average	0.7001	0.6556	0.6082	0.5571	0.4873
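The last row of the table appears to be the unweighted mean of the 13 per-dataset F1 scores. A short check for the gliclass-modern-base-v3.0 column, with values copied from the table above:

```python
from statistics import mean

# F1 scores for gliclass-modern-base-v3.0, copied from the table above.
modern_base_f1 = {
    "CR": 0.8902, "sst2": 0.8959, "sst5": 0.2756, "20_news_groups": 0.3433,
    "spam": 0.6398, "financial_phrasebank": 0.4200, "imdb": 0.9158,
    "ag_news": 0.6663, "emotion": 0.4254, "cap_sotu": 0.3625,
    "rotten_tomatoes": 0.7070, "massive": 0.3442, "banking": 0.3561,
}

average = mean(modern_base_f1.values())
print(round(average, 4))  # 0.5571, matching the table
```

Because the mean is unweighted, datasets where the model scores very low (such as sst5) pull the headline number down as much as the large sentiment datasets pull it up.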

Previous GLiClass models

Dataset	gliclass-large-v1.0-lw	gliclass-base-v1.0-lw	gliclass-modern-large-v2.0	gliclass-modern-base-v2.0
CR	0.9226	0.9097	0.9154	0.8977
sst2	0.9247	0.8987	0.9308	0.8524
sst5	0.2891	0.3779	0.2152	0.2346
20_news_groups	0.4083	0.3953	0.3813	0.3857
spam	0.3642	0.5126	0.6603	0.4608
financial_phrasebank	0.9044	0.8880	0.3152	0.3465
imdb	0.9429	0.9351	0.9449	0.9188
ag_news	0.7559	0.6985	0.6999	0.6836
emotion	0.3951	0.3516	0.4341	0.3926
cap_sotu	0.4749	0.4643	0.4095	0.3588
rotten_tomatoes	0.8807	0.8429	0.7386	0.6066
massive	0.5606	0.4635	0.2394	0.3458
banking	0.3317	0.4396	0.1355	0.2907
Average	0.6273	0.6291	0.5400	0.5211

Cross-encoders

Dataset	deberta-v3-large-zeroshot-v2.0	deberta-v3-base-zeroshot-v2.0	roberta-large-zeroshot-v2.0-c	comprehend_it-base
CR	0.9134	0.9051	0.9141	0.8936
sst2	0.9272	0.9176	0.8573	0.9006
sst5	0.3861	0.3848	0.4159	0.4140
enron_spam	0.5970	0.4640	0.5040	0.3637
financial_phrasebank	0.5820	0.6690	0.4550	0.4695
imdb	0.9180	0.8990	0.9040	0.4644
ag_news	0.7710	0.7420	0.7450	0.6016
emotion	0.4840	0.4950	0.4860	0.4165
cap_sotu	0.5020	0.4770	0.5230	0.3823
rotten_tomatoes	0.8680	0.8600	0.8410	0.4728
massive	0.5180	0.5200	0.5200	0.3314
banking77	0.5670	0.4460	0.2900	0.4972
Average	0.6695	0.6483	0.6213	0.5173

Inference speed

Each model was tested on examples with text lengths of 64, 256, and 512 tokens and with 1, 2, 4, 8, 16, 32, 64, and 128 labels; the results were then averaged across text lengths.

Model name (samples per second by number of labels)	1	2	4	8	16	32	64	128	Average
gliclass-edge-v3.0	103.81	101.01	103.50	103.50	98.36	96.77	88.76	82.64	97.29
gliclass-modern-base-v3.0	56.00	55.46	54.95	55.66	54.73	54.95	53.48	50.34	54.46
gliclass-modern-large-v3.0	46.30	46.82	46.66	46.30	43.93	44.73	42.77	32.89	43.80
gliclass-base-v3.0	49.42	50.25	40.05	57.69	57.14	56.39	55.97	45.94	51.61
gliclass-large-v3.0	19.05	26.86	23.64	29.27	29.04	28.79	27.55	17.60	25.22
deberta-v3-base-zeroshot-v2.0	24.55	30.40	15.38	7.62	3.77	1.87	0.94	0.47	10.63
deberta-v3-large-zeroshot-v2.0	16.82	15.82	7.93	3.98	1.99	0.99	0.49	0.25	6.03
roberta-large-zeroshot-v2.0-c	50.42	39.27	19.95	9.95	5.01	2.48	1.25	0.64	16.12
comprehend_it-base	21.79	27.32	13.60	7.58	3.80	1.90	0.97	0.49	9.72
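The table shows the cross-encoders roughly halving in throughput each time the label count doubles, while GLiClass stays nearly flat. That pattern is what one would expect when a cross-encoder runs one forward pass per label. The sketch below computes the resulting speedup of gliclass-modern-base-v3.0 over deberta-v3-base-zeroshot-v2.0 at each label count, using only the figures above.

```python
label_counts = [1, 2, 4, 8, 16, 32, 64, 128]
# Samples per second, copied from the table above.
gliclass_modern_base = [56.00, 55.46, 54.95, 55.66, 54.73, 54.95, 53.48, 50.34]
deberta_v3_base = [24.55, 30.40, 15.38, 7.62, 3.77, 1.87, 0.94, 0.47]

# Ratio of GLiClass throughput to cross-encoder throughput per label count.
speedups = {
    n: round(g / d, 1)
    for n, g, d in zip(label_counts, gliclass_modern_base, deberta_v3_base)
}
for n, s in speedups.items():
    print(f"{n:>3} labels: {s}x")
```

The gap is modest with one or two labels and exceeds 100x at 128 labels, so the efficiency advantage matters most for large label sets.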

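📄 License

This project is released under the Apache-2.0 license.

Model details

Property	Details
Model type	General-purpose lightweight sequence classification model
Training data	BioMike/formal-logic-reasoning-gliclass-2k, knowledgator/gliclass-v3-logic-dataset, tau/commonsense_qa
Evaluation metric	F1
Tags	text classification, nli, sentiment analysis
Pipeline tag	text-classification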