MetaHateBERT开源文本分类模型 - 免费部署精准检测仇恨言论

首页

Metahatebert

由 irlab-udc 开发

MetaHateBERT是一个基于BERT架构的文本分类模型，专门用于检测仇恨言论。

文本分类

Transformers

英语开源协议:Apache-2.0 #仇恨言论检测 #社交媒体内容审核 #BERT微调

下载量 1,456

发布时间 : 6/17/2024

模型简介

该模型基于bert-base-uncased架构，在自定义数据集上进行了微调，可实现二元文本分类，标签为'无仇恨'和'仇恨'。

模型特点

仇恨言论检测

专门用于检测社交媒体评论、论坛和其他文本数据源中的仇恨言论。

内容审核

平台可使用该模型自动标记潜在的有害内容。

模型能力

文本分类

仇恨言论检测

使用案例

内容审核

社交媒体评论审核

自动检测社交媒体评论中的仇恨言论

标记潜在有害内容

论坛内容过滤

识别论坛帖子中的仇恨言论

帮助维护健康的讨论环境

🚀 MetaHateBERT

MetaHateBERT是一个经过微调的BERT模型，专门用于检测文本中的仇恨言论。它基于bert-base-uncased架构，在自定义数据集上进行了微调，可实现二元文本分类，标签为无仇恨和仇恨。

🚀 快速开始

推理

要使用此模型，你可以通过transformers库加载它：

from transformers import pipeline

# 加载模型
classifier = pipeline("text-classification", model="irlab-udc/MetaHateBERT")

# 测试模型
result = classifier("Your input text here")
print(result)  # 应打印标签 "no hate" 或 "hate"

✨ 主要特性

仇恨言论检测：该模型旨在检测社交媒体评论、论坛和其他文本数据源中的仇恨言论。
内容审核：平台可使用该模型自动标记潜在的有害内容。

📚 详细文档

预期用途

仇恨言论检测：此模型用于检测社交媒体评论、论坛和其他文本数据源中的仇恨言论。
内容审核：平台可使用该模型自动标记潜在的有害内容。

局限性

偏差：模型可能带有训练数据中存在的偏差。
误报/漏报：模型并非完美，可能会误分类某些实例。
领域特异性：在不同领域的性能可能有所不同。

📄 许可证

本模型使用的是Apache-2.0许可证。

🔗 相关信息

属性	详情
模型类型	文本分类
训练数据	irlab-udc/metahate
评估指标	准确率、F1值

📚 引用

如果使用此模型，请引用以下参考文献：

@article{Piot_Martín-Rodilla_Parapar_2024,
  title={MetaHate: A Dataset for Unifying Efforts on Hate Speech Detection},
  volume={18},
  url={https://ojs.aaai.org/index.php/ICWSM/article/view/31445},
  DOI={10.1609/icwsm.v18i1.31445},
  abstractNote={Hate speech represents a pervasive and detrimental form of online discourse, often manifested through an array of slurs, from hateful tweets to defamatory posts. As such speech proliferates, it connects people globally and poses significant social, psychological, and occasionally physical threats to targeted individuals and communities. Current computational linguistic approaches for tackling this phenomenon rely on labelled social media datasets for training. For unifying efforts, our study advances in the critical need for a comprehensive meta-collection, advocating for an extensive dataset to help counteract this problem effectively. We scrutinized over 60 datasets, selectively integrating those pertinent into MetaHate. This paper offers a detailed examination of existing collections, highlighting their strengths and limitations. Our findings contribute to a deeper understanding of the existing datasets, paving the way for training more robust and adaptable models. These enhanced models are essential for effectively combating the dynamic and complex nature of hate speech in the digital realm.},
  number={1},
  journal={Proceedings of the International AAAI Conference on Web and Social Media},
  author={Piot, Paloma and Martín-Rodilla, Patricia and Parapar, Javier},
  year={2024},
  month={May},
  pages={2025-2039}
}

🙏 致谢

作者感谢Horizon Europe研究与创新计划根据Marie Skłodowska - Curie资助协议No. 101073351提供的资金支持。作者还感谢文化、教育、职业培训和大学部（认证2019 - 2022 ED431G/01，ED431B 2022/33）和欧洲区域发展基金的财政支持，该基金认可拉科鲁尼亚大学的CITIC信息通信技术研究中心为加利西亚大学系统的研究中心，以及项目PID2022 - 137061OB - C21（科学与创新部，国家研究机构，知识生成项目；由欧洲区域发展基金支持）。作者还感谢项目PLEC2021 - 007662（MCIN/AEI/10.13039/501100011033，科学与创新部，国家研究机构，恢复、转型和韧性计划，欧盟 - 下一代欧盟）的资金支持。