D

Duoguard 0.5B

Developed by DuoGuard
DuoGuard-0.5B is a multilingual, decoder-only large language model-based classifier, specifically designed for safety content moderation across 12 different subcategories.
Downloads 235
Release Time : 2/7/2025

Model Overview

This model is used to classify the safety of input text sequences, supporting multilingual content moderation and capable of detecting potentially unsafe or disallowed content across 12 different subcategories.

Model Features

Multilingual support
Specifically fine-tuned for safety content moderation in English, French, German, and Spanish, while retaining the base model's capability to support 29 languages.
Fine-grained classification
Capable of detecting potentially unsafe content across 12 different subcategories, providing multi-label probability distributions.
Binary moderation
Can generate simplified 'safe'/'unsafe' labels by comparing the maximum probability of the 12 subcategories with a threshold.

Model Capabilities

Multilingual text classification
Content safety moderation
Multi-label classification
Binary classification

Use Cases

Content moderation
Social media content moderation
Automatically detect unsafe or disallowed content on social media platforms
Capable of identifying potentially risky content across 12 different subcategories
Chatbot safety guardrails
Provide safety guardrails for chatbots to prevent the generation of unsafe content
Real-time detection and filtering of unsafe responses
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase