S

Shieldgemma 2b

Developed by google
ShieldGemma is a series of secure content review models built on Gemma 2, targeting four types of harmful content (pornography, dangerous content, hate, and harassment).
Downloads 3,107
Release Time : 7/16/2024

Model Overview

ShieldGemma is a decoder-only large language model that supports English, has open weights, and offers 3 scales: 2B, 9B, and 27B parameters for secure content review.

Model Features

Multi-hazard type review
Review four types of harmful content: pornography, dangerous content, hate, and harassment
Multi-scale selection
Offer model selection with three parameter scales: 2B, 9B, and 27B
Flexible application
Support two application modes: prompt-only content classification and prompt-response content classification

Model Capabilities

Text classification
Content security review
Harmful content detection

Use Cases

Content security
User input filtering
Detect whether user input contains harmful content
Identify and filter inappropriate content such as dangerous, hateful, and harassing content
Model output filtering
Detect whether AI-generated content violates security policies
Ensure that AI output complies with security specifications
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase