L

Llama Guard 3 8B

Developed by meta-llama
Llama Guard 3 is a content safety classifier fine-tuned from the Llama-3.1-8B pre-trained model for content moderation of LLM inputs and responses.
Downloads 327.59k
Release Time : 7/22/2024

Model Overview

Llama Guard 3 is a content safety classifier that can be used for content moderation of both inputs (prompt classification) and responses (response classification) in large language models (LLMs). Operating as an LLM, it generates text output indicating content safety, listing violated categories if unsafe.

Model Features

Multilingual support
Supports prompt and response classification in 8 languages including English, French, German, Hindi, Italian, Portuguese, Spanish and Thai
14-category harm detection
Trained on 13 categories of harm based on MLCommons taxonomy plus code interpreter abuse, covering a wide range of security risks
Low false positive rate
Significantly reduces false positives compared to previous models and GPT-4 while maintaining high F1 scores
Tool usage scenario support
Adds safety detection capabilities for tool usage scenarios like search tools and code interpreters

Model Capabilities

Prompt classification
Response classification
Multilingual content moderation
Security risk detection
Code interpreter abuse detection

Use Cases

Content moderation
LLM input filtering
Detects potentially harmful or non-compliant content in user inputs
Effectively identifies 14 categories of harmful content including violence and hate speech
LLM output filtering
Detects potentially harmful or non-compliant content in model responses
Prevents inappropriate model responses, reducing legal and reputational risks
Security compliance
Multilingual platform moderation
Provides a unified content safety solution for multilingual platforms
Supports violation detection in 8 languages
Tool usage security
Detects potential abuse in tool usage such as code interpreters
Identifies malicious uses like denial-of-service attacks and privilege escalation
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase