D

Deberta V3 Base Prompt Injection

Developed by protectai
A DeBERTa-v3 fine-tuned model for prompt injection detection, designed to identify malicious prompt inputs
Downloads 35.13k
Release Time : 11/25/2023

Model Overview

This model is specifically designed to detect prompt injection attacks, classifying input text as either normal prompts or malicious injection prompts, helping to safeguard AI systems.

Model Features

High-Precision Detection
Achieves 99.99% accuracy and 99.98% F1 score on the evaluation dataset
Multi-Dataset Training
Trained on 12 datasets from different sources, covering various prompt injection patterns
Multi-Framework Support
Provides both native Transformers and ONNX runtime options
Ecosystem Integration
Supports integration with popular frameworks like Langchain and LLM Guard

Model Capabilities

Text Classification
Malicious Input Detection
Security Protection

Use Cases

AI Security
Chatbot Protection
Prevents malicious users from manipulating chatbot behavior through prompt injection attacks
Effectively identifies 99.7% of injection attempts
API Security Gateway
Detects and blocks potentially malicious prompts at the API gateway layer
Content Moderation
Harmful Content Filtering
Identifies malicious prompts attempting to bypass content restrictions
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase