Prompt Injection DeBERTa
finetuned DeBERTa-based prompt injection detection
Build secure, reliable, and long-term AI systems focused on safety, reasoning, and developer tooling.
AI Security • Prompt Defense • LLM Safety
Building secure, reliable AI systems focused on prompt security, adversarial robustness, and practical safety tooling.
Prompt injection detection framework designed to classify benign vs malicious prompts across real-world and synthetic attack patterns.
Dataset: neuralchemy/prompt-injection-dataset 6000+ prompt injection and benign samples collected from realistic attack scenarios.
ML Models: neuralchemy/prompt-injection-detector Classical machine learning classifiers for prompt risk detection.
DeBERTa Fine-Tuned Model: neuralchemy/prompt-injection-deberta Transformer-based prompt injection classifier.
Live Demo Space: Prompt-injection-DeBERTa Interactive inference demo for prompt safety classification.
Advancing AI security through open datasets, practical model deployment, and adversarial safety research.
Building safer AI systems through open security research. 🚀