RAGWall: Prompt Injection Detection for Retrieval-Augmented Generation (RAG) Systems
~90% detection on public prompt-injection with transformer, ~70% regex-only (<1ms). ~48% HRCR reduction on healthcare corpus. Zero observed false positives on tested benign sets.
⚡ ~90% Detection Rate • <1ms Latency • 0% False Positives