InjectShield

What datasets exist for prompt-injection testing (PromptInject, HarmBench, etc.)?

The 2026 landscape of public injection-evaluation datasets:

For RAG: PoisonedRAG (Zou et al., 2024) and PromptBench RAG subsets cover stored injection. For multimodal: MMSafety and Anthropic's vision red-team disclosures.

Production-grade testing should combine 2-3 datasets across direct, indirect, and agent surfaces, plus your own domain-specific adversarial corpus. InjectShield publishes its open-source heuristic ruleset on GitHub and is evaluated against PromptInject + HarmBench + INJECAGENT — benchmarks at injectshield.dev/benchmarks.