Palo Alto Networks' Prisma AIRS AI Red Teaming is an automated solution
designed to scan any AI system—including LLMs and LLM-powered
applications—for safety and security vulnerabilities.
The tool performs a Scan against a specified Target (model,
application, or agent) by sending carefully crafted attack prompts to
simulate real-world threats. The findings are compiled into a comprehensive Scan
Report that includes an overall Risk Score (ranging from 0 to 100),
indicating the system's susceptibility to attacks.
Prisma AIRS offers three distinct scanning modes for thorough assessment:
Attack Library Scan: Uses a curated, proprietary library of predefined
attack scenarios, categorized by Security (e.g., Prompt Injection,
Jailbreak), Safety (e.g., Bias, Cybercrime), and Compliance
(e.g., OWASP LLM Top 10).
Agent Scan: Utilizes a dynamic LLM attacker to generate and
adapt attacks in real-time, enabling full-spectrum Black-box, Grey-box, and
White-box testing.
Custom Attack Scan: Allows users to upload and execute their own
custom prompt sets alongside the built-in library.
A key feature of the service is its single-tenant deployment model, which
ensures complete isolation of compute resources and data for enhanced security and
privacy.