How AI Pentesting Works¶

AI Pentesting runs on Cascade, Escape's multi-agent pentest engine. Cascade pairs Escape's crawling with an autonomous swarm of agents that reason about your application, attack it, and prove what they find. This page walks through an assessment from setup to evidence.

Profile Setup & Scope¶

AI Pentesting profiles are created from the AI Pentesting page with the New Pentest creation form.

The first section is scope. See Scope for modes, restrictions, Strict CDN allowances, and how enforcement works.

Before launch, Review configuration starts a validation workflow. It checks that Escape can reach the target, classify the listed URLs, and authenticate each configured user. The assessment can only be launched after that validation passes.

Crawling¶

Cascade starts every assessment with a crawling phase that builds the map the swarm works from. AI Pentesting uses agentic crawling to explore applications:

State-aware exploration
Natural language instructions
Error recovery
Context understanding

Agents use LLM reasoning to navigate web applications and discover endpoints.

Escape enforces assessment scope at the network boundary, inside agents, and in agent prompts. URL and GraphQL restrictions from the profile blocklist are applied during the run. See Scope for configuration details.

The Cascade Swarm¶

After crawling, Cascade takes over. It's not a fixed pipeline of single-purpose agents: it's an orchestrated swarm that adapts to what the crawler found.

Four roles drive the engagement:

Orchestrator: plans the engagement and spawns workers on demand.
Workers: short-lived agents for reconnaissance, exploitation, or access-control testing.
Reporter: independently reproduces each candidate finding before it's filed.
Coverage: tracks tested surfaces and pushes the orchestrator toward gaps.

Workers share discoveries through a message bus and a shared knowledge store. See The Cascade Engine for the full architecture, skills catalogue, and limits.

Cascade multi-agent pentest engine architecture

What Cascade Tests For¶

Cascade spins up the right specialists for the surfaces it finds. The classes it covers include:

Cross-site scripting: context-aware payloads across HTML, attribute, JavaScript, and DOM contexts.
SQL injection: reconnaissance-first testing that prioritizes database-backed endpoints before escalating to confirmed exploitation.
Access control: BOLA, IDOR, tenant isolation, and privilege escalation, using the users you configure.
Business logic: workflow bypasses, state manipulation, replay, and idempotency flaws.
Front-end JavaScript: leaked secrets and undocumented APIs mined from served bundles.
SSRF and command injection: server-side request forgery and remote command execution where the surface allows it.

These classes were once handled by separate, single-purpose agents. Cascade now runs them as dynamic capabilities spawned on demand.

Cascade uses the users configured in the Authentication section to test authorization boundaries. Each user has a name and natural-language instructions that explain how to sign in and what that account should be able to access. See Authentication for configuration details.

Agents load modular skills scoped per task (vulnerability classes, tooling, frameworks, protocols, and more) rather than running a fixed playbook. See The Cascade Engine for the full catalogue.

Assessment Workflow¶

Discovery: the crawler explores the application and builds the map.
Planning: the orchestrator reasons about structure and behavior, then assigns work.
Testing: workers execute injection attacks, access-control probes, and business-logic tests in parallel.
Validation: the reporter independently reproduces each candidate and collects evidence.
Reporting: confirmed issues are filed with an attack chain, impact, and a Proof of Exploit.

Agent reasoning is visible in assessment logs, showing why agents took specific actions and how they adapted their strategies. See Proof of Exploit.

AI Models¶

A common question from customers is "which LLM does AI Pentesting use?", often with the assumption that everything is powered by a single model like ChatGPT. It isn't. Cascade is a multi-model system: different stages of an assessment, and different agents, are powered by different models chosen for the job at hand.

A Portfolio of Models, not a Single Model¶

Escape routes work across a portfolio of frontier commercial models and open-weight models hosted on Escape's own infrastructure. No single provider or model powers the product end-to-end. We treat models as interchangeable components and select the best one for each sub-task based on continuous internal benchmarks.

Why Different Models for Different Tasks?¶

Security testing is not one problem. It's a pipeline of very different cognitive tasks, each with different requirements:

Crawling & navigation: Visual and DOM reasoning, tool use, and cost efficiency. Agents click, type, and navigate thousands of pages per assessment. Speed and cost dominate. See Agentic Crawling.
Exploit design & payload generation: Creativity, divergent thinking, and broad world knowledge. Finding a new attack path or crafting a novel payload benefits from models that explore the solution space aggressively.
Exploit validation & confirmation: Strict, deterministic reasoning, low hallucination, and adherence to evidence. Confirming an exploit worked must be precise. A false positive here becomes a false positive in your Issue queue.
Business logic & multi-step planning: Long-context reasoning, planning, and state tracking. Multi-step attacks (BOLA chains, workflow bypasses) require reasoning over long transcripts of requests, responses, and prior agent actions.
Evidence summarization & remediation write-ups: Instruction-following, concision, and technical writing. The reproduction steps, cURL commands, and remediation guidance surfaced in a finding are generated from raw evidence.

A model that is excellent at one of these stages is often not the best choice for another. Using a single model everywhere would mean trading off creativity against rigor on every single task.

Model Selection per Role¶

Cascade composes the stages above across its agent roles with different weightings:

The orchestrator leans on long-context planning models to track identities, roles, and multi-step attack plans across a long engagement.
Exploitation workers pair creative payload-generation models with reconnaissance-oriented reasoning to prioritize the highest-value surfaces.
The reporter uses a strict validation model that only confirms a finding when the evidence is unambiguous.
Summarization of reproduction steps and remediation guidance uses instruction-following models tuned for concise technical writing.

The exact routing, including which model is used at which step, with which prompts, tools, and guardrails, is part of Escape's private IP and is what makes the product work. Customers benefit from this routing without having to build it themselves.

Always Evaluating the Newest Models¶

Frontier models move fast. Escape runs every new major release (GPT, Claude, Gemini, Llama, and others) against an internal security-evaluation harness composed of real-world vulnerability reproduction tasks, benchmark applications, and regression suites. A new model is promoted only when it beats the incumbent on our metrics, not on a public leaderboard. This means:

The model mix today is not the model mix six months ago.
Improvements in upstream model capabilities translate into better findings, reproductions, and remediations without changes on the customer's side.
If a new model regresses on security reasoning, we don't ship it, even if it's cheaper or faster.

Data Handling¶

Regardless of which model is used, customer data handling is governed by Escape's Privacy & Security policy:

No training on customer data. Customer traffic, payloads, findings, and evidence are never used to train third-party models.
Zero-retention agreements are in place with commercial model providers where available, so prompts and responses are not retained by the provider.
Open-weight models used in sensitive contexts run on Escape-controlled infrastructure, so data never leaves Escape's trust boundary.
Organization administrators can disable all AI Pentesting activity at any time via the AI Pentesting Kill Switch.

What about reproducibility?

Because the model mix evolves over time, an AI Pentesting assessment is intentionally not bit-for-bit reproducible. That's a feature, not a bug. It's what lets agents find things that a frozen, signature-based scanner cannot. Determinism where it matters, such as finding identity, reproduction steps, and evidence, is provided by Escape's own engine, not by the underlying LLM.

The Cascade Engine: Architecture, skills, and limits
Graph Reasoning: How Cascade builds attack paths
Scope: Standard vs Strict, restrictions, and enforcement
Proof of Exploit: Evidence, logs, and coverage
Agentic Crawling: Technical details on crawling

architecture workflow cascade agents crawling xss sqli business logic ai models llm models chatgpt openai anthropic claude gemini