How our agents validate a finding
A finding you can't reproduce is a guess. The entire value of a penetration test collapses if the team on the receiving end can't tell signal from noise. That's why every Parameter finding ships with a working proof-of-concept — and why our agents never report a vulnerability they haven't actually triggered.
Probe, exploit, verify
Our agents reason like a human red teamer working a target:
- Probe. Map the surface — endpoints, parameters, auth flows, and the business logic that connects them.
- Hypothesize. Form a specific, testable claim: "this
idparameter is an IDOR that exposes other tenants' records." - Exploit. Build the request that proves it, capturing the full request and response.
- Verify. Replay the exploit from a clean state. If it doesn't reproduce, it isn't a finding.
Why proof-of-concept matters
A confirmed exploit changes the conversation. Instead of "the scanner flagged this, please investigate," your engineers get a reproduction they can run, watch fail, and fix — then re-test to confirm the patch holds.
It also keeps us honest. An agent that has to produce a working exploit can't pad a report with theoretical maybes. The bar is binary: did it reproduce, or not?
Validation by construction
Because validation is part of the loop rather than a step bolted on afterward, false positives stay rare — under 1% in practice. You spend your time fixing real issues, not arguing with a tool about whether they're real.