In mid-2025, McDonald’s McHire platform showed how basic authorization failures still break at internet scale. Public reporting described roughly 64 million applicant records as exposed through a combination of weak admin credentials and an IDOR-style record access flaw that let researchers retrieve other applicants’ PII. The endpoint accepted a sequential lead_id parameter. Decrement by one, get someone else's PII. No sophisticated tooling required. The attacker just changed a number.
That same month, GitLab patched multiple critical authorization bypasses: CVE-2025-4972, CVE-2025-6168, and CVE-2025-3396. Users bypassing group-level restrictions. A 2FA bypass. All logic and authorization flaws.
Broken Access Control has been OWASP's #1 vulnerability category for many years. We've known about IDOR, BOLA, and privilege escalation for decades. The tooling has improved dramatically. And yet the breaches keep happening.
Let’s understand why - and what's actually changing in 2026.
The practitioner objection you'll find on every security forum is this: "Automated tools can't find IDOR. You need a real pentester for business logic."
This has been true. Here's the technical reason why.
Traditional DAST scanners crawl endpoints and match responses against known vulnerability signatures. It knows what a SQL injection looks like. It knows what a missing security header looks like. What it cannot do:
Business logic flaws require context. Scanners don't have context. They have signatures.
We observe that more than 50% of critical vulnerabilities are authorization and access control flaws, and the traditional SAST detection rate for these was near zero. Finding them manually took 2 to 4 hours per endpoint. The same task, with AI reasoning, can be done in minutes.
That gap between what scanners find and what attackers exploit is where most breaches actually live.
The business logic blind spot is part of a larger structural problem. After fifteen years of building security testing tools and running a community of 40,000+ security professionals, I see the same three gaps at almost every enterprise:
Most organizations test their crown-jewel applications. Peripheral apps, forgotten subdomains, UAT environments, third-party integrations - these get assumed safe.
DAST tools generate 40 to 70% false positive rates. Your security team spends most of its time triaging noise - while missing the connections between findings that matter.
Development teams ship weekly. Sometimes daily. Most enterprises still pentest once a year.
The shift worth understanding isn't faster scanning. It's a different kind of reasoning.
Here's a real example of what this looks like in practice. An agentic AI system was testing a web application when it found an exposed .git directory:
Four steps. Fully autonomous. No human steering. No predefined playbook.
This is the difference between signature detection and contextual reasoning. The agent didn't match a pattern - it formed a hypothesis, tested it, and adapted. That is exactly what IDOR exploitation looks like in the real world: an attacker who understands what the application is trying to do, and tests whether the authorization logic actually enforces it.
Modern agentic platforms can now detect BOLA, IDOR, privilege escalation, and workflow bypasses at scale - the vulnerabilities practitioners have always said required human creativity. That doesn't mean human pentesters are obsolete. It means the division of labor is changing:
Whether you're evaluating a platform or building internal capability, these are the questions that matter:
The cost comparison for web app pen testing in 2026:
|
Approach |
Cost per app |
False Positive Rate |
Chaining |
Cadence |
|
Manual PT consulting |
$2,400–$10,000+ |
Low, variable |
Manual, scoped |
Annual |
|
DAST tools |
$1,460–$2,900 (estimated) |
40–70% |
✗ Not supported |
Continuous scanning only |
|
Agentic AI (FireCompass) |
Under $1,000 |
Under 2% |
Autonomous |
On-demand or continuous |
In practice, one Fortune 500 company moved from testing 200 of its 2,000+ web applications annually - at ~$5,000 per app - to continuous coverage across its full portfolio at under $1,000 per app. That's an 11x cost reduction and a jump from 10% to 99% coverage. Simultaneously.
In 2024, proactive security testing prevented an estimated $2.88 billion in potential losses - with IDOR, AWS key leakage, and 2FA bypasses among the highest-value findings, each carrying six-figure damage potential individually.
The math on doing nothing has changed.
Business logic issues. IDOR. BOLA. Credential reuse across services. These are not exotic vulnerabilities. They're OWASP #1. They're McDonald's and GitLab, and the breach that happened at a company that looked exactly like yours.
The question worth asking: of your web application portfolio, how much of it has been tested - not scanned, tested - in the last 90 days? With proof-of-exploit for every finding? Across both external and internal assets?
If the answer is less than half, you're not running a security program. You're running a compliance exercise and hoping the untested part doesn't make headlines.
Priyanka Aash is Co-Founder of FireCompass, a 2025 SC Media Power Player Honoree, and the author of The AI Divide. She is known for building the global CISOPlatform community focused on bringing together 40,000+ cybersecurity leaders together and driving meaningful industry conversations.
FireCompass is an Agentic AI platform for autonomous penetration testing and red teaming across Web, API, and infrastructure. It discovers shadow assets and web applications, safely validates what is exploitable, and connects findings into multi-stage attack paths with near-zero false positives. Unlike traditional scanners, it discovers credential reuse, business-logic flaws, privilege escalation, and app-to-app or app-to-network lateral movement. It can operate autonomously or with expert-in-the-loop validation. FireCompass has 30+ analyst recognitions across Gartner, Forrester, IDC, and is trusted by Fortune 1000 enterprises. Try it free at firecompass.com/explorer .