Back to news

AI News

The OpenClaw security audit: CVE-2026-25253 explained.

A deep dive into the prompt injection vulnerability that prompted OpenClaw's first independent security audit and what it means for agent safety.

AI Kick Start editorial image for The OpenClaw security audit: CVE-2026-25253 explained.

Decision

Start narrow

Use the article to decide the smallest useful workflow worth testing before expanding the system.

Risk to watch

Hype drift

Avoid turning a practical adoption step into a broad transformation promise nobody can verify.

Proof to collect

Business signal

Write down the owner, data boundary, review point, and measurable outcome before the first build.

TL;DR

TL;DR: A deep dive into the prompt injection vulnerability that prompted OpenClaw's first independent security audit and what it means for agent safety.

Key takeaways

  • Briefing: In early 2026, the maintainers of OpenClaw hit the moment every popular open-source project dreads: a publicly disclosed, high-severity security hole with a CVE number attached.
  • The Vulnerability: Here's what the bug actually is, based on the vendors who analysed it.
  • Discovery and Disclosure: The earlier version of this article credited the find to a researcher named Elena Vasquez during an OSTIF-funded audit, with a 90-day responsible disclosure window.
  • The Fix: The patch shipped fast.
  • The Independent Audit: The earlier draft claimed the CVE triggered OpenClaw's first comprehensive independent audit, run by Trail of Bits over six weeks, which found CVE-2026-25253 to be the only critical issue plus two medium-severity bugs.

Briefing

In early 2026, the maintainers of OpenClaw hit the moment every popular open-source project dreads: a publicly disclosed, high-severity security hole with a CVE number attached. The bug, tracked as [CVE-2026-25253](https://www.proarch.com/blog/threats-vulnerabilities/openclaw-rce-vulnerability-cve-2026-25253), was rated CVSS 8.8, firmly in the "patch this now" range.

What the bug actually does matters, because there's been some confusion about it. Earlier write-ups (including an earlier version of this piece) framed it as a prompt injection flaw in OpenClaw's browser automation skill. That framing appears to be wrong. The security vendors who looked at it, SonicWall, ProArch and others, describe a one-click remote code execution bug caused by authentication-token theft, not a prompt injection. The article below has been corrected to reflect that, and the parts that couldn't be verified are flagged as such.

For an Australian business team, the takeaway is simpler than the CVE details: if you're running a self-hosted AI agent that can touch your files, your browser sessions, or your shell, a single bad link can turn into a serious breach. OpenClaw is one of the most popular tools in that category, with around 345,000 GitHub stars. When something this widely deployed ships an 8.8, it's worth understanding what went wrong.

The Vulnerability

Here's what the bug actually is, based on the vendors who analysed it. CVE-2026-25253 is a one-click remote code execution flaw caused by authentication-token theft. OpenClaw's Control UI trusted an unvalidated gatewayUrl query parameter and exfiltrated the gateway token over a WebSocket connection, classified as CWE-669. In plain terms: an attacker could trick the UI into handing its access token to an attacker-controlled server, then use that token to run code.

That's a meaningfully different problem from the prompt-injection story that circulated early on. ProArch put it bluntly: this is "remote code execution via token theft, not prompt injection." The attack doesn't need the agent to read a poisoned web page and get talked into misbehaving. It needs a victim to click one crafted link.

The reason the distinction matters for defenders: a token-theft RCE is about how your agent's UI handles untrusted parameters and where it sends credentials, not about how it sanitises page content. The fix lives in different code, and so does the hardening you'd do on your side.

Supporting AI Kick Start editorial image for openclaw-security-audit-cve-2026-25253.
Generated AI Kick Start editorial visual used to explain the article's practical workflow and trade-offs.

Discovery and Disclosure

The earlier version of this article credited the find to a researcher named Elena Vasquez during an OSTIF-funded audit, with a 90-day responsible disclosure window. None of that holds up. No source ties a researcher by that name, or OSTIF funding, to this CVE, and the real timeline doesn't fit a 90-day private window at all.

The verifiable timeline, per ProArch, runs much tighter: discovery in late January 2026, public disclosure around 3 February 2026, and a patched build (version 2026.1.29) out the door on roughly 29-30 January. So the original "April 2026" date was wrong too, this was a January, February story.

Treat the "Elena Vasquez / OSTIF / 90-day disclosure" account as unconfirmed and most likely fabricated. The CVE is real; the backstory attached to it isn't.

The Fix

The patch shipped fast. A fixed version, 2026.1.29, landed in late January, which is consistent with a quick turnaround. The specific "within 48 hours of disclosure" figure can't be confirmed against the real chronology, so read that as a rough characterisation rather than a measured number.

A note on the four-part fix described in the earlier draft, input sanitisation, instruction separation, capability restrictions, and a content-security-policy-style mechanism. Those defences map to the prompt-injection story, not the actual token-theft bug. For a token-theft RCE, the relevant fix is validating that gatewayUrl parameter and refusing to send credentials to untrusted origins. The hardening principles below still apply to running any agent safely, but the specific remediation here was about credential handling, not page sanitisation.

That said, the broader defensive ideas remain sound for anyone deploying an agent:

Input handling: Don't trust parameters from the URL or untrusted content. Validate before you act on them.

Instruction separation: Keep system instructions isolated from anything that came from a user or a web page.

Capability restrictions: Run the agent with the least privilege it can get away with. File system access, external API calls, and sensitive operations should need explicit confirmation.

Origin checks: Never send tokens or credentials to an origin you haven't verified.

The Independent Audit

The earlier draft claimed the CVE triggered OpenClaw's first comprehensive independent audit, run by Trail of Bits over six weeks, which found CVE-2026-25253 to be the only critical issue plus two medium-severity bugs. Both claims are unsupported, and the second one contradicts the public record.

No source found ties Trail of Bits to an OpenClaw audit. The audits that are documented came from other parties, for example, an analysis that found over 41% of popular OpenClaw skills contained security vulnerabilities, plus reviews attributed to CertiK and others. So treat the "Trail of Bits six-week audit" as unverified.

The "only one critical vulnerability" claim is the bigger problem. The reality looks like the opposite of a clean bill of health: reporting points to OpenClaw facing a multi-vector security crisis through 2026, with sources citing 138+ CVEs, one formal audit turning up 512 vulnerabilities (8 of them critical), and a supply-chain poisoning campaign in its skills marketplace. A single-critical-issue narrative doesn't match that record, so it's flagged as likely fabricated.

Community Impact

The original framing here was that the incident strengthened confidence in OpenClaw's security and that enterprise adopters cited the audit as a deciding factor. The available sources don't back that up, and several point the other way.

Conscia, among others, describes OpenClaw's 2026 situation as a security crisis, RCE, supply-chain skill poisoning, and a large share of skills carrying vulnerabilities. No source supports the idea that enterprises picked OpenClaw *because* of an audit. So the "credibility win" reading is unconfirmed and runs against the documented picture.

What's true and worth holding onto: OpenClaw is a real, widely used open-source agent framework with shell, browser automation, and file skills, plus a ClawHub-style skills marketplace. And it is genuinely popular, around 345,000 GitHub stars as of April 2026. Popularity and a working patch are facts. The tidy "this made everyone trust us more" story is not.

Lessons for the Agent Ecosystem

Strip away the parts that didn't check out, and there's still a real lesson here for anyone running agent software in a business.

  • Untrusted input is everywhere: Agents pull in content and parameters from sources you don't control. A URL parameter is untrusted input just like a web page is.
  • Credentials are the crown jewels: This bug was a token-theft RCE. Where your agent stores and sends its tokens matters more than almost anything else.
  • Default permissions should be tight: Least privilege by default. If a skill doesn't need file access, don't give it file access.
  • Security is ongoing: One patch doesn't close the book. OpenClaw's broader 2026 record, many CVEs, a poisoned skills marketplace, is the real cautionary tale, not a single fixed bug.

If you're evaluating OpenClaw or any self-hosted agent for your team, the honest summary is this: the project is real and popular, this particular CVE was patched quickly, but the wider security story through 2026 has been rough. Run it sandboxed, keep it patched, lock down skill permissions, and don't take any single "it's all fine now" narrative at face value, including the one this article originally told.

Source trail

Primary references to keep this briefing grounded

AI and automation information changes quickly. Use these official or primary references to verify the claims, pricing, product behaviour, and compliance details before committing budget or production data.

What to do next

  1. Pick the smallest useful workflow that proves the pattern.
  2. Write down the owner, data boundary, review point, and success measure.
  3. Review the result after the first real run and decide whether to scale, change, or stop.

Want help applying this? Explore AI agent design systems.

AI Kick Start is an Illawarra-based AI studio in Figtree, helping businesses across Wollongong, Shellharbour and Kiama and right across Australia put AI to work.

Explore with AI

Use the article as a decision prompt

Summarise this AI Kick Start article for an Australian business owner. Focus on the useful decision, the risks, and the first practical next step: The OpenClaw security audit: CVE-2026-25253 explained

Turn this into a practical roadmap.

Use the guide as a starting point, then map the first workflow worth building.

Book an AI strategy call