Hidden commands and phishing emails: researchers demonstrate two attack paths against OpenClaw AI agent
Two research teams published attacks against the OpenClaw AI agent this week. Imperva hid commands in shared contacts and location pins that the agent executed. Varonis showed a plain email could trick an agent into forwarding AWS keys and customer data.
Two security research teams have published separate attacks against OpenClaw, the popular self-hosted AI agent platform, demonstrating that malicious instructions hidden inside shared contacts, vCards, and plain emails can drive the agent to run attacker-controlled code or hand over sensitive data without the victim ever seeing the prompt.
Imperva researcher Yohann Sillam found that when OpenClaw flattens message objects into the prompt text for the large language model, it does not mark the content as untrusted. A shared contact name containing angled brackets is serialized as
Agent phishing works where technical defenses hold
Varonis Threat Labs built a test agent named Pinchy on OpenClaw, wired it to a Gmail inbox stocked with synthetic business data, and ran four phishing simulations on both Gemini 3.1 Pro and OpenAI Codex GPT-5.4. The agent failed both exfiltration tests. A message posing as a team lead named Dan, sent from an outside Gmail address, asked for staging access during a fake production incident. Pinchy found credentials and forwarded mock AWS IAM keys, database strings, and SSH credentials in plaintext. A second request for a weekly customer export shipped a synthetic dataset of 247 enterprise customers, contacts, and contract values.
Both failures happened under a strict profile that told the agent to verify senders first. The rule existed. Urgency beat it once, routine beat it the second time. The agent did better on technical threats: it flagged a gift-card phishing page, blocked a malicious OAuth consent screen dressed as a timesheet app, and stopped before granting access.
Underlying weakness spans multiple platforms
Imperva found the same flattening pattern in other personal AI assistants, so the problem is not OpenClaw alone. The Varonis attacks map onto what security researcher Simon Willison calls the lethal trifecta: an agent that can read private data, take in untrusted content, and send data back out. OpenClaw has all three.
A single widely shared contact carrying a hidden instruction could quietly compromise multiple agents if they are not sandboxed, Imperva warns. Varonis says the social judgment gap means agent phishing will persist even as technical attacks get patched. Varonis lead researcher Itay Yashar told The Hacker News the agent drive to be helpful is the attack surface.
OpenClaw users should update to version 2026.4.23 to address the message-object injection. The broader defense against agent phishing, researchers say, requires limiting what an agent can do autonomously and adding human approval gates for outbound data transfers.
Fact check
-
Imperva buried hidden commands inside shared contacts, vCards, and location pins that caused the OpenClaw AI agent to execute attacker-controlled code.
reported · source
-
Varonis built a test agent on OpenClaw and showed that a single plain email could trick it into forwarding mock AWS keys and a fake customer export to an outside address.
reported · source
-
The flaw Imperva found is patched in OpenClaw version 2026.4.23.
reported · source
-
Imperva found the same flattening pattern in other personal AI assistants, indicating the problem is not unique to OpenClaw.
reported · source
-
Varonis maps the attacks onto what Simon Willison calls the 'lethal trifecta': an agent that can read private data, take in untrusted content, and send data back out.
reported · source
Source reporting (6)
- The Hacker News · New Attacks Trick OpenClaw AI Agent Into Running Code and Leaking Secrets
- The New Stack · How to delegate 40% of tickets to AI
- The New Stack · Agentic development hinges on verification. For cloud-native software, that is a runtime problem.
- The New Stack · AI agents need infrastructure: Why Europe’s regional cloud strategy matters
- BleepingComputer · Why AI-driven threats are exposing the limits of MSP security stacks
- The Hacker News · AI Broke Vulnerability Management. That's Why CISOs Are Moving Budget to BAS.
Join the conversation
You need to be registered and logged in to comment on blog articles.
0 Comments
No comments yet
Be the first to share your thoughts on this article.