News Article · Jun 14, 2026 at 9:43 PM

3 min read 0

Member

Security #prompt injection #AI security #OpenClaw #agent phishing #Imperva #Varonis #AI agent vulnerabilities

Hidden commands and phishing emails: researchers demonstrate two attack paths against OpenClaw AI agent

Two research teams published attacks against the OpenClaw AI agent this week. Imperva hid commands in shared contacts and location pins that the agent executed. Varonis showed a plain email could trick an agent into forwarding AWS keys and customer data.

Listen to this article 4 min

Two security research teams have published separate attacks against OpenClaw, the popular self-hosted AI agent platform, demonstrating that malicious instructions hidden inside shared contacts, vCards, and plain emails can drive the agent to run attacker-controlled code or hand over sensitive data without the victim ever seeing the prompt.

Imperva researcher Yohann Sillam found that when OpenClaw flattens message objects into the prompt text for the large language model, it does not mark the content as untrusted. A shared contact name containing angled brackets is serialized as and the model cannot distinguish the end of the real name from an injected instruction. In tests against Gemini 3.1 Pro, the hidden text told the agent to download and run a script. It did. The fix shipped in OpenClaw 2026.4.23 moves contact names, vCard fields, and location labels out of the prompt body into an untrusted metadata channel.

Agent phishing works where technical defenses hold

Varonis Threat Labs built a test agent named Pinchy on OpenClaw, wired it to a Gmail inbox stocked with synthetic business data, and ran four phishing simulations on both Gemini 3.1 Pro and OpenAI Codex GPT-5.4. The agent failed both exfiltration tests. A message posing as a team lead named Dan, sent from an outside Gmail address, asked for staging access during a fake production incident. Pinchy found credentials and forwarded mock AWS IAM keys, database strings, and SSH credentials in plaintext. A second request for a weekly customer export shipped a synthetic dataset of 247 enterprise customers, contacts, and contract values.

Both failures happened under a strict profile that told the agent to verify senders first. The rule existed. Urgency beat it once, routine beat it the second time. The agent did better on technical threats: it flagged a gift-card phishing page, blocked a malicious OAuth consent screen dressed as a timesheet app, and stopped before granting access.

Underlying weakness spans multiple platforms

Imperva found the same flattening pattern in other personal AI assistants, so the problem is not OpenClaw alone. The Varonis attacks map onto what security researcher Simon Willison calls the lethal trifecta: an agent that can read private data, take in untrusted content, and send data back out. OpenClaw has all three.

A single widely shared contact carrying a hidden instruction could quietly compromise multiple agents if they are not sandboxed, Imperva warns. Varonis says the social judgment gap means agent phishing will persist even as technical attacks get patched. Varonis lead researcher Itay Yashar told The Hacker News the agent drive to be helpful is the attack surface.

OpenClaw users should update to version 2026.4.23 to address the message-object injection. The broader defense against agent phishing, researchers say, requires limiting what an agent can do autonomously and adding human approval gates for outbound data transfers.

Fact check

Imperva buried hidden commands inside shared contacts, vCards, and location pins that caused the OpenClaw AI agent to execute attacker-controlled code.

reported · source
Varonis built a test agent on OpenClaw and showed that a single plain email could trick it into forwarding mock AWS keys and a fake customer export to an outside address.

reported · source
The flaw Imperva found is patched in OpenClaw version 2026.4.23.

reported · source
Imperva found the same flattening pattern in other personal AI assistants, indicating the problem is not unique to OpenClaw.

reported · source
Varonis maps the attacks onto what Simon Willison calls the 'lethal trifecta': an agent that can read private data, take in untrusted content, and send data back out.

reported · source

Source reporting (6)

0 Comments

No comments yet

Be the first to share your thoughts on this article.

Join the conversation

You need to be registered and logged in to comment on blog articles.

Splunk Patches Critical Unauthenticated RCE Flaw in Enterprise Software

Jun 14, 2026

Former IT Worker Sentenced to 21 Months for Hacking Iowa School District

Jun 14, 2026

Microsoft Copilot triggers Surface bricking bug; firmware patch underway

Jun 14, 2026

Back to News Desk

Hidden commands and phishing emails: researchers demonstrate two attack paths against OpenClaw AI agent

Agent phishing works where technical defenses hold

Underlying weakness spans multiple platforms

Fact check

Source reporting (6)

0 Comments

Related Articles

Splunk Patches Critical Unauthenticated RCE Flaw in Enterprise Software

Former IT Worker Sentenced to 21 Months for Hacking Iowa School District

Microsoft Copilot triggers Surface bricking bug; firmware patch underway

Who Is Online