OpenClaw AI Agent Attacks Highlight Trust Risks

TL; DR

Researchers demonstrated multiple attack techniques against OpenClaw AI agents.
Hidden prompt injection techniques reportedly influenced agent behavior through contacts, vCards, and location metadata.
Agent phishing simulations reportedly convinced an OpenClaw agent to share mock credentials and customer data.
The findings highlight the importance of trust boundaries, least-privilege access, and monitoring high-risk AI agent actions.

The latest research involving the OpenClaw AI agent highlights a growing challenge in enterprise security. As organizations deploy AI agents with access to business data and communications, security teams must consider how untrusted content could influence automated actions.

Researchers from Imperva and Varonis independently demonstrated how OpenClaw agents could be influenced by seemingly routine inputs. Their findings showed that contacts, vCards, location data, and email messages could be used to deliver prompt injection or agent phishing attacks, leading to simulated code execution and data disclosure scenarios.

The findings are notable because they do not rely on traditional account compromise. Instead, they demonstrate how attackers may exploit trust relationships inside AI-driven workflows.

Strengthen endpoint security with Hexnode XDR

Table of Contents

Two research teams, one security problem
Why AI Agents change the prompt injection risk model
Why verification rules did not fully prevent data disclosure
Understanding the enterprise risk
How to reduce exposure and mitigate risk
How Hexnode supports investigation and response
Conclusion
FAQs

Two research teams, one security problem

Imperva and Varonis used different approaches but exposed a similar security challenge: AI agents may act on untrusted content when it is interpreted as instructions.

Imperva examined how OpenClaw processed shared contacts, vCards, and location pins. Researchers reported that hidden instructions embedded within those objects could become part of the prompt context supplied to the model. In testing, a prompt injection scenario reportedly caused the agent to download and execute a script from a researcher-controlled server.

OpenClaw addressed the message-object prompt injection issue in version 2026.4.23.

Varonis focused on a different attack path. Researchers connected an OpenClaw agent to Gmail and populated the environment with synthetic business information. They then sent phishing-style emails designed to appear operationally legitimate. During testing, the agent reportedly shared:

Mock AWS keys
SSH credentials
Database connection strings
Customer export data

Although the attack techniques differed, both demonstrations showed how untrusted content could influence agent behavior and trigger actions that organizations would not normally expect from automated workflows.

Why AI Agents change the prompt injection risk model

Prompt injection is not a new concept. What changes the risk profile is the growing number of actions available to modern autonomous agents.

The OpenClaw research illustrates this shift. Rather than simply generating text, the tested agents could interact with external systems, process business data, and perform automated actions. As researchers demonstrated, manipulating an agent’s inputs can potentially influence what the agent does next.

Depending on how they are configured, enterprise AI agents may be able to:

Read emails and business communications
Access internal files and data sources
Retrieve sensitive information
Interact with connected applications and services
Execute automated tasks
Send information externally

8 Security Blind Spots Putting Your Business at Risk

Discover eight security blind spots that threaten business resilience today.

In traditional applications, malicious content might affect a single workflow. In agentic environments, the same content may influence a system capable of making decisions and performing actions on behalf of users.

Why verification rules did not fully prevent data disclosure

One notable finding from the Varonis testing involved an agent profile that reportedly required sender verification before processing requests.

Despite those instructions, researchers found that the agent still complied with malicious requests during testing. This highlights a broader challenge with agent phishing, where attackers attempt to manipulate automated decision-making rather than exploit a software vulnerability.

Trust boundary breakdown

Component	Intended Purpose	What Researchers Demonstrated
Contacts and vCards	Share business information	Hidden instructions reportedly influenced agent behavior
Location Pins	Provide contextual location data	Embedded content entered the prompt context
Email Messages	Support communication workflows	Phishing-style messages reportedly prompted the agent to share mock sensitive data
Agent Integrations	Enable automation across systems	Increased the impact of prompt injection and social engineering
External Communication Channels	Deliver business outputs	Created potential pathways for AI data exfiltration

Understanding the enterprise risk

The OpenClaw research demonstrates how prompt injection and phishing-style attacks can influence AI agents that have access to business data and automated workflows.

AI Agents expand the blast radius

Traditional phishing campaigns target users. Agent phishing attempts to manipulate AI agents that have access to business data or automated workflows.

Sensitive data becomes more accessible

When autonomous agents have access to business data and communications, successful manipulation attempts may increase the risk of unintended data disclosure.

Visibility gaps create investigation challenges

Organizations may struggle to determine exactly what data an agent accessed, what actions it performed, and whether those actions aligned with business intent.

How to reduce exposure and mitigate risk

Organizations deploying AI agents should focus on reducing unnecessary access, limiting trust relationships, and monitoring automated actions.

Deploy OpenClaw security updates promptly.
Restrict agent permissions using least-privilege principles.
Separate trusted instructions from untrusted content sources.
Implement approval workflows for sensitive actions.
Limit access to credentials, secrets, and customer datasets.
Review third-party connectors and integration permissions regularly.
Monitor unusual agent behavior and outbound communications.
Maintain visibility into endpoints and systems supporting AI workflows.

Featured resource

Hexnode for data security

Learn how UEM strengthens data security through policy enforcement, encryption, compliance, and access controls.

DOWNLOAD

How Hexnode supports investigation and response

As AI agents gain access to business systems, visibility and governance become increasingly important.

With Hexnode UEM, organizations can:

Enforce security policies across managed endpoints.
Monitor device compliance and enforce security policies across managed devices.
Restrict access to organizational resources based on device compliance requirements.

With Hexnode XDR, security teams can:

Investigate incidents using endpoint activity data and incident visibility provided by Hexnode XDR.
Detect security incidents and threats across monitored endpoints.
Take supported response actions from the Hexnode XDR console to contain and remediate detected threats.

Conclusion

The OpenClaw AI agent research highlights a broader challenge facing enterprise AI deployments. Rather than a single product issue, the findings show how autonomous systems can become vulnerable when trusted instructions and untrusted content are not clearly separated.

As organizations expand the use of AI agents, security teams should focus on least-privilege access, monitored actions, approval controls, and visibility into agent behavior. These measures can help reduce the risks associated with prompt injection, agent phishing, and AI data exfiltration.

Secure AI workflows with visibility

See how Hexnode helps security teams investigate threats and improve endpoint visibility.

FAQs

What is the OpenClaw AI agent?

OpenClaw is a self-hosted AI agent platform that can connect to external tools, services, and data sources to perform automated tasks.

What is agent phishing?

Agent phishing involves using deceptive messages or content to influence an AI agent’s decisions or actions.

Why is AI data exfiltration a growing concern?

Depending on how they are configured, AI agents may have access to sensitive information and external communication channels, increasing the potential impact of unauthorized data disclosure.

OpenClaw AI Agent Attacks Show Why Trust Boundaries Matter More Than Ever

TL; DR

Two research teams, one security problem

Why AI Agents change the prompt injection risk model

8 Security Blind Spots Putting Your Business at Risk

Why verification rules did not fully prevent data disclosure

Trust boundary breakdown

Understanding the enterprise risk

AI Agents expand the blast radius

Sensitive data becomes more accessible

Visibility gaps create investigation challenges

How to reduce exposure and mitigate risk

Featured resource

Hexnode for data security

How Hexnode supports investigation and response

With Hexnode UEM, organizations can:

With Hexnode XDR, security teams can:

Conclusion

Secure AI workflows with visibility

FAQs

Share

Sophia Hart

You May Also Like

Mastra npm Supply-Chain Attack Compromises 144 AI Framework Packages

Benefits of XDR: Why CISOs Are Making the Switch

Google Lawsuit Targets Alleged Gemini Phishing Operation Behind Outsider Smishing Kit

ShinyHunters Data Breach: The Ransomware “Double-Tap” on Healthcare

Join readers from 120 countries

Subscribe to Hexnode Blog

OpenClaw AI Agent Attacks Show Why Trust Boundaries Matter More Than Ever

TL; DR

Two research teams, one security problem

Why AI Agents change the prompt injection risk model

8 Security Blind Spots Putting Your Business at Risk

Why verification rules did not fully prevent data disclosure

Trust boundary breakdown

Understanding the enterprise risk

AI Agents expand the blast radius

Sensitive data becomes more accessible

Visibility gaps create investigation challenges

How to reduce exposure and mitigate risk

Featured resource

Hexnode for data security

How Hexnode supports investigation and response

With Hexnode UEM, organizations can:

With Hexnode XDR, security teams can:

Conclusion

Secure AI workflows with visibility

FAQs

Share

Sophia Hart

You May Also Like

Mastra npm Supply-Chain Attack Compromises 144 AI Framework Packages

Benefits of XDR: Why CISOs Are Making the Switch

Google Lawsuit Targets Alleged Gemini Phishing Operation Behind Outsider Smishing Kit

ShinyHunters Data Breach: The Ransomware “Double-Tap” on Healthcare

Join readers from 120 countries