OpenClaw AI Agent Flaws Might Allow Immediate Injection and Information Exfiltration

Ravie LakshmananMar 14, 2026Synthetic Intelligence / Endpoint Safety

China’s Nationwide Laptop Community Emergency Response Technical Crew (CNCERT) has issued a warning concerning the safety stemming from using OpenClaw (previously Clawdbot and Moltbot), an open-source and self-hosted autonomous synthetic intelligence (AI) agent.

In a submit shared on WeChat, CNCERT famous that the platform’s “inherently weak default safety configurations,” coupled with its privileged entry to the system to facilitate autonomous activity execution capabilities, could possibly be explored by dangerous actors to grab management of the endpoint.

This consists of dangers arising from immediate injections, the place malicious directions embedded inside an internet web page may cause the agent to leak delicate info if it is tricked into accessing and consuming the content material.

The assault can be referred to as oblique immediate injection (IDPI) or cross-domain immediate injection (XPIA), as adversaries, as a substitute of interacting straight with a big language mannequin (LLM), weaponize benign AI options like net web page summarization or content material evaluation to run manipulated directions. This may vary from evading AI-based advert evaluate programs and influencing hiring choices to SEO (search engine marketing) poisoning and producing biased responses by suppressing destructive opinions.

OpenAI, in a weblog submit revealed earlier this week, mentioned immediate injection-style assaults are evolving past merely putting directions in exterior content material to incorporate components of social engineering.

“AI brokers are more and more capable of browse the net, retrieve info, and take actions on a consumer’s behalf,” it mentioned. “These capabilities are helpful, however additionally they create new methods for attackers to attempt to manipulate the system.”

The immediate injection dangers in OpenClaw usually are not hypothetical. Final month, researchers at PromptArmor discovered that the hyperlink preview function in messaging apps like Telegram or Discord may be become an information exfiltration pathway when speaking with OpenClaw via an oblique immediate injection.

The thought, at a excessive stage, is to trick the AI agent into producing an attacker-controlled URL that, when rendered within the messaging app as a hyperlink preview, robotically causes it to transmit confidential information to that area with out having to click on on the hyperlink.

“Because of this in agentic programs with hyperlink previews, information exfiltration can happen instantly upon the AI agent responding to the consumer, with out the consumer needing to click on the malicious hyperlink,” the AI safety firm mentioned. “On this assault, the agent is manipulated to assemble a URL that makes use of an attacker’s area, with dynamically generated question parameters appended that comprise delicate information the mannequin is aware of concerning the consumer.”

Apart from rogue prompts, CNCERT has additionally highlighted three different considerations –

The chance that OpenClaw could inadvertently and irrevocably delete crucial info as a consequence of its misinterpretation of consumer directions.
Risk actors can add malicious expertise to repositories like ClawHub that, when put in, run arbitrary instructions or deploy malware.
Attackers can exploit lately disclosed safety vulnerabilities in OpenClaw to compromise the system and leak delicate information.

“For crucial sectors – resembling finance and vitality – such breaches might result in the leakage of core enterprise information, commerce secrets and techniques, and code repositories, and even end result within the full paralysis of whole enterprise programs, inflicting incalculable losses,” CNCERT added.

To counter these dangers, customers and organizations are suggested to strengthen community controls, forestall publicity of OpenClaw’s default administration port to the web, isolate the service in a container, keep away from storing credentials in plaintext, obtain expertise solely from trusted channels, disable automated updates for expertise, and preserve the agent up-to-date.

The event comes as Chinese language authorities have moved to limit state-run enterprises and authorities companies from operating OpenClaw AI apps on workplace computer systems in a bid to comprise safety dangers, Bloomberg reported. The ban can be mentioned to increase to the households of army personnel.

The viral recognition of OpenClaw has additionally led menace actors to capitalize on the phenomenon to distribute malicious GitHub repositories posing as OpenClaw installers to deploy info stealers like Atomic and Vidar Stealer, and a Golang-based proxy malware often known as GhostSocks utilizing ClickFix-style directions.

“The marketing campaign didn’t goal a selected business, however was broadly concentrating on customers trying to put in OpenClaw with the malicious repositories containing obtain directions for each Home windows and macOS environments,” Huntress mentioned. “What made this profitable was that the malware was hosted on GitHub, and the malicious repository grew to become the top-rated suggestion in Bing’s AI search outcomes for OpenClaw Home windows.”