From Clawdbot to OpenClaw: When Automation Becomes a Digital Backdoor by Lucie Cardiet

After this article was first published, the project formerly known as Clawdbot and Moltbot completed another rebrand and is now called OpenClaw. The underlying agentic architecture and security considerations discussed here remain relevant under the new name.

---

Clawdbot emerged as one of the most talked-about open source AI agents of early 2026, not because it was another chatbot, but because it crossed a line many tools had not. It combined large language models with direct, autonomous access to operating systems, files, credentials, and messaging platforms.

But what users gained in productivity, attackers gained as a new attack surface.

And when Clawdbot rebranded to Moltbot amid trademark concerns, the risk profile shifted again. Not because the underlying agent changed, but because rapid popularity collides with operational unpreparedness. In the rush to rename repositories, domains, and social accounts, ownership gaps appear. Attackers moved faster than maintainers, hijacking abandoned identities and exploiting community trust within seconds.

The project’s shift to OpenClaw was accompanied by a renewed focus on security-first design and clearer warnings about the risks of autonomous system access, signalling that maintainers now acknowledge the hardening gap that early adopters exposed.

Now the tool’s explosive adoption works in the attacker’s favor. A fast-growing wave of users are eager to try it, clone it, integrate it, and run it with broad permissions before security questions are fully understood. That combination, high privilege, viral adoption, and momentary identity confusion, turns an already sensitive automation tool into a highly attractive target.

Screenshot of Peter Steinberger's post on X. — *Clawdbot, now Moltbot, was created by Peter Steinberger (@steipete*). He has repeatedly described it as a young hobby project, unfinished, less than three months old, and not intended for most non-technical users. It was built to inspire experimentation, not to operate as a hardened enterprise product.

Clawdbot is not designed to be exposed by default. If you are not comfortable hardening a server, understanding reverse proxy trust boundaries, and running high-privilege services with least-privilege controls, this is not something to deploy on a public VPS.

The project assumes a level of operational maturity that many users underestimate, and most of the real-world failures seen so far trace back to configuration, not exploits.

Screenshot of a terminal showing Clawdbot installation — During installation, Clawdbot makes this risk explicit. Users are shown a clear warning that the agent can run commands, access files, and act across enabled tools, and must explicitly opt in to proceed. Selecting “No” stops the installation entirely.

This is not a story about a broken AI project. It is a story about how modern attacks form when trust, automation, and identity move faster than security controls.

The same pattern is already visible in emerging agent communities, where autonomous systems interact with each other directly. Experiments like Moltbook show how quickly “harmless” agent collaboration turns into a new class of security blind spots.

To understand why Clawdbot attracts attackers so quickly, it helps to look at how the agent actually operates once it is deployed.

TL;DR

Moltbot turns automation into a high-privilege attack surface when exposed or misconfigured.
Most real-world compromises stem from configuration errors and trust abuse, not zero-days.
Once compromised, the agent enables credential theft, lateral movement, and ransomware.
Autonomous AI agents must be treated as privileged infrastructure, not productivity tools.
If you cannot harden and monitor it, do not expose it.

The rest of this article explains how Moltbot became an attractive target so quickly, how attackers abuse it in practice, and what this signals about the future of agentic AI security. Readers looking for operational guidance can jump to the hardening section near the end.

The AI Agent as a Persistent Shadow Superuser

Clawdbot was designed to run on user-controlled infrastructure, laptops, home servers, cloud instances, and even single-board computers. It can execute shell commands, read and write files, browse the web, send emails or chat messages, and maintain persistent memory across sessions. In practice, it acts as a highly privileged automation layer, often holding API keys, OAuth tokens, chat credentials, and sometimes root-level access. This design collapses multiple trust boundaries into a single system. Messaging platforms, local operating systems, cloud APIs, and third-party tools all converge through one autonomous agent. Once deployed, Clawdbot is no longer just an application, it becomes part of the environment’s security fabric.

Screenshot of Moltbot's features — *Source:* *Moltbot*

From an attacker’s perspective, this is ideal. Compromise the agent once, inherit everything it can access, across environments.

> Read a real incident write-up on how attackers used automation and identity abuse to reach full administrative control in AWS in 8 minutes

Once that level of access exists, attackers do not need new techniques, they simply reuse familiar ones through a far more capable intermediary.

Clawdbot is rarely deployed in isolation. As adoption grows, these agents increasingly operate alongside other autonomous systems, reading shared content, exchanging context, and reacting to each other’s outputs. Platforms like Moltbook make this visible by showing how agent-only communities form, where bots consume untrusted input from other bots and propagate behavior without human oversight. When those same coordination patterns extend beyond communication into incentives and delegation, experiments like Molt Road preview how agents can outsource tasks, exchange services, and automate trust through escrow and reputation mechanisms.

How attackers abuse Clawdbot in practice

Once Clawdbot – now OpenClaw – is deployed, the question is no longer if it can be abused, but how. Some techniques are already being reported (misconfiguration exposure and supply‑chain tricks), while others—like prompt injection—are well‑documented in research and become realistic when agents can read untrusted content and execute tools.

Initial Access: Where the Agent Becomes the Entry Point

The most common way Clawdbot is compromised is not through a clever exploit, but through simple exposure. The Control UI is an administrative interface meant to be accessed locally. In practice, some users accidentally make it reachable from the public internet through misconfiguration, open ports, or unsafe proxy setups.

When that happens, a local admin dashboard can start behaving like a remote control panel.

Part of the confusion comes from how the word exposed is used. In practice, there are meaningful differences:

Internet-facing: reachable on a public IP and port.
Fingerprinted: detectable by Shodan or Censys because the HTTP response matches Moltbot or Clawdbot signatures, even if authentication is enabled.
Unauthenticated or bypassed: the Control UI can be accessed without valid pairing or credentials, often due to misconfiguration. This is the dangerous case.

This distinction matters. Once an administrative interface is reachable from the public internet, discovery becomes easy. Search engines like Shodan can surface these systems quickly, turning a single configuration mistake into a widespread problem.

Screenshot of the Shodan interface listing results for the "Clawdbot" query — *Screenshot showing large numbers of Clawdbot results in Shodan. Source:* *Shodan.io*

‍

Public screenshots showing large numbers of Clawdbot results can be misleading. Many represent systems that are visible but still protected. The real risk lies in the smaller number of cases where authentication is missing or ineffective.

In those situations, the Control UI is reachable from the public internet and attackers may be able to view configuration data, access conversation history, or issue commands, depending on how the agent was configured.

‍

Screenshot of a Clawdbot UI and its source code — *Screenshot of an exposed Clawdbot Control UI. Source:* X

‍

Even when users believe authentication is enabled, unsafe defaults and proxy misconfigurations have allowed attackers to sidestep it entirely. No zero-day is required. A single header rewrite or forwarding rule can collapse the boundary between trusted and untrusted access. Once inside, attackers inherit everything the agent can do.

Not all initial access requires network exposure or misconfiguration.

Social engineering and prompt injection

Clawdbot’s ability to read emails, chat messages, and documents creates a new attack surface where the payload is language, not malware.

Prompt injection has been demonstrated in research and practical agent setups: a crafted message can sometimes steer an agent toward leaking sensitive data or performing unintended actions, even when the attacker never touches the host directly.

Screenshot of a Reddit post about a malicious prompt injection command in Moltbot — Screenshot of a Reddit post mentioning that a malicious Moltbot “skill” was published to a public repository and presented as a helpful tool. It instructed users to copy and paste a command into their terminal. Anyone who followed those instructions would unknowingly run a remote script on their own machine. Source: *Reddit*

‍

The defense is less about “patching a bug” and more about limiting what the agent can do, who can talk to it, and how it handles untrusted content.

When attackers do gain a foothold on the host, Clawdbot’s local design creates a second, quieter path to takeover.

Malware targeting Clawdbot data

Endpoint malware and infostealers don’t need a Clawdbot‑specific exploit to benefit from agent deployments. If tokens, OAuth artifacts, or agent state are stored locally—and especially if they’re stored in plaintext—then a routine endpoint compromise can escalate into takeover of the agent and every connected service it can access.

*Multiple infostealer and threat-intel reports have discussed credential-stealing malware harvesting local app/config data; coverage and specifics vary over time. Screenshot source:* *HudsonRock*

‍

In other cases, attackers never touch the host directly, they arrive through trusted extensions and integrations.

Supply chain abuse, plugins and malicious extensions

Clawdbot’s plugin model introduces another initial access vector. Extensions run as code on the same machine as the agent and inherit its permissions. A malicious plugin, or a compromised legitimate one, functions as instant remote code execution. The agent becomes the delivery mechanism.

Beyond the core software, attackers have also abused the surrounding ecosystem. A fake Clawdbot VS Code extension was recently used to install the ScreenConnect remote access trojan, leveraging developer trust in the project name to deliver full remote control. The extension looked legitimate, carried the right branding, and relied on timing and name recognition rather than a software vulnerability.

*The fake ClawdBot extension. Source:* *Aikido*

‍

This class of attack becomes more effective during rebrands. When project names, repositories, and domains change quickly, attackers exploit the confusion by publishing lookalike repos or extensions that appear legitimate. The compromise often happens before any code is reviewed, because trust is assumed based on name and timing.

Defense here is less about patching and more about identity verification. After a rebrand, installations should be limited to the official GitHub organization and project domain. Newly created or sponsored repositories deserve extra scrutiny, and plugins or skills should be restricted to curated allowlists.

The same applies to developer tooling. For editor extensions, verify the publisher, review history, and update cadence, and check hashes or signatures when available. In this ecosystem, supply chain trust is part of the security boundary.

When a Website Becomes the Payload

Not all agent compromises require installing anything at all. Recent research has shown that a simple website visit can be enough once an autonomous agent is allowed to browse the web and invoke tools. In one documented case, a malicious page triggered one-click remote code execution and theft of agent data and API keys by abusing the agent’s trust in web content, not by exploiting a traditional software vulnerability.

A video of the exploit in action: pic.twitter.com/fcj91a4Ush
— International Cyber Digest (@IntCyberDigest) February 1, 2026

Demo of the exploit

This matters because it removes friction entirely. There is no plugin to vet, no binary to inspect, and no supply chain to audit. The web itself becomes an input channel, and language becomes the payload.

Once an attacker can influence what the agent reads and how it interprets that input, the next step is control. The same mechanisms that allow the agent to fetch pages, follow links, and execute actions can be repurposed into a live command channel, without introducing any new tooling at all.

Command-and-Control: Turning the Agent into the C2 Channel

If an attacker gains access to the Control UI or any channel that can run tools, Clawdbot itself becomes the command-and-control framework. Through the control interface or API, they can issue shell commands, collect output, and operate interactively. The risk is not that Moltbot magically breaks security by itself; it is that a compromised agent already has execution, persistence, and communications pathways baked in.

‍

*Screenshot of Clawdbot dumping various API keys in plaintext. Source:* X

‍

Abusing legitimate integrations

Clawdbot can send messages through Slack, Telegram, email, and other platforms using legitimate credentials. Attackers can abuse these integrations to exfiltrate data or receive instructions, blending seamlessly into normal traffic. From a detection standpoint, this looks like expected automation behavior, not C2 traffic.

Man-in-the-middle at the cognitive layer

If an attacker can modify the agent’s configuration, prompts, or stored memory (for example after gaining filesystem access), they can try to influence what the user sees and what the agent does.

In practice this could mean nudging summaries, suppressing alerts, or inserting hidden instructions that trigger later. This is best treated as a potential outcome of compromise, not a guaranteed capability in a properly isolated setup.

Control of the agent also collapses traditional privilege boundaries.

Privilege Escalation and Credential Abuse

If Clawdbot runs as a standard user, attackers can use it to execute traditional privilege escalation techniques. In many deployments, escalation is unnecessary because the agent already runs with elevated permissions, particularly in containerized or single-user setups.

‍

Clawdbot centralizes credentials by design. API keys, OAuth tokens, chat credentials, cloud secrets, and sometimes VPN access all live in one place. Once harvested, these credentials allow attackers to impersonate users, access cloud resources, and move laterally without touching the original system again.

In hybrid environments, this impact multiplies. A compromised agent running on a laptop can expose enterprise SaaS access. One running in the cloud may hold IAM permissions that allow attackers to create infrastructure, access data stores, or tamper with CI/CD pipelines. The agent becomes a bridge between environments that were never meant to be directly connected.

With access secured, attackers focus on staying in place without being noticed.

Persistence: When the Agent Remembers the Attacker

Because agents often keep long-term state on disk, a compromised host can turn that state into persistence. If an attacker can write to the agent’s memory or configuration, they may leave behind instructions or artifacts that survive restarts and keep causing harmful actions until the system is rebuilt and secrets are rotated.

Because Clawdbot is designed to run continuously and perform tasks on a schedule, attackers can embed recurring actions. Data exfiltration, system enumeration, or outbound communications can be triggered automatically, without further interaction.

With shell access, attackers can also fall back on classic persistence methods, scheduled tasks, startup scripts, or additional backdoors. The difference is that Clawdbot often hides these actions behind legitimate automation, making forensic analysis more difficult.

The real damage begins when the compromised agent is used as a bridge to everything else.

Lateral Movement and Broader Impact

From a single compromised agent, attackers can scan internal networks, reuse credentials, and move laterally. And since Clawdbot frequently spans personal, enterprise, and cloud contexts, compromise in one domain often leads to compromise in all three.

Stolen tokens allow attackers to pivot into SaaS platforms, internal collaboration tools, and cloud environments. They can impersonate users, send trusted messages, and spread further access through social channels that defenders rarely monitor as attack paths.

At the extreme end, attackers can use Clawdbot to deploy ransomware, destroy data, or sabotage operations. The agent already has file access, execution rights, and communication channels. Every destructive action is simply another “task” for the assistant to complete.

Taken together, these behaviors expose a broader shift in how modern attacks unfold.

When Automation Becomes an Attack Surface

Clawdbot captures both the promise and the risk of autonomous AI. It delivers powerful automation, but that same autonomy creates a high-value attack surface. Exposed interfaces, malicious inputs, and adapted malware allow attackers to repurpose the agent for command and control, privilege abuse, persistence, and lateral movement. These techniques blur the line between traditional intrusion paths and AI-driven abuse, with impact spanning personal systems, cloud environments, and enterprise networks. A compromised agent can enable anything from targeted data theft to full-scale ransomware, depending on what it is connected to.

Clawdbot/Moltbot should be approached like highly privileged infrastructure — not a casual weekend experiment — because it centralizes credentials and can execute actions across accounts and devices. Strong authentication, restricted network exposure, and careful handling of secrets are table stakes. Organizations need visibility into where these agents run, apply segmentation, and monitor for anomalous behavior such as unsolicited actions or unexpected communications. Prompt injection adds another layer of risk, where abuse may be invisible unless agent behavior itself is monitored.

Clawdbot behaves like a shadow superuser inside the environment. As autonomous agents become more common, the lesson is straightforward: capability without control is exposure. The responsibility now sits with users and security teams to secure these agents before attackers do it for them.

Understanding this shift is only the first step. Reducing risk requires treating Moltbot like any other privileged system and applying deliberate controls around exposure, identity, execution, and monitoring.

Clawdbot shows how a single autonomous agent can become a shadow superuser when trust and permissions collapse. But this risk does not stop at individual deployments.

As agents begin interacting with each other in shared environments, such as agent-only forums and social networks, the attack surface expands again. Platforms like Moltbook illustrate what happens when autonomous agents read, trust, and influence each other at scale, turning language, reputation, and interaction into new paths for compromise.

How to Reduce Risk When Running Moltbot

Autonomous agents do not fail gracefully. Once deployed with broad access, small configuration mistakes can have outsized impact. The goal of hardening is not to make Moltbot “safe,” but to constrain blast radius, reduce exposure, and make abuse detectable.

1. Baseline hardening checklist

Area	Safe default	Why it matters
Control UI access	Bind to localhost. Access only via SSH tunnel or VPN. Never public by default.	Prevents internet discovery and reduces brute-force or auth bypass risk.
Reverse proxy	Configure trusted proxies and real client IPs. Never treat all proxy traffic as localhost.	Avoids “remote looks local” mistakes that collapse authentication boundaries.
Channels (Telegram, Discord, etc.)	Deny-by-default allowlist for users, servers, and channels. Disable public DMs.	Prevents channels from becoming remote triggers for high-privilege actions.
Tools and OS permissions	Run non-root. No privileged Docker. No host mounts. Require confirmation for shell, file write, and browser actions.	Limits blast radius if the agent is tricked or a UI or channel is abused.

2. Host and infrastructure isolation

Do not expose the Control UI to the public internet. Prefer a VPN such as Tailscale or WireGuard, or SSH tunneling to localhost. Isolate the agent from production workloads.
Do not run Moltbot on the same VPS that hosts backend services, databases, or personal files.
Harden SSH on any VPS. Disable password authentication and root login, use keys only, and enable firewall rules plus rate limiting or fail2ban.
Run the agent as a non-root user. Avoid privileged containers and never mount the host filesystem or Docker socket.
Patch regularly. Apply OS updates, refresh container images, and rotate long-lived keys.

3. Control UI and gateway configuration

Bind the gateway and Control UI to 127.0.0.1 unless remote access is strictly required.
Enable Control UI authentication and verify reverse proxy settings so remote clients are not treated as localhost.
Log and alert on new device pairings, configuration changes, and tool execution events.

4. Channels and integrations

Use strict allowlists for who can message the bot. Default should be deny-by-default.
Disable or limit high-risk tools such as shell access, file writes, and browser automation unless absolutely required.
Use separate accounts with least-privilege scopes for integrations, such as dedicated Gmail accounts, restricted Slack bot scopes, and limited GitHub tokens.
Treat Signal device-linking QR codes and URIs like passwords. Do not leave pairing artifacts in world-readable locations, and rotate or relink if exposed.
Prefer per-service bot accounts and restrict exposure. Avoid public Discord servers and lock direct messages to an allowlist.

5. Secrets and data handling

These are common default locations to audit on your host. Treat them as sensitive and lock down permissions:

Area	Key controls
Host and infrastructure isolation	Do not expose the Control UI to the public internet. Prefer a VPN (Tailscale, WireGuard) or SSH tunneling to localhost. Isolate the agent from production workloads. Do not run Moltbot on the same VPS as backend services, databases, or personal files. Harden SSH on any VPS. Disable password auth and root login, use keys only, and enable firewall rules plus rate limiting or fail2ban. Run the agent as a non-root user. Avoid privileged containers and never mount the host filesystem or Docker socket. Patch regularly. Apply OS updates, refresh container images, and rotate long-lived keys.
Control UI and gateway configuration	Bind the gateway and Control UI to 127.0.0.1 unless remote access is strictly required. Enable Control UI authentication and verify reverse proxy settings so remote clients are not treated as localhost. Log and alert on new device pairings, configuration changes, and tool execution events.
Channels and integrations	Use strict allowlists for who can message the bot. Default should be deny-by-default. Disable or limit high-risk tools such as shell access, file writes, and browser automation unless required. Use separate accounts with least-privilege scopes for integrations, such as dedicated Gmail accounts or restricted Slack bot scopes. Treat Signal device-linking QR codes and URIs like passwords. Rotate or relink if exposed. Prefer per-service bot accounts. Avoid public Discord servers and lock direct messages to an allowlist.
Secrets and data handling	Treat default agent paths and configuration files as sensitive. Lock down file permissions, encrypt storage where feasible, rotate tokens aggressively after exposure, and limit retention of logs and conversation history.

Additional controls:

Treat the agent like a secrets manager. Lock down file permissions and encrypt storage where feasible.
Rotate and revoke tokens aggressively after any suspected exposure. Prefer short-lived OAuth tokens over long-lived API keys.
Limit retention of conversation history and attachments. A simple default is 7 to 14 days for session logs, rotated or deleted weekly, with encryption at rest if longer retention is required.

6. Prompt injection and untrusted content

Assume any email, document, web page, or chat message may contain hostile instructions.
Do not allow untrusted text to directly trigger shell commands or credential exports.
Use per-channel tool policies. For example, allow summarization on email or Slack, but deny shell or file-write actions from Telegram or Discord.
Require human confirmation for high-risk actions such as running commands, exporting files, changing settings, or initiating logins.
Treat browser automation as high risk. Use a separate, logged-out browser profile for the agent and never reuse a personal session.
A useful pattern is content-origin tagging combined with per-origin tool policy and mandatory confirmation for high-risk tools.

7. Monitoring and detection, minimum viable coverage

ATT&CK shorthand for defenders

Initial access: exposed Control UI or supply chain abuse
Execution: shell and tool invocations
Credential access: local tokens and config files
Persistence: modified configs, memory, or plugins
Command-and-control: chat channels
Exfiltration: uploads and webhooks

Minimum signals to watch

Control UI authentication failures, new device pairings, and configuration changes
Tool invocations such as shell runs, file reads and writes, browser automation, and outbound uploads
Unexpected filesystem changes under ~/.clawdbot/* and the agent workspace
Network activity such as new outbound domains, spikes in messaging, or unusual webhook traffic
Host signals such as firewall logs on Moltbot ports, SSH login events, and unusual child processes

8. If you suspect exposure: quick response

Stop the service and preserve evidence. Snapshot the VM or container and collect system logs plus ~/.clawdbot/*.
Rotate and revoke credentials. Replace tokens, revoke OAuth sessions, rotate SSH keys, and reissue long-lived API keys.
Invalidate access paths. Disable exposed channels, reset allowlists, and rotate Control UI pairing or passwords.
Clear sensitive history where appropriate. Purge session transcripts if they contain secrets, then enforce short retention.
Rebuild if in doubt. If the host was reachable without strong authentication or command execution is suspected, rebuild from a clean image and relink channels carefully.
Post-check persistence points such as systemd services, crontab, SSH authorized keys, and installed plugins or extensions.

What to take away

Treat Moltbot like privileged infrastructure. It holds secrets, runs commands, and communicates across trusted channels.
Most failures are configuration issues, not exploits. Public Control UIs, weak proxy settings, open channels, and overpowered tools account for the majority of incidents.
Identity is part of the attack surface. Trust only official organizations, domains, and extensions, especially during rebrands.
If you cannot harden it, do not expose it. Keep the Control UI on localhost or VPN, restrict channels, and require confirmation for risky actions.

Defending against this model requires visibility into behavior, not just assets. This is where the Vectra AI Platform becomes critical, giving security teams the ability to detect attacker behaviors that emerge when trusted automation is abused across identity, network, cloud, and hybrid environments, before those behaviors escalate into full compromise.

‍

---

Sources & further readings:

Official Moltbot/Clawdbot security guidance (hardening, reverse proxy, auth): https://docs.clawd.bot/gateway/security
Official project site: https://molt.bot (also https://clawd.bot) (also https://openclaw.ai/)
Official GitHub repository/org: https://github.com/moltbot/clawdbot (org page: https://github.com/clawdbot/)
Chirag (@mrnacknack) educational write‑up on common misconfigurations and prompt‑injection paths (use as secondary context): https://x.com/mrnacknack/article/2016134416897360212 (see also @theonejvo’s threads).
Cointelegraph coverage of exposed instances and credential leakage risk (via TradingView mirror): https://www.tradingview.com/news/cointelegraph:99cbc6b7d094b:0-viral-ai-assistant-clawdbot-risks-leaking-private-messages-credentials/
Binance News summary of the GitHub/X hijack + token scam confusion: https://www.binance.com/en/square/post/01-27-2026-clawdbot-founder-faces-github-account-hijack-by-crypto-scammers-35643613762385
DEV Community timeline on the rename and scam impersonation wave: https://dev.to/sivarampg/from-clawdbot-to-moltbot-how-a-cd-crypto-scammers-and-10-seconds-of-chaos-took-down-the-internets-hottest-ai-project-4eck
Aikido Security analysis of a fake Clawdbot VS Code extension: https://www.aikido.dev/blog/fake-clawdbot-vscode-extension-malware
Project changelog (for tracking security-related changes): https://github.com/clawdbot/clawdbot/blob/main/CHANGELOG.md
Secondary reporting on infostealer interest in Clawdbot/Moltbot ecosystems (directional, not definitive): https://www.infostealers.com/article/clawdbot-the-new-primary-target-for-infostealers-in-the-ai-era/
1-Click RCE To Steal Your Moltbot Data and Keys (CVE-2026-25253) https://blog.virustotal.com/2026/02/from-automation-to-infection-how.html
1-Click RCE via Authentication Token Exfiltration From gatewayUrl: https://github.com/openclaw/openclaw/security/advisories/GHSA-g8p2-7wf7-98mq

From Clawdbot to OpenClaw: When Automation Becomes a Digital Backdoor