Microsoft Security Blog — Running OpenClaw safely

AI relevance: Self-hosted agent runtimes like OpenClaw execute untrusted inputs and third‑party skills with real credentials, creating a distinct security boundary for AI operations.

  • Microsoft warns that OpenClaw ships with limited built‑in security controls even as it ingests untrusted text, pulls third‑party skills, and acts with whatever credentials it is assigned.
  • That moves execution away from static, reviewed code toward dynamically supplied content and third‑party capabilities, without equivalent identity or input controls.
  • Three near‑term risks: credential/data exposure, poisoned persistent memory that steers future behavior, and host compromise via malicious code retrieval.
  • OpenClaw should be treated as untrusted code execution with persistent credentials, not as a normal workstation app.
  • Microsoft recommends running evaluations only in isolated VMs or dedicated hardware, with non‑privileged, dedicated credentials and no sensitive data access.
  • The post distinguishes runtime risk (OpenClaw) from platform risk (Moltbook), noting that a single malicious post can influence many agents.
  • They outline two converging supply‑chain threats: indirect prompt injection via untrusted content and malicious skills distributed through public registries.
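The second supply‑chain threat above — malicious skills from public registries — is the kind of thing an approval gate can blunt. A minimal sketch, assuming a simple name‑plus‑hash allowlist pinned at review time (the skill name, registry workflow, and helper names are illustrative assumptions, not OpenClaw APIs):

```python
import hashlib

# Hypothetical skill-install gate: install only skills whose name is on an
# approved list AND whose bundle hash matches the value pinned at review time.
APPROVED_SKILLS = {
    # name -> SHA-256 of the reviewed bundle (computed inline for this example)
    "web-summarizer": hashlib.sha256(b"reviewed bundle bytes").hexdigest(),
}

def verify_skill(name: str, bundle: bytes) -> bool:
    """Reject unreviewed skills and any bundle that no longer matches the
    hash recorded at approval time (substitution / tampering check)."""
    expected = APPROVED_SKILLS.get(name)
    if expected is None:
        return False  # not approved: never auto-install from a public registry
    return hashlib.sha256(bundle).hexdigest() == expected
```

Pinning a content hash rather than a version tag means a registry compromise that swaps the bundle behind an existing name still fails the check.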

Why it matters

  • Self‑hosted agent pilots can quietly become credentialed execution surfaces if teams treat them like normal dev tools.
  • Isolation, scoped identity, and monitoring become operational requirements for AI agent rollouts, not optional hardening.

What to do

  • Run OpenClaw in isolation (VM or separate host) with dedicated, low‑privilege credentials and no sensitive data access.
  • Gate skill installation with approvals, provenance checks, and least‑privilege tool scopes.
  • Monitor runtime behavior and prepare a rebuild plan for compromised agent environments.
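One concrete piece of the isolation and credential guidance above is refusing to let the agent process inherit the operator's full environment. A minimal launch‑wrapper sketch, assuming the agent reads its credential from an environment variable (the variable name, command name, and allowlist are illustrative assumptions, not OpenClaw configuration):

```python
import os

# Hypothetical launch wrapper for an isolated agent host: the child process
# receives only an explicit allowlist of environment variables, so it cannot
# inherit the operator's broader credential set.
def scoped_env(allowed: set) -> dict:
    """Return a copy of the environment containing only allowlisted keys."""
    return {k: v for k, v in os.environ.items() if k in allowed}

# AGENT_API_KEY stands in for a dedicated, low-privilege credential
# provisioned only for this sandbox.
env = scoped_env({"PATH", "HOME", "AGENT_API_KEY"})
# subprocess.run(["openclaw", "serve"], env=env)  # command name is illustrative
```

Combined with a VM or separate host, this keeps a compromised agent limited to the one disposable credential it was given.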

Sources