Browser (clawd-managed)

Clawdbot can run a dedicated Chrome/Chromium profile that the agent controls. It is isolated from your personal browser and is managed through a small local control server. Beginner view:

Think of it as a separate, agent-only browser.
It does not touch your personal Chrome profile.
The agent can open tabs, read pages, click, and type in a safe lane.

What you get

A separate browser profile named clawd (orange accent by default).
Deterministic tab control (list/open/focus/close).
Agent actions (click/type/drag/select), snapshots, screenshots, PDFs.
Optional multi-profile support (clawd, work, remote, …).

This browser is not your daily driver. It is a safe, isolated surface for agent automation and verification.

Quick start

clawdbot browser status
clawdbot browser start
clawdbot browser open https://example.com
clawdbot browser snapshot

If you get “Browser disabled”, enable it in config (see below) and restart the Gateway.

Configuration

Browser settings live in ~/.clawdbot/clawdbot.json.

{
  browser: {
    enabled: true,                    // default: true
    controlUrl: "http://127.0.0.1:18791",
    cdpUrl: "http://127.0.0.1:18792", // defaults to controlUrl + 1
    defaultProfile: "clawd",
    color: "#FF4500",
    headless: false,
    noSandbox: false,
    attachOnly: false,
    executablePath: "/Applications/Chromium.app/Contents/MacOS/Chromium",
    profiles: {
      clawd: { cdpPort: 18800, color: "#FF4500" },
      work: { cdpPort: 18801, color: "#0066CC" },
      remote: { cdpUrl: "http://10.0.0.42:9222", color: "#00AA00" }
    }
  }
}

Notes:

controlUrl defaults to http://127.0.0.1:18791.
If you override the Gateway port (gateway.port or CLAWDBOT_GATEWAY_PORT), the default browser ports shift to stay in the same “family” (control = gateway + 2).
cdpUrl defaults to controlUrl + 1 when unset.
attachOnly: true means “never launch Chrome; only attach if it is already running.”
color + per-profile color tint the browser UI so you can see which profile is active.

Local vs remote control

Local control (default): controlUrl is loopback (127.0.0.1/localhost). The Gateway starts the control server and can launch Chrome.
Remote control: controlUrl is non-loopback. The Gateway does not start a local server; it assumes you are pointing at an existing server elsewhere.
Remote CDP: set browser.profiles.<name>.cdpUrl (or browser.cdpUrl) to attach to a remote Chrome. In this case, Clawdbot will not launch a local browser.

Remote browser (control server)

You can run the browser control server on another machine and point your Gateway at it with a remote controlUrl. This lets the agent drive a browser outside the host (lab box, VM, remote desktop, etc.). Key points:

The control server speaks to Chrome/Chromium via CDP.
The Gateway only needs the HTTP control URL.
Profiles are resolved on the control server side.

Example:

{
  browser: {
    enabled: true,
    controlUrl: "http://10.0.0.42:18791",
    defaultProfile: "work"
  }
}

Use profiles.<name>.cdpUrl for remote CDP if you want the Gateway to talk directly to a Chrome instance without a remote control server.

Profiles (multi-browser)

Clawdbot supports multiple named profiles. Each profile has its own:

user data directory
CDP port (local) or CDP URL (remote)
accent color

Defaults:

The clawd profile is auto-created if missing.
Local CDP ports allocate from 18800–18899 by default.
Deleting a profile moves its local data directory to Trash.

All control endpoints accept ?profile=<name>; the CLI uses --browser-profile.

Isolation guarantees

Dedicated user data dir: never touches your personal Chrome profile.
Dedicated ports: avoids 9222 to prevent collisions with dev workflows.
Deterministic tab control: target tabs by targetId, not “last tab”.

Browser selection

When launching locally, Clawdbot picks the first available:

Chrome Canary
Chromium
Chrome

You can override with browser.executablePath. Platforms:

macOS: checks /Applications and ~/Applications.
Linux: looks for google-chrome, chromium, etc.
Windows: checks common install locations.

Control API (optional)

If you want to integrate directly, the browser control server exposes a small HTTP API:

Status/start/stop: GET /, POST /start, POST /stop
Tabs: GET /tabs, POST /tabs/open, POST /tabs/focus, DELETE /tabs/:targetId
Snapshot/screenshot: GET /snapshot, POST /screenshot
Actions: POST /navigate, POST /act
Hooks: POST /hooks/file-chooser, POST /hooks/dialog
Debugging: GET /console, POST /pdf

All endpoints accept ?profile=<name>.

Playwright requirement

Some features (navigate/act/ai snapshot, element screenshots, PDF) require Playwright. If Playwright isn’t installed, those endpoints return a clear 501 error. ARIA snapshots and basic screenshots still work.

How it works (internal)

High-level flow:

A small control server accepts HTTP requests.
It connects to Chrome/Chromium via CDP.
For advanced actions (click/type/snapshot/PDF), it uses Playwright on top of CDP.
When Playwright is missing, only non-Playwright operations are available.

This design keeps the agent on a stable, deterministic interface while letting you swap local/remote browsers and profiles.

CLI quick reference

All commands accept --browser-profile <name> to target a specific profile. Basics:

clawdbot browser status
clawdbot browser start
clawdbot browser stop
clawdbot browser tabs
clawdbot browser open https://example.com
clawdbot browser focus abcd1234
clawdbot browser close abcd1234

Inspection:

clawdbot browser screenshot
clawdbot browser screenshot --full-page
clawdbot browser screenshot --ref 12
clawdbot browser snapshot
clawdbot browser snapshot --format aria --limit 200
clawdbot browser console --level error
clawdbot browser pdf

Actions:

clawdbot browser navigate https://example.com
clawdbot browser resize 1280 720
clawdbot browser click 12 --double
clawdbot browser type 23 "hello" --submit
clawdbot browser press Enter
clawdbot browser hover 44
clawdbot browser drag 10 11
clawdbot browser select 9 OptionA OptionB
clawdbot browser upload /tmp/file.pdf
clawdbot browser fill --fields '[{"ref":"1","type":"text","value":"Ada"}]'
clawdbot browser dialog --accept
clawdbot browser wait --text "Done"
clawdbot browser evaluate --fn '(el) => el.textContent' --ref 7

Notes:

upload and dialog are arming calls; run them before the click/press that triggers the chooser/dialog.
upload can also set file inputs directly via --input-ref or --element.
snapshot defaults to ai when available; use --format aria for the accessibility tree.
click/type require a ref from snapshot (CSS selectors are intentionally not supported for actions).

Security & privacy

The clawd browser profile may contain logged-in sessions; treat it as sensitive.
For logins and anti-bot notes (X/Twitter, etc.), see Browser login + X/Twitter posting.
Keep control URLs loopback-only unless you intentionally expose the server.
Remote CDP endpoints are powerful; tunnel and protect them.

Troubleshooting

For Linux-specific issues (especially snap Chromium), see Browser troubleshooting.

Agent tools + how control works

The agent gets one tool for browser automation:

browser — status/start/stop/tabs/open/focus/close/snapshot/screenshot/navigate/act

How it maps:

browser snapshot returns a stable UI tree (AI or ARIA).
browser act uses the snapshot ref IDs to click/type/drag/select.
browser screenshot captures pixels (full page or element).
browser accepts:
- profile to choose a named browser profile (host or remote control server).
- target (sandbox | host | custom) to select where the browser lives.
- controlUrl sets target: "custom" implicitly (remote control server).
- In sandboxed sessions, target: "host" requires agents.defaults.sandbox.browser.allowHostControl=true.
- If target is omitted: sandboxed sessions default to sandbox, non-sandbox sessions default to host.
- Sandbox allowlists can restrict target: "custom" to specific URLs/hosts/ports.
- Defaults: allowlists unset (no restriction), and sandbox host control is disabled.

This keeps the agent deterministic and avoids brittle selectors.

Start Here

Install & Updates

CLI

Core Concepts

Gateway & Ops

Web & Interfaces

Providers

Automation & Hooks

Tools & Skills

Nodes & Media

Platforms

macOS Companion App

Reference & Templates

Experiments & Proposals

Browser

Browser (clawd-managed)

What you get

Quick start

Configuration

Local vs remote control

Remote browser (control server)

Profiles (multi-browser)

Isolation guarantees

Browser selection

Control API (optional)

Playwright requirement

How it works (internal)

CLI quick reference

Security & privacy

Troubleshooting

Agent tools + how control works

Start Here

Install & Updates

CLI

Core Concepts

Gateway & Ops

Web & Interfaces

Providers

Automation & Hooks

Tools & Skills

Nodes & Media

Platforms

macOS Companion App

Reference & Templates

Experiments & Proposals

​Browser (clawd-managed)

​What you get

​Quick start

​Configuration

​Local vs remote control

​Remote browser (control server)

​Profiles (multi-browser)

​Isolation guarantees

​Browser selection

​Control API (optional)

​Playwright requirement

​How it works (internal)

​CLI quick reference

​Security & privacy

​Troubleshooting

​Agent tools + how control works

Browser (clawd-managed)

What you get

Quick start

Configuration

Local vs remote control

Remote browser (control server)

Profiles (multi-browser)

Isolation guarantees

Browser selection

Control API (optional)

Playwright requirement

How it works (internal)

CLI quick reference

Security & privacy

Troubleshooting

Agent tools + how control works