Documentation Index
Fetch the complete documentation index at: https://base.bangwu.me/llms.txt
Use this file to discover all available pages before exploring further.
Browser Harness
Browser Harness is an open-source browser control layer from the Browser Use team. Its core idea is simple:- connect directly to Chrome through CDP
- keep the harness thin
- let the agent write missing helper code during the task
Why it is interesting
Compared with rigid browser automation flows, Browser Harness is designed for agentic use:- works with tools like Codex and Claude Code
- can connect to your real browser
- allows the agent to add helper functions while it is working
- supports free remote browsers for cloud or sub-agent scenarios
How it works
The project stays intentionally small:install.mdhandles first-time setupSKILL.mdexplains normal usagehelpers.pycontains the callable browser helper functionsdaemon.pyandadmin.pyhandle the CDP websocket and local bridge
helpers.py is not treated as fixed forever. The agent is expected to inspect it and extend it when needed.
In practice, the flow looks like this:
- Connect the harness to Chrome through CDP
- Ask the coding agent to complete a browser task
- If a helper is missing, the agent edits the harness
- Continue the task with the new helper available
When to use it
Browser Harness makes sense when you want:- a coding agent to operate a real browser for you
- interactive browser tasks rather than only deterministic regression tests
- a thinner layer than a large end-to-end testing stack
- browser work that can improve over time as helpers and skills accumulate
- logging into websites with your existing browser profile
- navigating complex dashboards
- submitting forms or uploading files
- doing repetitive browser workflows that are annoying to script upfront
When not to use it
If you need:- strict, reproducible end-to-end tests
- stable CI assertions
- browser automation with carefully fixed selectors and snapshots
Real browser vs remote browser
The project supports both:Real browser
Best for:- personal workflows
- sites where you are already logged in
- tasks that benefit from your existing cookies, sessions, and local browser state
Remote browser
Best for:- cloud execution
- sub-agents
- isolated sessions
- tasks that should not depend on your local machine
Practical setup notes
The README recommends a very direct setup pattern:- let the agent clone or open the repo
- read
install.mdfirst - read
SKILL.mdfor day-to-day usage - always inspect
helpers.py
Browser Use Box
Browser Use Box, also calledbux, is a useful pattern built on top of Browser Harness. Instead of running the agent only on your laptop, it runs a persistent Claude Code environment on a box you own, such as a VPS, Mac mini, or Raspberry Pi.
The pieces are simple:
claude -pdrives the agent loop- Browser Harness connects to a real Chromium session through CDP over WSS
- Browser Use Cloud hosts the browser session
- Telegram and
ttydprovide remote control surfaces /home/buxstores persistent state across restarts
How it fits into the current agent tooling landscape
There are now roughly three common layers for browser automation:- classic automation frameworks
Playwright, Puppeteer, Selenium - MCP/browser tools for assistants
tool-call-oriented browser control - agent-native browser harnesses
browser control designed for LLMs that reason, adapt, and patch their own workflow
My take
The interesting part is not just “LLMs can click a browser”. A lot of tools can already do that. The more durable idea is this:treat browser automation as an editable runtime surface for agents, not only as a fixed script written in advance.That is a useful mental model if you are building agent workflows around real websites.
