Driving the browser, four ways · A comparative study

Driving the browser,
four ways.

Vercel's agent-browser, Microsoft's Playwright MCP, Google's Chrome DevTools MCP, and the lighter Browser MCP: four tools that let coding agents drive a real browser, built around different bets about transport (CLI vs MCP), runtime, and how much debugging you actually want in the loop.

×4

~100k

Combined stars

All four repos, GitHub, May 2026.

Transports

CLI invoked from shell, or MCP server speaking to the agent over stdio.

Cross-browser

Only Playwright MCP. Everything else is Chrome-only.

Approaches to auth

Isolated profile, extension bridge, or live-attach to your real Chrome.

No. 01 · AB CLI · Rust · driving

agent-browser

Vercel Labs · MIT · agent-browser.dev

33k

★ Stars

50+

Commands

Rust

Runtime

CLI

Transport

A native Rust binary that talks to Chrome via the Chrome DevTools Protocol through a persistent local daemon. The agent calls it like any other shell command, agent-browser open example.com, agent-browser snapshot -i. The snapshot returns a compact accessibility tree with refs (@e1, @e2) the agent uses for deterministic targeting. The big practical wins are no Node dependency, instant cold start, and the fact that any agent that can run shell, Claude Code, Codex, Cursor, opencode, Copilot, can drive it without an MCP integration.

Pick this when

You want the lightest possible dependency footprint, you're driving the browser (not debugging it), and you want a tool any agent that runs shell can use without MCP wiring.

No. 02 · PW MCP · Node · driving · cross-browser

Playwright MCP

Microsoft · Apache-2.0 · github.com/microsoft/playwright-mcp

30k

★ Stars

25+

MCP tools

Node

Runtime

Browsers

The most established option, and the only one that does real cross-browser: Chromium, Firefox, WebKit, plus 143 device emulation profiles for mobile testing. Same accessibility-snapshot approach as agent-browser, just delivered as an MCP server: every tool definition lives in the agent's context, which means richer affordances at the cost of more tokens. Connection to an existing logged-in browser goes through a Playwright MCP Bridge extension rather than Chrome's native remote-debug path, which is workable but more setup.

Pick this when

You need real cross-browser testing (Firefox or WebKit), your team already lives in Playwright, or your agent flow is part of a broader Playwright test suite.

No. 03 · CD MCP · Node · debugging · perf

Chrome DevTools MCP

Chrome team / Google · Apache-2.0 · 0.20.3

37k

★ Stars

MCP tools

Node

Runtime

Chrome only

The debugging-side champion. Ships everything the other three ship plus the dedicated DevTools surface: performance_start_trace, network waterfall inspection, source-mapped console errors, CrUX field-data integration, Lighthouse audits. --autoConnect (Chrome 144+) lets the agent attach to your already-running Chrome with explicit user approval, same auth, same cookies, same tabs. The cost: full mode burns ~18k tokens of tool definitions. A --slim mode trims to 3 core tools and ~6k tokens.

Pick this when

The agent's job involves diagnosing what's wrong: perf traces, failed requests, console errors, or when you want it operating inside your real, logged-in Chrome session via --autoConnect.

No. 04 · BM MCP · Node + extension · session reuse

Browser MCP

browsermcp.io · MIT · extension-bridge

~7k

★ Stars

~15

MCP tools

Node

Runtime

Chrome only

The lightweight specialist. Pairs a small MCP server with a browser extension that exposes your current tab to the agent. The whole proposition is reuse what's already in your browser: your logins, your cookies, the page you're currently on, without spawning a separate Chrome instance or fiddling with remote-debugging ports. Smaller tool surface than the Microsoft or Google servers; no perf or network introspection. Useful when the agent's job is to do something on a page you're already authenticated on.

Pick this when

The flow is fundamentally "act on the tab I'm already looking at": an authenticated dashboard, an internal tool, and you don't need perf or network depth.

Dimension	agent-browser Vercel Labs	Playwright MCP Microsoft	Chrome DevTools MCP Chrome team	Browser MCP browsermcp.io
Transport	CLI: invoked per command from the shell	MCP server (stdio)	MCP server (stdio)	MCP server + browser extension
Runtime	Native Rust binary, no Node required	Node.js (`npx @playwright/mcp`)	Node.js (`npx chrome-devtools-mcp`)	Node.js + extension
Token cost in prompt	~0 (no tool defs registered; agent reads `--help`)	~12k (25+ tool defs)	~18k full / ~6k slim	~5k (smaller surface)
Cross-browser	Chromium only	Chromium, Firefox, WebKit + 143 device profiles	Chrome only	Chrome only
Page targeting	Accessibility-tree refs (`@e1`, `@e2`)	Accessibility-tree snapshots	Accessibility-tree + CDP IDs	Accessibility-tree via extension
Perf / debugging tools	No (driving only)	No (driving only)	Yes: performance traces, network, Lighthouse, CrUX	No
Reuse your real Chrome session	Persistent profile via daemon	Via `--extension` bridge	Native via `--autoConnect` (Chrome 144+)	Native via extension
Setup floor	`npm i -g` or `brew install`	MCP config in agent + `npx`	MCP config in agent + `npx`	MCP config + install extension
Works with	Any shell-capable agent: Claude Code, Codex, Cursor, Copilot, Gemini, opencode	MCP-aware agents (Claude Code, Cursor, Copilot, Cline, VS Code)	MCP-aware agents (Claude Code, Cursor, Gemini CLI, Antigravity)	MCP-aware agents
License	MIT	Apache-2.0	Apache-2.0	MIT

Dimension

agent-browser Vercel Labs

Playwright MCP Microsoft

Chrome DevTools MCP Chrome team

Browser MCP browsermcp.io

Transport

CLI: invoked per command from the shell

MCP server (stdio)

MCP server + browser extension

Runtime

Native Rust binary, no Node required

Node.js (npx @playwright/mcp)

Node.js (npx chrome-devtools-mcp)

Node.js + extension

Token cost in prompt

~0 (no tool defs registered; agent reads --help)

~12k (25+ tool defs)

~18k full / ~6k slim

~5k (smaller surface)

Cross-browser

Chromium only

Chromium, Firefox, WebKit + 143 device profiles

Chrome only

Page targeting

Accessibility-tree refs (@e1, @e2)

Accessibility-tree snapshots

Accessibility-tree + CDP IDs

Accessibility-tree via extension

Perf / debugging tools

No (driving only)

Yes: performance traces, network, Lighthouse, CrUX

Reuse your real Chrome session

Persistent profile via daemon

Via --extension bridge

Native via --autoConnect (Chrome 144+)

Native via extension

Setup floor

npm i -g or brew install

MCP config in agent + npx

MCP config + install extension

Works with

Any shell-capable agent: Claude Code, Codex, Cursor, Copilot, Gemini, opencode

MCP-aware agents (Claude Code, Cursor, Copilot, Cline, VS Code)

MCP-aware agents (Claude Code, Cursor, Gemini CLI, Antigravity)

MCP-aware agents

License

MIT

Apache-2.0

MIT

The interesting question isn't which is best: it's whether the MCP-server era is permanent.

agent-browser is the most architecturally distinct of the four, and the one most worth watching. Not because the Rust binary is faster than Node, that's a small win, but because it's a public bet that CLI is the right shape for agent tooling, and that MCP's tool-definitions-in-prompt model is the wrong shape for anything that doesn't need persistent state.

If that bet is right, the next year of agent tooling looks more like Unix and less like RPC: small composable binaries the agent learns on demand, rather than always-on servers that consume context just by existing. If it's wrong, MCP wins on developer ergonomics and the token cost gets absorbed by cheaper models. The empirical answer is probably both, for different jobs, but the argument is live, and worth watching.

Project knowledge · related

html-effectiveness-catalog: the visual register and card conventions used in this artifact.

External · cited

Steve Kinney, "Playwright vs. Chrome DevTools MCP: Driving vs. Debugging", source of the framing in §01.

Primary sources

agent-browser.dev · github.com/microsoft/playwright-mcp · github.com/ChromeDevTools/chrome-devtools-mcp · browsermcp.io

Caveat

Star counts and tool counts are accurate as of May 2026 and will drift; the architectural shape of each tool is more stable than the numbers.

Driving the browser,
four ways.

The CLI vs MCP question is the real one.

Four tools, four bets.

agent-browser

Playwright MCP

Chrome DevTools MCP

Browser MCP

The comparison matrix.

If this, then that.

The interesting question isn't which is best: it's whether the MCP-server era is permanent.

Driving the browser, four ways.

The CLI vs MCP question is the real one.

Four tools, four bets.

agent-browser

Playwright MCP

Chrome DevTools MCP

Browser MCP

The comparison matrix.

If this, then that.

The interesting question isn't which is best: it's whether the MCP-server era is permanent.

Driving the browser,
four ways.