Remix.run Logo
Show HN: Browser4 – an open-source browser engine for agents and concurrency(github.com)
6 points by galaxyeye a day ago | 4 comments

Hi HN,

I’d like to share an open-source project we’ve been working on for a while: Browser4.

The motivation came from a recurring frustration: most browser automation tools (Playwright, Selenium, Puppeteer) are excellent for human-written scripts, but start to show friction when used as a core execution layer for AI agents or at very high concurrency.

So instead of building “another wrapper around Playwright”, we experimented with a different direction: designing a browser engine where AI agents are first-class citizens.

### What Browser4 is

Browser4 is a browser automation engine built on native Chrome DevTools Protocol (CDP), with a focus on:

* Coroutine-safe concurrency (designed to run many browser sessions in parallel)

* Agent-oriented APIs (navigation, interaction, extraction as composable actions)

* Hybrid extraction: ML agent driven extraction + LLM extraction + structured selectors + an SQL-like DOM query language (X-SQL)

* Low-level control without Playwright-style abstraction overhead

It’s written in Kotlin/JVM, mainly because we needed predictable concurrency behavior and long-running stability under load.

The project is fully open-source (Apache 2.0).

### What it’s not

* It’s not a drop-in Playwright replacement.

* It’s not a no-code RPA tool.

* It’s not “LLM magic” — LLMs sit outside the browser engine.

Browser4 intentionally stays close to the browser execution layer and leaves planning/reasoning to external agent loops.

### Current use cases we’re testing

* Large-scale web data extraction

* Agentic workflows (search → navigate → extract → summarize)

* Price / content monitoring with frequent revisits

* High-concurrency crawling where browser startup and context switching are bottlenecks

On a single machine, we can sustain very high daily page visits, though we’re still validating benchmarks across different workloads.

### Open questions (where I’d love feedback)

* For agentic systems, does it make sense to bypass Playwright entirely and work closer to CDP?

* Where do you see the biggest pain points when combining LLMs with browser automation today?

* Is JVM a reasonable choice here, or is Python still the better tradeoff despite concurrency limits

* What abstractions would you want in a browser engine built for AI agents?

### Links

* GitHub: https://github.com/platonai/browser4

* Website (light overview): https://browser4.io

Happy to answer technical questions or hear criticism — especially from people running browser automation or agent systems in production.

Thanks for reading.

SkyRocknRoll a day ago | parent | next [-]

Very much excited about this. *Hybrid extraction: ML agent driven extraction

This is what I have been missing in the existing systems

galaxyeye 9 hours ago | parent [-]

Thank you for your comment. I think X-SQL based extraction could also be useful: its ideal for high-complexity data-extraction pipelines with multiple-dozen entities and several hundred fields per entity. What are your thoughts on this idea?

tomfox2 a day ago | parent | prev [-]

Therefore, we should not be paying for tokens, but for more effective agents.

galaxyeye 9 hours ago | parent [-]

Yes, that's why we develop it. And I think X-SQL based extraction could also be useful: its ideal for high-complexity data-extraction pipelines with multiple-dozen entities and several hundred fields per entity. What are your thoughts on this idea?