Show HN: BrowserOS – "Claude Cowork" in the browser (github.com)
41 points by felarof 8 hours ago | 21 comments
Hey HN! We're Nithin and Nikhil, twin brothers building BrowserOS (YC S24). We're an open-source, privacy-first alternative to the AI browsers from big labs.

The big differentiator: on BrowserOS you can use local LLMs or BYOK and run the agent entirely on the client side, so your company/sensitive data stays on your machine!

Today we're launching filesystem access... just like Claude Cowork, our browser agent can read files, write files, and run shell commands! But honestly, we didn't plan for this. It turns out the privacy decision we made 9 months ago accidentally positioned us for this moment.

The architectural bet we made 9 months ago: Unlike other AI browsers (ChatGPT Atlas, Perplexity Comet) where the agent loop runs server-side, we decided early on to run our agent entirely on your machine (client side).

But building everything on the client side wasn't smooth. We initially built our agent loop inside a Chrome extension. But we kept hitting walls -- the service worker is single-threaded JS, and we had no access to Node.js libraries. So we made the hard decision 2 months ago to throw away everything and start from scratch.

In the new architecture, our agent loop sits in a standalone binary that we ship alongside our Chromium. And we use gemini-cli for the agent loop with some tweaks! We wrote a neat adapter to translate between Gemini format and Vercel AI SDK format. You can look at our entire codebase here: https://git.new/browseros-agent

How we give browser access to filesystem: When Claude Cowork launched, we realized something: because Atlas and Comet run their agent loop server-side, there's no good way for their agent to access your files without uploading them to the server first. But our agent was already local. Adding filesystem access meant just... opening the door (with your permissions ofc). Our agent can now read and write files just like Claude Code.
What you can actually do today:

a) Organize files in my desktop folder https://youtu.be/NOZ7xjto6Uc

b) Open top 5 HN links, extract the details and write a summary into an HTML file https://youtu.be/uXvqs_TCmMQ

---

Where we are now

If you haven't tried us since the last Show HN (https://news.ycombinator.com/item?id=44523409), give us another shot. The new architecture unlocked a ton of new features, and we've grown to 8.5K GitHub stars and 100K+ downloads:

c) You can now build more reliable workflows using an n8n-like graph https://youtu.be/H_bFfWIevSY

d) You can also use BrowserOS as an MCP server in Cursor or Claude Code https://youtu.be/5nevh00lckM

We are very bullish on the browser being the right platform for a Claude Cowork-like agent. The browser is the most commonly used app by knowledge workers (emails, docs, spreadsheets, research, etc). And even Anthropic recognizes this -- for Claude Cowork, they have a janky integration with the browser via a Chrome extension. But owning the entire stack allows us to build differentiated features that wouldn't be possible otherwise. Ex: Browser ACLs. Agents can do dumb or destructive things, so we're adding browser-level guardrails (think IAM for agents): "role(agent): can never click buy" or "role(agent): read-only access on my bank's homepage."

Curious to hear your take on this and the overall thesis. We’ll be in the comments. Thanks for reading!

GitHub: https://github.com/browseros-ai/BrowserOS

Download: https://browseros.com (available for Mac, Windows, Linux!)
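
To make the "IAM for agents" idea concrete, here is a minimal sketch of how the two example rules quoted in the post might be checked before an agent action runs. It is a hypothetical illustration in Python, not BrowserOS's implementation: the `Rule` class, the `is_allowed` function, the action names (`click`, `type`, `submit`, `read`), and the `mybank.example` host are all assumptions invented for this example.

```python
from dataclasses import dataclass
from fnmatch import fnmatch
from urllib.parse import urlparse


# Hypothetical rule model; none of these names come from BrowserOS.
@dataclass(frozen=True)
class Rule:
    host_pattern: str            # glob matched against the page hostname
    denied_actions: frozenset    # action kinds the agent may not perform there
    denied_labels: tuple = ()    # lowercase substrings that block a click


# Assumed set of actions that mutate page state.
WRITE_ACTIONS = frozenset({"click", "type", "submit"})

RULES = [
    # "role(agent): can never click buy"
    Rule("*", frozenset(), denied_labels=("buy",)),
    # "role(agent): read-only access on my bank's homepage"
    Rule("*.mybank.example", WRITE_ACTIONS),
]


def is_allowed(url, action, label="", rules=RULES):
    """Return False if any rule matching the URL's host denies the action."""
    host = urlparse(url).hostname or ""
    for rule in rules:
        if not fnmatch(host, rule.host_pattern):
            continue
        if action in rule.denied_actions:
            return False
        if action == "click" and any(s in label.lower() for s in rule.denied_labels):
            return False
    return True
```

In this sketch the check is deny-by-rule and allow-by-default, and the label match is a crude substring test. A real guardrail would presumably need to sit at the layer where the browser dispatches agent actions, so that the agent cannot route around it.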
arjunchint 3 hours ago
Hey, cool stuff since the last update!

I still don't buy the "we needed it to be a whole browser and not a Chrome extension" argument:

- your interface is still literally a Chrome extension side panel

- none of the agentic browsers from the bigger players like Atlas and Comet really took off either

I do think the server-side integration is required:

- with rtrvr.ai, a ton of users are integrating our web agent Chrome extension via Remote MCP from chatgpt.com, as well as triggering it remotely as an API endpoint. Your implementation is limited to only local connections, as I understand it.

- the biggest unlock for users is running at scale, so just being able to launch a hundred cloud browsers, do a task, and return results while you do other things. So we see hybrid cloud/local execution as the key unlock for this year

Your workflow pipeline is really cool! Any blog post/summary on how you set it up?

Last year was a lot of technical builders exploring the capabilities, and I am excited for this year of making these agentic browsers useful!
tekacs 5 hours ago
https://github.com/browseros-ai/BrowserOS/issues/99#issuecom... I didn't hear back there, but huzzah, it looks like this is in there. I'm glad to see it!
mossTechnician 4 hours ago
> we're adding browser-level guardrails (think IAM for agents)

This sounds interesting, but where would I go to see these guardrails and their implementation? I tried searching in the repository and couldn't find them.
rahimnathwani an hour ago
Which local model works best with this? (Assuming macOS with 32GB unified RAM)
4b11b4 5 hours ago
IAM for agents sounds interesting, but how is it reliably enforced? Did you also build evals?
Johnny_Bonk 5 hours ago
What would be great is if it could work in the browser like Claude in Chrome and communicate (with my control) back to objects on my desktop, like my IDE for example, or really anything.
ivysly 6 hours ago
Why are you calling this an OS?