| ▲ | Self-hosted dev sandboxes with preview URLs (Docker, Go, no K8s)(github.com) | |||||||||||||||||||||||||||||||||||||||||||||||||
| 66 points by tastyeffectco 9 hours ago | 13 comments | ||||||||||||||||||||||||||||||||||||||||||||||||||
| ▲ | rsyring an hour ago | parent | next [-] | |||||||||||||||||||||||||||||||||||||||||||||||||
I'd like something like this but using firecracker VMs. Basically, a self hosted exe.dev. Anyone building or using a project like this? | ||||||||||||||||||||||||||||||||||||||||||||||||||
| ▲ | indigodaddy 3 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||||||||||||||||||||
My project is kinda sorta similar but uses Incus (LXC) and Caddy: https://github.com/jgbrwn/vibebin | ||||||||||||||||||||||||||||||||||||||||||||||||||
| ▲ | cadamsdotcom 6 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||||||||||||||||||||
Thanks for posting your project & congrats on shipping. I have to confess, I’m struggling to see how this beats having my agent write 100 lines of shell script in a couple of seconds to do just the subset of this I need.. Would be neat to be able to read about that on its landing page! | ||||||||||||||||||||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||
| ▲ | hk1337 4 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||||||||||||||||||||
I did a similar thing but with devcontainer and codespaces. | ||||||||||||||||||||||||||||||||||||||||||||||||||
| ▲ | utibeumanah 5 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||||||||||||||||||||
interesting project does this have some form of isolation? | ||||||||||||||||||||||||||||||||||||||||||||||||||
| ▲ | priyadarshy 2 hours ago | parent | prev [-] | |||||||||||||||||||||||||||||||||||||||||||||||||
I think we're all under-indexing on how important it is going to be to be able to play-test an agent's changes before shipping it as a hedge against slop. I don't think people will do it unless it feels effortless, like clicking a preview link from a GitHub PR. If there's any friction, people will just click merge PR and move on with their life and your product-level slop will acccummulate. I think high-taste products require you to actually use the thing that was built to feel the gaps in your spec. I routinely find I am doing something dead simple that an agent will one-shot, e.g. add a new sort by option in the panel that lists a user's Linear tasks. If I looked at the PR diff I'd immediately think it was perfect and ship it. It's only when I actually get in there and play around with a dynamic UI that I realize, "you know this option really belongs at the top" or "hm, there's enough options here now that this dropdown feels cramped and we might need to consider another option". Simple examples obviously but the principle is that when I am landing the last 10% of a fix or a feature that when I need to interact and play-test. At the speed agents can generate fixes to new bugs or customer requests, I am bottle necked just doing that last 10% of steering and even the basic git mechanics to try something locally are enough to get me to not want to do it. Right now, it seems like the state of the art is to review PR diffs and just merge them in or if you are more sophisticated have your agent generate screenshots or screen recordings. Screenshots and images are moving in the right direction but if you are building something interactive, you've really got to interact with it to know if it feels great. My dream was to start my day with an agent having handled every bug and small enhancement request that came my way that day, worked on, and ready for my review, so I could spend an hour each day just steering them to the finish line. I could do this linearly, picking off a branch an agent worked on, testing it and iterating but most of these small fixes each day aren't big brain stuff, I can effectively multi-task them but when I've tried doing it on my own machine it's either worktree hell, git gymnastics, or agents deadlocking - one agent wants to self test with Chrome MCP but can't cause another is editing code causing hot reloads. I ended up building a Desktop app version of what OP posted to do this for myself and my team at Sunsama, it's called Macro: - Website: https://macro.land - Demo: https://www.loom.com/share/89c273e3a92d45cfb6860790d7b78bf6 I had a couple other specific needs that these repos don't cover: - I want complete control of what MCP tools my team uses and the ability to control their input/output etc e.g. I don't want someone using the Intercom MCP to accidentally reply to a customer with an agent. - I don't want my engineers spending time configuring all the boring stuff (preview urls, mcps, chrome mcp, etc) - I wanted the ability to steer agents to the finish line together. In Macro, any user can join the chat and steer the agent e.g. a colleague with UI taste jumps into tweak the last bits - I want to see common failure agentic failure cases across my team so I can improve our Claude.md and agentic practices so all work flowing through one interface allows that | ||||||||||||||||||||||||||||||||||||||||||||||||||