Remix.run Logo
adrianfcole 3 days ago

I feel you and I also think this is a tough time. I routinely use Claude even though he typically ignores me ;) I'm also a maintainer of Goose and have been engaged since before the rust rewrite. I'll admit I don't get exactly what I want out of any agent. I also don't buy the "it is you" thing that is typical with Agents. We often need to know too many things and are defensive in how we act. I truly hope this is temporary.

ok back to the point. Block is not trying to sell a frontier model, or Goose at all. As an open source enthusiast, I like this model (no pun intended). Features go where the prominent site or key contributors want, vs a commercial agenda. To get more practical, it was goose folks themselves who put themsemselves out there in tbench.ai and remain in the top 10

https://www.tbench.ai/leaderboard/terminal-bench/2.0

Does this invalidate poor experience on use cases. no way. However, there's a lot of work being done by block folks to help teach and share practice and get things together. I'm always looking for pure local everything and Mic is also super keen on this, Today? well it is like watching someone type each character at a time while your laptop melts. I don't think this invalidates the long term, but it acknowledges the short term.

Next, Goose doesn't care about you in a specific way. Literally there is a Claude agent so you can swap out the goosey parts if you like. It is clunky and I'm personally looking into aligning that interop via Zed's ACP. I think like the combination of openness and not having any angle.. like not anti claude, literally give you a way to use it.. is telling.

This is a ramble and maybe a waste of your context, but I hope it colors some things and will get to see you around.

sheikhlimon 3 days ago | parent [-]

Adding a quick note as someone who contributes to Goose but isn’t a maintainer. I agree with a lot of what you’re saying. The harness and overall UX have changed quite a bit recently, though it’s still very much evolving like everything else in this space. If anyone tried it a while back, the newer versions are worth a look. And any issues people hit in practice are genuinely useful for us to improve things.

Appreciate the thoughtful take here.