Remix.run Logo
phsource 4 days ago

This is pretty impressive results given that this is not from one of the major AI labs. Congrats: https://blog.withmeka.com/meka-achieves-state-of-the-art-per...

Out of curiosity, what do you think contributed to this working better than even OpenAI agent or some of the other tools out there?

I'm not that familiar with how OpenAI and other agents like Browser Use currently work, but is this, in your opinion, the most important factor?

> An infrastructure provider that exposes OS-level controls, not just a browser layer with Playwright screenshots. This is important for performance as a number of common web elements are rendered at the system level, invisible to the browser page

tcwd 4 days ago | parent [-]

Thanks! Quite a few factors, here's a detailed post on the architecture: https://blog.withmeka.com/introducing-meka-an-open-source-fr...

IMO, the combination of having an "evaluator model" at the end to verify if the intent of the task was complete, and using multiple models that look over each other's work in every step was helpful - lots of human organization analogies there, like "trust but verify" and pair programming. Memory management was also very key.