Remix.run Logo
onion2k 12 hours ago

I use AI agents to build UI features daily. The thing that kept annoying me: the agent writes code but never sees what it actually looks like in the browser. It can’t tell if the layout is broken or if the console is throwing errors.

I give agent either a simple browser or Playwright access to proper browsers to do this. It works quite well, to the point where I can ask Claude to debug GLSL shaders running in WebGL with it.

vunderba 7 hours ago | parent | next [-]

Agreed. Anthropic added a plugin accessible under `/plugins` to CC to make it even easier to add MCP Playwright to your project. It automatically handles taking screenshots.

It's not perfect though - I've personally found CC's VL to be worse than others such as Gemini but its nice to have it completely self contained.

This project desperately needs a "What does this do differently?" section because automated LLM browser screenshot diffing has been a thing for a while now.

wg0 11 hours ago | parent | prev | next [-]

Do you use Chrome DevTools MCP or how does it work?

roxolotl 10 hours ago | parent [-]

Playwright mcp has screenshotting built in

ioseph 10 hours ago | parent | prev | next [-]

Likewise, and often the playwright skill will verify using DOM API instead of wasting tokens on screenshots

embedding-shape 10 hours ago | parent [-]

> often the playwright skill will verify using DOM API instead of wasting tokens on screenshots

So... Bypassing the whole "sees what it actually looks like in the browser. It can’t tell if the layout is broken" parent commentator is talking about? Seems worse, not better.

TZubiri 9 hours ago | parent | prev [-]

+1

All the power to you if you build a product out of this, I don't wanna be that guy that says that dropbox is dead because you can just setup ftp. But with Codex/Claude Code, I was able to achieve this very result just from prompting.