Remix.run Logo
Show HN: Resurf – realistic, reproducible test framework for AI browser agents(github.com)
5 points by andrew_zhong 10 hours ago

Systematic testing of browser agents today is not easy: testing on real websites is flaky, rate-limited and potentially expensive (e.g. using proxies or bypassing Captcha), while static-HTML benchmarks lack state and dynamic behavior.

Resurf gives your browser agent a realistic, stateful, instrumented framework — built on synthetic websites with failure-mode injection:

- Realistic, dynamic, interactive environment - Deterministic & reproducible - Failure-mode injection (latency, payment errors, 5xx) - Auditable success eval (DB state, not LLM judge) - No dependency on live websites - Browser Use and Stagehand supported out of the box