| ▲ | Show HN: Resurf – realistic, reproducible test framework for AI browser agents(github.com) | |
| 5 points by andrew_zhong 10 hours ago | ||
Systematic testing of browser agents today is not easy: testing on real websites is flaky, rate-limited and potentially expensive (e.g. using proxies or bypassing Captcha), while static-HTML benchmarks lack state and dynamic behavior. Resurf gives your browser agent a realistic, stateful, instrumented framework — built on synthetic websites with failure-mode injection: - Realistic, dynamic, interactive environment - Deterministic & reproducible - Failure-mode injection (latency, payment errors, 5xx) - Auditable success eval (DB state, not LLM judge) - No dependency on live websites - Browser Use and Stagehand supported out of the box | ||