Remix.run Logo
Browser Agent Benchmark: Comparing LLM models for web automation(browser-use.com)
6 points by MagMueller 15 hours ago | 2 comments
pixel_popping an hour ago | parent | next [-]

It's lacking the best model (Opus 4.5) on the benchmark tho.

wiradikusuma an hour ago | parent | prev [-]

Since we're in this topic, can anyone suggest good AI-based tool for exploratory (fuzzy?) web testing?