| ▲ | Browser Agent Benchmark: Comparing LLM models for web automation(browser-use.com) | |
| 6 points by MagMueller 15 hours ago | 2 comments | ||
| ▲ | pixel_popping an hour ago | parent | next [-] | |
It's lacking the best model (Opus 4.5) on the benchmark tho. | ||
| ▲ | wiradikusuma an hour ago | parent | prev [-] | |
Since we're in this topic, can anyone suggest good AI-based tool for exploratory (fuzzy?) web testing? | ||