Remix.run Logo
Show HN: Verdict – model evals on your own data, not someone else's benchmark(github.com)
2 points by agunapal 7 hours ago