Proposed New Test of AI Capabilities :)
1 points by VikRubenfeld 10 hours ago

When Microsoft can turn a fleet of LLMs loose on the Azure UX, and Google can do the same for the Google AdWords UX, and each comes out substantially less dreadful, it will go a long way toward showing that frontier models are as good as claimed.