Remix.run Logo
swyx 2 hours ago

he works on evals at canva

dannyw 2 hours ago | parent [-]

Yep. We have some interesting problems, like getting LLMs to create/edit Canva designs in our own proprietary format, which isn’t published or documented on the web. So the model has to work with it, purely from a very detailed system prompt spec / in-context learning.

I assume it might be a good barometer for generalised intelligence; esp in the visual space.