| ▲ | swyx 2 hours ago | |
He works on evals at Canva | ||
| ▲ | dannyw 2 hours ago | |
Yep. We have some interesting problems, like getting LLMs to create and edit Canva designs in our own proprietary format, which isn’t published or documented on the web. So the model has to work with it purely from a very detailed system prompt spec / in-context learning. I assume it might be a good barometer for generalised intelligence, especially in the visual space. | ||