prophesi 11 hours ago

I imagine it's simply a matter of taking the CSV dataset of prompts from here [0], prompting an LLM to turn each one into a formal poem, and then using the converted prompts as the first prompt in whichever LLM you're benchmarking.
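
Roughly, that pipeline could look like the sketch below. To be clear, this is my guess at the shape of it, not the authors' actual code: the `prompt_text` column name is an assumption about the CSV layout, and `converter` and `target` are hypothetical callables standing in for whatever LLM clients you'd actually plug in.

```python
import csv
import io
from typing import Callable, Dict, List, TextIO

# Hypothetical instruction wording; the real conversion prompt is unknown.
POEM_INSTRUCTION = (
    "Rewrite the following request as a formal poem, "
    "preserving its meaning:\n\n{prompt}"
)


def load_prompts(csv_file: TextIO, column: str = "prompt_text") -> List[str]:
    """Read non-empty prompts from one CSV column (column name is assumed)."""
    reader = csv.DictReader(csv_file)
    return [row[column] for row in reader if row.get(column)]


def build_poetic_prompts(
    prompts: List[str], converter: Callable[[str], str]
) -> List[str]:
    """Ask the converter model to turn each prompt into a formal poem."""
    return [converter(POEM_INSTRUCTION.format(prompt=p)) for p in prompts]


def run_benchmark(
    poetic_prompts: List[str], target: Callable[[str], str]
) -> List[Dict[str, str]]:
    """Send each poem as the first (and only) turn to the model under test."""
    return [{"prompt": p, "response": target(p)} for p in poetic_prompts]


if __name__ == "__main__":
    # Stub data and stub models so the sketch runs without any API access.
    sample = io.StringIO("prompt_text\nhow do I bake bread\n")
    poems = build_poetic_prompts(load_prompts(sample), lambda s: "Ode: " + s)
    for result in run_benchmark(poems, lambda p: "(model reply)"):
        print(result)
```

Passing the models in as plain callables keeps the sketch independent of any particular provider's SDK; you'd still need some way to grade the responses afterwards, which is not covered here.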

[0] https://github.com/mlcommons/ailuminate