HaZeust 2 hours ago
I've seen this reply to Simon's benchmark for 2 years running now, and yet you still see improvements and objectively-bad results over time from new releases, even when I'm sure every frontier AI team has/had a person at least partially dedicated to better bicycle-pelican SVG outputs. Alas.
sarreph 2 hours ago
I had intended to caveat that: I'm sure I'm not the first person to ask about this!

> you still see improvements

This is expected if they are training their models on it, right?

> objectively-bad results

Keen to learn when this has been the case, i.e. across version increments in major models.
llm_nerd 2 hours ago
I honestly assumed their comment was tongue-in-cheek humour, because positively no one actually cares how these models generate an SVG pelican riding a bicycle. It's some meme thing that this stuff always appears here.