| ▲ | willahmad 18 hours ago | |||||||
I think this benchmark could be slightly misleading to assess coding model. But still very good result. Yes, SVG is code, but not in a sense of executable with verifiable inputs and outputs. | ||||||||
| ▲ | jstummbillig 17 hours ago | parent | next [-] | |||||||
I love that we are earnestly contemplating the merits of the pelican benchmark. What a timeline. | ||||||||
| ||||||||
| ▲ | hdjrudni 8 hours ago | parent | prev [-] | |||||||
But it does have a verifiable output, no more or less than HTML+CSS. Not sure what you mean by "input" -- it's not a function that takes in parameters if that's what you're getting at, but not every app does. | ||||||||