Remix.run Logo
N_Lens 5 hours ago

Same with LLM benchmarks these days.

Metaluim 5 hours ago | parent [-]

Well, the pelican benchmark is easily verifiable.

echoangle 3 hours ago | parent [-]

Kind of hard to judge though, it’s not really objective how good a pelican looks.

supercoco9 28 minutes ago | parent [-]

Or a bicycle!