Remix.run Logo
pwython 5 hours ago

How many pelican riding bicycle SVGs were there before this test existed? What if the training data is being polluted with all these wonky results...

bwilliams18 3 hours ago | parent | next [-]

I'd argue that a models ability to ignore/manage/sift through the noise added to the training set from other LLMs increases in importance and value as time goes on.

nerdsniper 4 hours ago | parent | prev [-]

You're correct. It's not as useful as it (ever?) was as a measure of performance...but it's fun and brings me joy.