| ▲ | pwython 5 hours ago | |
How many pelican-riding-a-bicycle SVGs were there before this test existed? What if the training data is being polluted with all these wonky results... | ||
| ▲ | bwilliams18 3 hours ago | |
I'd argue that a model's ability to ignore, manage, or sift through the noise that other LLMs add to the training set becomes more important and valuable as time goes on. | ||
| ▲ | nerdsniper 4 hours ago | |
You're correct. It's not as useful as it (ever?) was as a measure of performance... but it's fun and brings me joy. | ||