Remix clone Hacker News

new | show | ask | jobs Github

	▲	echelon 3 hours ago
		`1. Take the top ten searches on Google Trends (on day of new model release) 2. Concatenate 3. SHA-1 hash them 4. Use this as a seed to perform random noun-verb lookup in an agreed upon large sized dictionary. 5. Construct a sentence using an agreed upon stable algorithm that generates reasonably coherent prompts from an immensely deep probability space.` That's the prompt. Every existing model is given that prompt and compared side-by-side. You can generate a few such sentences for more samples. Alternatively, take the top ten F500 stock performers. Some easy signal that provides enough randomness but is easy to agree upon and doesn't provide enough time to game. It's also something teams can pre-generate candidate problems for to attempt improvement across the board. But they won't have the exact questions on test day.