Remix clone Hacker News

new | show | ask | jobs Github

	▲	GaggiX 3 days ago
		A model is capable of learning the "calibration" during the reinforcement learning phase, in this "old" post from OpenAI: https://openai.com/index/introducing-simpleqa/ you can see the positive correlation between stated confidence and accuracy.