Remix clone Hacker News

new | show | ask | jobs Github

	▲	andoando 4 hours ago
		The thinking models are additionally trained with reinforcement learning to produce chain of thought reasoning