Remix clone Hacker News

	▲	kostaj an hour ago
		Search was enabled for 2 of the 5 models: Gemini and Sonar Pro. Disagreement between those two is still high: they reached different verdicts on 42% of the claims. I fully agree that some of those claims are hard for a human to classify as well -- the real-world messiness...
	▲	dcreater an hour ago
		Why was it enabled for only 2 of the 5? Other burning questions: What methodology was used to choose the question set? Why not allow explanations? How many passes were done for each LLM?