Remix clone Hacker News

new | show | ask | jobs Github

	▲	xnx 3 days ago
		Have you looked at comparing to Google's foundation models or specialty medical models like MedGemma (https://developers.google.com/health-ai-developer-foundation...)?
	▲	fertrevino 3 days ago \| parent [-]
		That would be an interesting extension. MedGemma isn't part of the original benchmark either [1]. Since Gemini 2.0 Flash is on 6th place, expectations are for MedGemma to achieve higher than that :) [1]https://crfm.stanford.edu/helm/medhelm/latest/#/leaderboard