throwaway219450 2 hours ago

It benefits Google's bottom line to have very capable small models that can cheaply cache results for search queries, even if they're frequently wrong. But I wonder if they use Gemini for the top X% of search terms to try to get better retention? Also, the TPU vertical gives them a good advantage here. I've never been super impressed with Gemini out of the box, but surely, surely, Google is best positioned here.

As a consumer, 24-32 GB of VRAM is affordable ($1-2k), and that's the frontier I'm most interested in. It's very "two papers down the line". Models that fit in that much memory are also feasible to fine-tune, unlike the O(100B+) behemoths. The RTX PRO 4000 Blackwell has a very low TDP compared to the 300-600 W gaming cards people insist on using. If I were freelancing, I would definitely consider getting a 6000 for work.
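
For a rough sense of what fits in 24-32 GB, here is a back-of-the-envelope sketch. It counts model weights only; KV cache, activations, and (for fine-tuning) gradients and optimizer state all come on top, so treat the numbers as a lower bound. The function name and the parameter counts are just illustrative picks, not tied to any particular model.

```python
def weight_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GiB.

    Assumption: every parameter is stored at the same precision and
    nothing else (KV cache, activations, optimizer state) is counted.
    """
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


if __name__ == "__main__":
    # Illustrative sizes and precisions only.
    for params in (8, 14, 32, 70):
        for bits in (16, 8, 4):
            print(f"{params}B @ {bits}-bit: {weight_gib(params, bits):.1f} GiB")
```

By this estimate a 32B model at 4-bit is just under 15 GiB of weights, while a 70B model at 4-bit is already past 32 GiB before any cache or activations.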