| ▲ | cyanydeez 3 hours ago | |||||||||||||||||||||||||||||||||||||||||||
I'd say it's a malefactor of: 1. Amazing, you just tweaked 1% efficiency 2. You idiot, you just spent an hour trying to trouble shoot a hallucinated api. On average, it's really hard to tell which ones going to win here. | ||||||||||||||||||||||||||||||||||||||||||||
| ▲ | dakolli 3 hours ago | parent [-] | |||||||||||||||||||||||||||||||||||||||||||
Its not hard to tell at all, just look at how much it costs to run a 10T param model (especially with parallelized agents). Those costs are not worth the occasional slot machine-eque jackpot you get. For an entity like Google it might be worth it, but that's it. They definitely aren't going to let us use these things for cost they are now for much longer. Imagine going back to 2020 and tell people in 6 years going to be able to spend $200.00 a month and be able to spin up $2mm in GPUs at full throttle to respond to your emails. None of this makes sense. | ||||||||||||||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||||||||||||||