| ▲ | FieryTransition 6 hours ago | |||||||||||||||||||||||||||||||
If it's not reprogrammable, it's just expensive glass. If you etch the bits into silicon, you then have to accommodate the bits by physical area, which is the transistor density for whatever modern process they use. This will give you a lower bound for the size of the wafers. This can give huge wafers for a very set model which is old by the time it is finalized. Etching generic functions used in ML and common fused kernels would seem much more viable as they could be used as building blocks. | ||||||||||||||||||||||||||||||||
| ▲ | audunw 5 hours ago | parent | next [-] | |||||||||||||||||||||||||||||||
Models don’t get old as fast as they used to. A lot of the improvements seem to go into making the models more efficient, or the infrastructure around the models. If newer models mainly compete on efficiency it means you can run older models for longer on more efficient hardware while staying competitive. If power costs are significantly lower, they can pay for themselves by the time they are outdated. It also means you can run more instances of a model in one datacenter, and that seems to be a big challenge these days: simply building an enough data centres and getting power to them. (See the ridiculous plans for building data centres in space) A huge part of the cost with making chips is the masks. The transistor masks are expensive. Metal masks less so. I figure they will eventually freeze the transistor layer and use metal masks to reconfigure the chips when the new models come out. That should further lower costs. I don’t really know if this makes sanse. Depends on whether we get new breakthroughs in LLM architecture or not. It’s a gamble essentially. But honestly, so is buying nvidia blackwell chips for inference. I could see them getting uneconomical very quickly if any of the alternative inference optimised hardware pans out | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||
| ▲ | booli 4 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||
Reading the in depth article also linked in this thread, they say that only 2 layers need to change most of the time. They claim from new model to PCB in 2 months. Let's see, but sounds promising. | ||||||||||||||||||||||||||||||||
| ▲ | MagicMoonlight 5 hours ago | parent | prev [-] | |||||||||||||||||||||||||||||||
You don’t need it to be reprogrammable if it can use tools and RAG. | ||||||||||||||||||||||||||||||||