vlovich123 2 hours ago

Is the model structure going to be easy to reverse engineer just from the weights? Also, I'm going to guess it's an MoE and thus it's possible there's no single machine that hosts all of Fabel / Mythos.

himata4113 2 hours ago

KV cache residency requirements and general latency mean good throughput needs good locality. You're right that it could be split across multiple parts of a single datacenter, but as I mentioned before, the weakest link is before the model is ever loaded onto the GPUs.

As for reverse engineering, I doubt it's something that state-sponsored actors would struggle with for too long.