| ▲ | Imustaskforhelp 20 hours ago | |||||||||||||||||||||||||||||||
Is modal running every single service inside gvisor? I have heard that gvisor isn't recommended to run every single production but rather only some front facing or some other activities but it has some serious performance degradation which is why most end up using firecracker This is really cool though, does this mean that we could probably have AI models that are snapshotted? Are the states of checkpoint/recovery encrypted by default or how would that even work? Like what are the privacy aspects of it. I don't think even using something like modal would be the private llm that many people sometimes want on subreddits like localllama but the people dont have gpu. of course nothing beats privacy if you have your own gpu's but I'd be curious to know what people's thoughts are | ||||||||||||||||||||||||||||||||
| ▲ | markasoftware 20 hours ago | parent [-] | |||||||||||||||||||||||||||||||
the thing is modal is running untrusted containers, so there's not really a concept of "some front facing" containers. Any container running an untrusted workload is at high risk / is "front facing". If Modal's customers' workloads are mainly GPU-bound, then the performance hit of gvisor isn't as big as it might be for other workloads. GPU activity does have to go through the fairly heavyweight nvproxy to be executed on the host, but most gpu activity is longer-lived async calls like running kernels so a bit of overhead in starting / retrieving the results from those calls can be tolerated. | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||