| ▲ | NUMA: Cores, memory, and the distance between them(edera.dev) | |
| 20 points by sys_call 5 days ago | 1 comments | ||
| ▲ | lukax 7 minutes ago | parent [-] | |
NUMA can cause really crappy performance. We deployed a Go based LLM gateway in Kubernetes deployed on a server with hundreds of CPU cores. We didn't explicitly set GOMAXPROCS so Go runtime scheduled goroutines over different CPUs and it constantly used 200% CPU and GC was causing latency spikes. Then we set GOMAXPROCS 8 and all performance issues went away. Until recently Kubernetes didn't work well with NUMA. | ||