How is the routing to the hardware available? Let's say that a request hit the datacenter, how is it routed to an available GPU in a rack?