i was able to get it in 3.5 mins from a single image on my 24gb m4 pro macbook
I'm still working on this to try to replicate nvdiffrast better. Found an open source port, might look it tonight