xfalcox 3 hours ago

We have vLLM for running text LLMs in production. What is the equivalent for this model?

mh- 2 hours ago

I would say there isn't an equivalent. Some people will probably tell you ComfyUI - you can expose workflows via API endpoints and parameterize them. This is how, e.g., Krita AI Diffusion uses a ComfyUI backend.
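
To make "parameterize a workflow and hit an endpoint" concrete, here is a rough sketch, not a production setup. It assumes a local ComfyUI instance on its default address (127.0.0.1:8188) and a workflow exported in ComfyUI's API format. The node IDs ("3", "6") and the trimmed-down inputs are hypothetical placeholders; a real exported graph has more nodes and more inputs per node, and the helper names are mine.

```python
import copy
import json
import urllib.request

# Hypothetical, heavily trimmed workflow in ComfyUI's API format:
# node id -> {"class_type": ..., "inputs": {...}}. A real export is much larger.
WORKFLOW_TEMPLATE = {
    "3": {"class_type": "KSampler", "inputs": {"seed": 0, "steps": 20}},
    "6": {"class_type": "CLIPTextEncode", "inputs": {"text": ""}},
}


def parameterize(template, prompt_text, seed):
    """Return a copy of the template with the prompt text and seed filled in."""
    workflow = copy.deepcopy(template)  # leave the shared template untouched
    workflow["6"]["inputs"]["text"] = prompt_text
    workflow["3"]["inputs"]["seed"] = seed
    return workflow


def build_request(workflow, server="http://127.0.0.1:8188"):
    """Build the POST to ComfyUI's /prompt endpoint, which queues a workflow."""
    body = json.dumps({"prompt": workflow}).encode("utf-8")
    return urllib.request.Request(
        f"{server}/prompt",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def queue_workflow(workflow, server="http://127.0.0.1:8188"):
    """Send the request; needs a running ComfyUI server at `server`."""
    with urllib.request.urlopen(build_request(workflow, server)) as resp:
        return json.load(resp)
```

Calling queue_workflow(parameterize(WORKFLOW_TEMPLATE, "a cat", 42)) only queues the job; fetching the finished images is a separate step that this sketch leaves out.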

For various reasons, I doubt there are any large-scale SaaS-style providers operating this in production today.