▲ | minimaxir 19 hours ago | |
As usual for AI startups nowadays, using this API you can create a downstream wrapper for image generation with bespoke prompts. A pro/con of the multimodal image generation approach (with an actually good text encoder) is that it rewards intense prompt engineering moreso than others, and if there is a use case that can generate more than $0.17/image in revenue, that's positive marginal profit. |