Remix clone Hacker News

new | show | ask | jobs Github

▲

tezza 18 hours ago

For the curious I generated the same prompt for each of the quality types. ‘Auto’, ‘low’, ‘medium’, ‘high’.

Prompt: “a cute dog hugs a cute cat”

https://x.com/terrylurie/status/1915161141489136095

I also then showed a couple of DALL:E 3 images for comparison in a comment

▲

echelon 15 hours ago | parent | next [-]

> a cute dog hugs a cute cat

This prompt is best served by Midjourney, Flux, Stable Diffusion. It'll be far cheaper, and chances are it'll also look a lot better.

The place where gpt-image-1 shines if if you want to do a prompt like:

"a cute dog hugs a cute cat, they're both standing on top of an algebra equation (y=\(2x^{2}-3x-2\)). Use the first reference image I uploaded as a source for the style of the dog. Same breed, same markings. The cat can contrast in fur color. Use the second reference image I uploaded as a guide for the background, but change the lighting to sunset. Also, solve the equation for x."

gpt-image-1 doesn't make the best images, and it isn't cheap, and it isn't fast, but it's incredibly -- almost insanely -- powerful. It feels like ComfyUI got packed up into an LLM and provided as a natural language service.

▲

stavros 14 hours ago | parent [-]

I wonder if we can use gpt-image-1 outputs, with some noise, as inputs to diffusion models, so GPT takes care of adherence and the diffusion model improves the quality. Does anyone know whether that's at all possible?

	▲	AuryGlenz 11 hours ago \| parent \| next [-]
		Sure. I suppose with API support 3 hours ago someone probably made a Comfy node all of 2 hours ago. From there you can either just do a low denoise or use one of the many IP-Adapter type things out there.
	▲	levzzz 12 hours ago \| parent \| prev [-]
		yes it's what a lot of people have been doing with newer models which have better prompt adherence, passing them through older models with better aesthetics

▲

MoonGhost 15 hours ago | parent | prev | next [-]

Not bad. Photo forums will be soon full of them. Slightly edited to remove metadata and make them look like human made.

▲

latexr 16 hours ago | parent | prev | next [-]

> the same prompt for each of the quality types. ‘Auto’, ‘low’, ‘medium’, ‘high’.

“Auto” is just whatever the best quality is for a model. So in this case it’s the same as “high”.

▲

whywhywhywhy 7 hours ago | parent | prev | next [-]

Crazy even photos have the OpenAI yellow color grade

▲

mclau157 2 hours ago | parent | prev [-]

please use BlueSky