TZubiri 6 hours ago
I am confused: how can functions that output images help with functions that should take images as input?
taneq 5 hours ago
They’re multimodal LLMs trained for image generation. Turns out that if you want to generate images you gotta know what things look like.