| ▲ | FLUX.2 [Klein]: Towards Interactive Visual Intelligence(bfl.ai) | |||||||||||||||||||||||||
| 50 points by GaggiX 4 hours ago | 9 comments | ||||||||||||||||||||||||||
| ▲ | dfajgljsldkjag 3 minutes ago | parent | next [-] | |||||||||||||||||||||||||
I appreciate that they released a smaller version that is actually open source. It creates a lot more opportunities when you do not need a massive budget just to run the software. The speed improvements look pretty significant as well. | ||||||||||||||||||||||||||
| ▲ | pavelstoev 9 minutes ago | parent | prev | next [-] | |||||||||||||||||||||||||
If we think of GenAI models as a compression implementation. Generally, text compresses extremely well. Images and video do not. Yet state-of-the-art text-to-image and text-to-video models are often much smaller (in parameter count) than large language models like Llama-3. Maybe vision models are small because we’re not actually compressing very much of the visual world. The training data covers a narrow, human-biased manifold of common scenes, objects, and styles. The combinatorial space of visual reality remains largely unexplored. I am looking towards what else is out there outside of the human-biased manifold. | ||||||||||||||||||||||||||
| ▲ | codezero 2 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||
I am amazed, though not entirely surprised, that these models keep getting smaller while the quality and effectiveness increases. z image turbo is wild, I'm looking forward to trying this one out. An older thread on this has a lot of comments: https://news.ycombinator.com/item?id=46046916 | ||||||||||||||||||||||||||
| ||||||||||||||||||||||||||
| ▲ | psubocz 22 minutes ago | parent | prev | next [-] | |||||||||||||||||||||||||
> FLUX.2 [klein] 4B The fastest variant in the Klein family. Built for interactive applications, real-time previews, and latency-critical production use cases. I wonder what kind of use cases could be "latency-critical production use cases"? | ||||||||||||||||||||||||||
| ▲ | SV_BubbleTime an hour ago | parent | prev [-] | |||||||||||||||||||||||||
Flux2 Klein isn’t some generation leap or anything. It’s good, but let’s be honest, this is an ad. What will be really interesting to me is the release of Z-image, if that goes the way it’s looking, it’ll be natural language SDXL 2.0, which seems to be what people really want. Releasing the Turbo/Distilled/Finetune months ago was a genius move really. It hurt Flux and Qwen releases on a possible future implication alone. If this was intentional, I can’t think of the last time I saw such shrewd marketing. | ||||||||||||||||||||||||||
| ||||||||||||||||||||||||||