| ▲ | tartoran 7 hours ago | |
I have to admit I'm seeing this for the first time and am somewhat impressed by the results and even think they will get better with more training, why not... But are these multimodal LLMs still LLMs though? I mean, they're still LLMs but with a sidecar that does other things and the training of the image takes place outside the LLMs so in a way the LLMs still don't "know" anything about these images, they're just generating them on the fly upon request. | ||
| ▲ | boxedemp 6 hours ago | parent [-] | |
Maybe we should drop one of the L's | ||