pwatsonwailes 5 days ago

Vision language models (VLMs). Basically an LLM plus a vision encoder, so the LLM can take images as input alongside text.
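To make the "LLM plus a vision encoder" idea concrete, here is a toy sketch of one common way the two get wired together: image patches are mapped into the same embedding space as text tokens, and the LLM sees one combined sequence. Everything here is an assumption for illustration (the names `encode_image`, `build_llm_input`, the sizes, and the random weights); a real vision encoder is a trained network, not a single linear map.

```python
import random

EMBED_DIM = 8   # hypothetical LLM embedding width
PATCH_DIM = 4   # hypothetical length of a flattened image patch

random.seed(0)

def make_matrix(rows, cols):
    """Random matrix standing in for trained weights or input data."""
    return [[random.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]

# Stand-in for the vision encoder and its projection layer: one linear map.
PROJECTION = make_matrix(PATCH_DIM, EMBED_DIM)

def encode_image(patches):
    """Map each flattened patch to one vector of length EMBED_DIM."""
    return [
        [sum(patch[i] * PROJECTION[i][j] for i in range(PATCH_DIM))
         for j in range(EMBED_DIM)]
        for patch in patches
    ]

def embed_text(token_ids, table):
    """Look up one embedding vector per text token id."""
    return [table[t] for t in token_ids]

def build_llm_input(patches, token_ids, table):
    """Image embeddings followed by text embeddings, as one sequence."""
    return encode_image(patches) + embed_text(token_ids, table)

TOKEN_TABLE = make_matrix(10, EMBED_DIM)   # toy vocabulary of 10 tokens
patches = make_matrix(3, PATCH_DIM)        # toy image split into 3 patches
sequence = build_llm_input(patches, [1, 5], TOKEN_TABLE)
print(len(sequence), len(sequence[0]))
```

The point is just that, after encoding, image patches and text tokens look the same to the LLM: rows of equal-width vectors in a single sequence.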