Remix clone Hacker News
killingtime74
13 hours ago
Yes, it could just make a single call to a multimodal LLM to describe the scene.