Remix.run Logo
Show HN: I run a vision model on every screenshot, locally, on a 4GB GPU(github.com)
27 points by skye0110 15 hours ago | 4 comments
aynite 4 hours ago | parent | next [-]

I tried to use gemma-e4b for text generation, was thinking about to use for image

but i found gemma-e4b is still too "dumb", and barely capable to provide any good response.

could you share your experience with how you use e2b to generate good result?

torunar 14 hours ago | parent | prev [-]

> Microsoft showed the world wants screen-aware AI with Recall.

Considering the massive backlash it caused, it showed the exact opposite.

shmoogy 2 hours ago | parent | next [-]

I used the Rewind app on Mac and it was very nice to have the ability to search almost anything you did / saw on the computer. If it's local, opt in, and secure then it's potentially very worth exploring.

RobotToaster 3 hours ago | parent | prev [-]

I think the backlash was more against everything being sent to a cloud server where Microsoft can see everything you do.