Remix.run Logo
Show HN: Omi – watches your screen, hears conversations, tells you what to do(github.com)
8 points by kodjima33 10 hours ago | 5 comments

Spent 4 months and built Omi for Desktop, your life architect: It sees your screen, hears your conversations and will advise you on what to do next

Basically Cluely + Rewind + Granola + Wisprflow + ChatGPT + Claude in one app

I talk to claude/chatgpt 24/7 but I find it frustrating that i have to capture/send screenshots of my screen and that it doesn't help proactively during my work

Whenever omi sees something wrong about my workflow, it will send me a proactive notification with advice. It will also point to something I'm missing.

The hardest part was to nail proactivity - after trying 20+ similar tools I didn't find a single one with smart proactive notifications based on content on your screen. I made it look at your screen every second with 4 main prompts:

1. Is the user productive or distracted?

2. Is there anything useful to say right now?

3. is there any task to add to do later?

4. is there anything important to remember about the user?

Full stack: - Swift - Rust backend - Deepgram transcription - Claude code for messaging - GPT 5.4 summaries - Gemini for embeddings and translation

Open source, stores screenshots locally, uses Claude Code for chat. Has cloud to sync with hardware or mobile app but can be disabled in settings

smartypant 2 hours ago | parent | next [-]

this sounds cool but on the website I saw the previous version where its more like a passive device to listen, transcribe and save. how does it record the screen and doens't capturing the screen and converting that into text takes a lot of time? That will make it super slow. isnt it?

smartypant 2 hours ago | parent | prev | next [-]

this sounds cool but on the website I saw the previous version where its more like a passive device to listen, transcribe and save.

nprateem 2 hours ago | parent | prev | next [-]

You could pitch it as your "digital nagging housewife", or a "micromanager in a box". How about "your time wasting interrupt-otron" or just "flow-breaker"?

Seriously why would you think AI could read my mind and tell me what to do next without knowing my goals?

This sounds like the irritating tangential follow-on questions they ask on steroids. Generally irrelevant and take the conversation in a direction you don't want to go.

bakaev 10 hours ago | parent | prev [-]

imagine getting micro managed by this omi lol

Biologist123 4 hours ago | parent [-]

I guess you have to turn it on. Might be handy for things like electrician courses, DIY jobs, maths homework etc. Maybe one day even surgery!