Remix.run Logo
Show HN: PerceptAI – Give AI agents eyes on any screen, not just browsers
1 points by Neerajj04 14 hours ago | 4 comments

I built PerceptAI because every agent tool I found only works on websites via DOM.

75% of real computer work happens in desktop apps, legacy software, and tools with zero APIs. Agents are completely blind to all of it.

PerceptAI uses EasyOCR + Groq Vision to read any screen and PyAutoGUI to act on it. One plain English instruction executes autonomously with self-healing and memory.

Demo: percept-ai-phi.vercel.app GitHub: github.com/Neeraj04-CY/PerceptAi

Would love feedback from anyone building agents.

andsoitis 13 hours ago | parent | next [-]

Claude can already do this: https://code.claude.com/docs/en/computer-use

Neerajj04 an hour ago | parent | next [-]

Claude Computer Use requires Anthropic's API and runs in a sandboxed Docker container.

PerceptAI runs locally on your machine, works with any LLM backend, costs $0 to run, and is fully open source.

Different use case this is infrastructure for developers building their own agents, not an end-user tool.

polski-g 10 hours ago | parent | prev [-]

And the dozen or so AutoHotkey MCPs.

nigardev 13 hours ago | parent | prev [-]

[flagged]