Remix.run Logo
pwython 4 hours ago

I had Claude whip up a local solution for me using Gemma 4 26b-a4b on my Mac and a Raspberry Pi Zero 2 W. It can do web search (valyu), file reading -- I have business & personal context stored in many markdown files -- weather, Apple reminders, and has cross-session memory. Streamline but capable agent that has been pieced together over a few weeks after using Karpathy's LLM wiki pattern, with bits of Hermes logic. Orpheus-TTS streams the spoken reply back with the first word usually landing in half a second. Voice input is openWakeWord for the wake word plus faster-whisper for speech-to-text, all on-device. I can run it straight on the Mac but I use it with the pi satellite and a cheap USB speakerphone (ConfCall MS13B). You can barge in by just talking over it rather than having to say the wake word again. Pretty handy Google Home replacement.

ravetcofx 4 hours ago | parent | next [-]

Do you have a GitHub or something for the project? Sounds lovely.

pwython 4 hours ago | parent [-]

Not yet, I have to decouple some of the specific/personal logic. When I do push it, it'll be here: https://github.com/alisorcorp

jimbokun 4 hours ago | parent | prev [-]

Sounds like you could have a successful product.