Remix.run Logo
protocolture 2 days ago

Keen for this also. Been having issues getting a smooth voice experience from HA to ChatGPT. I dont like the whole wakeword concept for the receiver either. I think theres work to be done on the whole stack.

fennecbutt 2 days ago | parent | next [-]

What's wrong with the wakeword stuff?

Great timing as I was looking into it yesterday as was thinking about writing my own set of agents to run house stuff. I don't want to spent loads of time on voice interaction so HA wakeword stuff would've been useful. If not I'll bypass HA for voice and really only use HA via mcp.

I can do fw dev for micros...but omg do I not want to spend the time looking thru a datasheet and getting something to run efficiently myself these days.

protocolture a day ago | parent [-]

You can use the vendor supported wakewords, and they are generally pretty good.

However-> These are device specific. The devices I purchased for this purpose have very few vendor supported wakewords, but even more prominently, refuse to integrate with HA. Possible firmware issue, but I have reloaded the firmware 30 times. I dont necessarily want to purchase something else for this purpose. Which is where building a bespoke HA audio box becomes its own can of worms.

But if you want a custom wake word, or more like a wake phrase, you go down a rabbit hole of training/cost/memory etc that starts to get annoying fast.

I kind of know I am being unreasonable. I dont want a device that just ships off everything it hears to an LLM, even local, that would suck. I just want a third way.

Then theres other stuff. Like HA has a hard time with providing context to an LLM, because it sends the whole conversation thus far off to the LLM for context. It can get really weird really quickly. This caused me a lot of issues with lights for example. It would remember switching a light on, and if that was in the context, would refuse to switch it on a second time if it turned off due to a rule or manual intervention. But if you dont send the context, you cant have deeper conversations. You cant ask subsequent questions basically.

estimator7292 10 hours ago | parent [-]

On my new AMD laptop, it took about 90 minutes to run 50k training rounds on OpenWakeWord.

It's not really a big burden.

A tiny AI running locally is the third option you want. That's the only reasonable way to do configurable wake word detection

nickthegreek 2 days ago | parent | prev | next [-]

you can use a physical button instead of wakeword.

protocolture 2 days ago | parent [-]

Doesnt suit my use case sadly.

0xdeadbeefbabe 2 days ago | parent [-]

Back to the drawing board. What about a proximity sensor?

protocolture a day ago | parent [-]

I think what I want to do, is have a dodgy local LLM that picks up the context that the user is speaking to the LLM, and then enables it for 20 minutes or so.

But even thats a bit of a wild tradeoff.

6510 2 days ago | parent | prev [-]

It should participate in all conversations, take initiative and experiment.

sdenton4 2 days ago | parent [-]

"Hey, hey, are you still asleep? Using spare cycles, I have designed an optimal recipe for mashed potatoes, as you mentioned ten days ago. I need you to go get some potatoes."

terribleperson 2 days ago | parent | next [-]

A local AI system that hears your conversations, identifies problems, and then uses spare cycles to devise solutions for them is actually an incredible idea. I'm never going to give a cloud system the kind of access it would need to do a really good job, but a local one I control? Absolutely.

"Hey, are you still having trouble with[succinct summary of a problem it identified]?" "Yes" "I have a solution that meets your requirements as I understand them, and fits in your budget."

PaulDavisThe1st 2 days ago | parent | next [-]

> A local AI system that hears your conversations, identifies problems, and then uses spare cycles to devise solutions for them is actually an incredible idea.

I call that Dreaming.

(TM)

BizarroLand a day ago | parent [-]

If you could get an AI to listen to the conversations that happen in your sphere of influence and simply jot down the problems it identifies over the course of the day/week/month/year, that in itself would be an amazing tool.

Doubly so if you could just talk and brainstorm while it's listening and condensing, so you can circle back later and see what raindrops formed from the brainstorm.

Call that DayDreaming (TM)

darkwater 2 days ago | parent | prev [-]

"Did you find how to make peace with $FRIEND_OF_SPOUSE after they came here last week and they were pretty mad at you because you should tell something to $SPOUSE ? I thought about it in my spare cycles and all psychologists agree that truth and trust are paramount in a healthy relationship"

FeepingCreature 2 days ago | parent | prev | next [-]

I unironically want this.

estimator7292 10 hours ago | parent | next [-]

Agreed, this is hilarious.

6510 2 days ago | parent | prev [-]

I forget who but someone onhere a while back said he made a contraption that listens in and tries to determine the winner of each conversation.

6510 2 days ago | parent | prev | next [-]

I ponder the concept in the 90's. Initially I thought it should be an assistant but with age came wisdom and now I think it should be a virtual drill instructor. "Rise and shine $insult $insult, the sun is up, the store is open, we will be getting some potatoes today, $insult $insult, it was all your idea now apply yourself!" Bright lights flashing, loud music, the shower starts running. "Shower time, you have 7 minutes! $insult $insult" 4 minutes in the coffee machine boots up. "You will be wearing the blue pants, top shelve on the left stack, the green shirt, 7th from the left. Faster faster! $insult $insult"

quietsegfault 2 days ago | parent | prev [-]

This sounds a lot like gptars. I want a little gptars tearing around my house.

https://youtube.com/shorts/e2t0RxX4b54

6510 a day ago | parent [-]

I forgot about him. Great project!

Reminds me of a video from the 90's where some wizard put a camcorder and a giant antenna on a petrol powered rc car, an even bigger antenna on his house and controlled it from a 40's style sofa and a huge tube TV in his cramped garage. Over a mile range. Surrounded by enormous cars I think he was going 40-50 mph but with the screaming engine sound and the camera so low to the ground it looked like 500 mph. I'm still laughing, it looked like he was having all of the fun.

quietsegfault 12 hours ago | parent [-]

I've been meaning to put an FPV drone camera on one of my RC cars! It's very, very simple to do nowadays and requires none of the know-how you needed back in the day.