▲ | hadlock 9 hours ago | |
Speech input + speech output is a big deal. In theory you can talk to it using voice, and it can respond in your language, or translate for someone else, without intermediary technologies. Right now you need wakeword, speech to text, and then text to speech, in addition to your core LLM. A couple can input speech, or output speech, but not both. It looks like they have at least 3 variants in the ~32b range. Depending on the architecture this is something you could feasibly have in your house in a couple of years or in an expensive "ai toaster" | ||
▲ | data-ottawa 8 hours ago | parent | next [-] | |
The opportunities of plugging this into your home automation through tool calls is huge. Ever since ChatGPT added this feature I've been waiting for anyone else to catch up. They're are tons of hands free situations like cooking where this would be amazing ("read the next step please, my hands are covered in raw pork", "how much flour for the roux", "crap, I don't have any lemons, what can I substitute") | ||
▲ | CamperBob2 9 hours ago | parent | prev [-] | |
Seems like a big win for language learning, if nothing else. Also seems possible to run locally, especially once the unsloth guys get their hands on it. |