Remix.run Logo
mfrye0 2 days ago

My optimization hack is that I'm using speech recognition now with Claude Code.

I can just talk to it like a person and explain the full context / history of things. Way faster than typing it all out.

onprema 2 days ago | parent | next [-]

The SuperWhisper app is great, if you use a Mac.

mfrye0 2 days ago | parent | next [-]

I checked that one out. The one that Reddit recommended was Voice Type. It's completely offline and a one-time charge:

https://apps.apple.com/us/app/voice-type-local-dictation/id6...

The developer is pretty cool too. I found a few bugs here and there and reported them. He responds pretty much immediately.

jwr 2 days ago | parent | prev | next [-]

There is also MacWhisper which works very well. I've been using it for several months now.

I highly recommend getting a good microphone, I use a Rode smartlav. It makes a huge difference.

sipjca 2 days ago | parent | prev [-]

Open source and cross platform one: https://handy.computer

foob 2 days ago | parent | prev | next [-]

I've been pretty happy with the python package hns for this [1]. You can run it from the terminal with uvx hns and it will listen until you press enter and then copy the transcription to the clipboard. It's a simple tool that does one thing well and integrates smoothly with a CLI-based workflow.

[1] - https://github.com/primaprashant/hns

mfrye0 2 days ago | parent [-]

I'll check that one out.

The copy aspect was the main value prop for the app I chose: Voice Type. You can do ctrl-v to start recording, again to stop, and it pastes it in the active text box anywhere on your computer.

theshrike79 2 days ago | parent | prev | next [-]

So you sit in a room, talking to it? Doesn't it feel weird?

I type a lot faster than I speak :D

mfrye0 2 days ago | parent | next [-]

Haha yeah, it does feel a bit weird.

I often work on large, complicated projects that span the whole codebase and multiple micro services. So it's often a blend of engineering, architectural, and product priorities. I can end up talking for paragraphs or multiple pages to fully explain the context. Then Claude typically has follow-up questions, things that aren't clear, or issues that I didn't catch.

Honestly, I just get sick of typing out "dissertations" every time. It's easier just to have a conversation, save it to a file, and then use that as context to start a new thread and do the work.

jon-wood 2 days ago | parent | prev [-]

Not only do I type faster than I speak I'm also able to edit as I go along, correcting any mistakes or things I've stumbled over and can make clearer. Half my experience of using even basic voice assistants is starting to ask for something and then going "ugh, no cancel" because I stumbled over part of a sentence and I know I'll end up with some utter nonsense in my todo list.

mfrye0 2 days ago | parent [-]

I know what you mean. This new generation of speech recognition is different though. It's able to understand fairly technical terms, specialized product names and other stuff that previously would have been garbled text.

Even if it gets slightly garbled, I often will add a note in my context that I'm using speech recognition. Then Claude will handle the potentially garbled or unclear sections perfectly or ask follow-up questions if it's unclear.

apwell23 2 days ago | parent | prev [-]

any options for ubuntu ?

sipjca 2 days ago | parent [-]

as above, open source and cross platform, should work on Ubuntu

https://handy.computer