Remix.run Logo
ldenoue 4 hours ago

I developed a stack on Cloudflare workers where latency is super low and it is cheap to run at scale thanks to Cloudflare pricing.

Runs at around 50 cents per hour using AssemblyAI or Deepgram as the STT, Gemini Flash as LLM and InWorld.ai as the TTS (for me it’s on par with ElevenLabs and super fast)

pugio 3 hours ago | parent [-]

Do you have anything written up about how you're doing this? Curious to learn more...