Remix.run Logo
rishabhaiover 5 hours ago

I'm assuming this is for tool call and orchestration. I didn't know we needed higher exploitable parallelism from the hardware, we had software bottlenecks (you're not running 10,000 agents concurrently or downstream tool calls)

Can someone explain what is Vera CPU doing that a traditional CPU doesn't?

kibibu 5 hours ago | parent | next [-]

> you're not running 10,000 agents concurrently or downstream tool calls

Cursor seem to be doing exactly that though

urig 5 hours ago | parent | prev [-]

Lots and lots of CPUs pooled. Faster more efficient power RAM accessible to both GPU and CPU. IIUC.

rishabhaiover 5 hours ago | parent [-]

But at what stage are we asking for that RAM? if it's the inference stage then doesn't that belong to the GPU<>Memory which has nothing to do with the CPU?

I did see they have the unified CPU/GPU memory which may reduce the cost of host/kernel transactions especially now that we're probably lifting more and more memory with longer context tasks.