Remix clone Hacker News

new | show | ask | jobs Github

	▲	qrios 5 days ago
		Isn't "Private Compute Cloud" just a marketing term with some restrict sec architecture? The real personal assistant LLM would mean to have the realtime data available in hot memory (to make sure to give instant responses). Audio, video, screen recordings, etc. from a single customer could be something between 1 and 10 GByte per day on average. After training you might get something like 3 MByte in additional model size per day. Even with 1 billion active users you would need to store additional data with 1 billion GByte (again on hot storage, like expensive GPU memory). The total amount of the memory of GPUs sold by NVIDIA is not even close to 400mio GByte (NVIDIA 3.8mio data center GPUs in 2023).
	▲	therealpygon 4 days ago \| parent [-]
		That assumes you are trying to compress every bit of information ingested at all times. We have a document so we can reference it, we don’t need video of the steps to create it. If we know we drove to work and nothing significant happened, we don’t need to store every detail from the drive. When distilled, most people’s days consist of very few actual newly discovered facts, decisions and changes to context.