▲ | williamtrask 10 hours ago | |||||||||||||||||||||||||
(OP here) I agree with this in spirit, but also it's hard to imagine the world can be fully described with 200 terabytes of data. There's a lot more good stuff out there. But to your point, a crucial question in AI right now is: how much quality data is still out there? As far as the impracticality, it's a great point. I disagree and have spent about 10 years working in the area. But that can be a post for another day. I understand and appreciate the skepticism. | ||||||||||||||||||||||||||
▲ | lxgr 10 hours ago | parent [-] | |||||||||||||||||||||||||
> it's hard to imagine the world can be fully described with 200 terabytes of data Why? Intelligence and compression might just be two sides of the same coin, and given that, I'd actually be very surprised if a future ASI couldn't make due with a fraction of that. Just because current LLMs need tons of data doesn't mean that that's somehow an inherent requirement. Biological lifeforms seem to be able to train/develop general intelligence from much, much less. | ||||||||||||||||||||||||||
|