Remix.run Logo
anonymous_user9 a day ago

> I think if the bar is to consider it not a replacement for knowledge work as long as there is a human in the loop.

That's where I put it personally, because of humans' limited amount of useful focus during a work day.

Anything that requires human attention will take some of that resource, and don't think models' rate of improvement will be fast enough to overcome that in the near future. Reviewing an output that is 99%, 99.9%, or 99.99% correct all take about the same amount of time, so the output needs to be correct enough not to need review before any knowledge work is replaced.

djhn a day ago | parent [-]

I’m afraid your numbers, all over 99%, are anchoring the conversation to an unreasonably high quality level.

I would have personally gone for 75%, 85% and 95%, which are all still best case scenario answers.

Had I taken on chatbot advice on electronics or chemistry I’d have died every couple of weeks (doing some hands-on real world R&D in my basement as a distraction from software).