| ▲ | AvAn12 8 hours ago | |||||||
I think the gap is because 1. For coding, Claude is amazing - mainly because of its curated skills and because massive amounts of working code has already been carefully labeled over the last decade or so via GitHub. And because with any Turing complete language, there is only so much one can do. But 2. For most other things, LLMs are fairly underwhelming. Research is usually mediocre. Try being rigorous and repeat your research prompt many times - then make a confusion matrix to tally up how many false positives and false negatives occur. And for the rest, be honest and ask yourself if the LLM is doing much more than a basic search engine query or trip to Wikipedia would have told you. For “normie” use cases, it’s handy-ish but far from revolutionary | ||||||||
| ▲ | Gigachad 3 hours ago | parent | next [-] | |||||||
Also because programming is self contained in a computer where the results can be tested and iterated easily. For programming the agent can just run the compiler and tests and keep retrying until it works. If I wanted to for example sew a T shirt, AI is useless. | ||||||||
| ▲ | calf 5 hours ago | parent | prev [-] | |||||||
I didn't understand your comment about Turing complete languages, could you explain that part please? | ||||||||
| ||||||||