| ▲ | sillyfluke 3 hours ago | |||||||||||||||||||||||||||||||
>Your top coder has guard rails in place to prevent him autonomously going free - right? The parent is implying they would prefer an AI when working in the airline and health industry because it makes less errors. Read the comment again. They have not said, "Hey, I work in the airline and health industry and I'd love to use AI for a couple of the bullshit IT UIs we have as long as we can put guardrails on the AI to stay in its lane." I asked a yes or no question. The guardrails you can put to mitigate errors are the same guardrails pre-AI for the humans (tests, regressions, reviews). If you were wary of employing a top lead engineer with verifiable dementia prior to AI for a mission critical system, logic implies you should think twice giving that much responsibility to an AI as well. > The hallucination thing I think is mostly overblown Can you predict when and how the SOTA model will hallucinate? Yes or no. Can you predict the severity impact of that error beforehand? Yes or no. >from speaking to colleagues it seems to vary wildly depending on which model and harness you are using You have partially answered my question it would seem. | ||||||||||||||||||||||||||||||||
| ▲ | deanc 2 hours ago | parent [-] | |||||||||||||||||||||||||||||||
> Can you predict when and how the SOTA model will hallucinate? Yes or no. Can you predict the severity impact of that error beforehand? Yes or no. No, but the same can be said for your colleagues. You might call what the LLM does hallucinations, I'd call them mistakes. I think we have totally forgotten that humans make them all the time and are confidently wrong too. Your original question, doesn't really get to the bottom of the point I'm trying to make, and I don't really feel it fairly represents the issue we are talking about here. They are not the same things. | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||