Once the hallucination rate drops, the remaining LLM failures will become increasingly hard to spot.