ACCount37 2 days ago
You can tell it "be careful about the tokenizer issues" in Swedish and see how that changes the behavior. The only thing that this stupid test demonstrates is that LLM metacognitive skills are still lacking. Which shouldn't be a surprise to anyone. The only surprising thing is that they have metacognitive skills, despite the base model training doing very little to encourage their development.
lossyalgo a day ago
LLMs were not designed to count letters[0], since they work with tokens, so whatever trick they now use behind the scenes to handle this case can probably only handle this particular case. I wonder if it's now included in the system prompt. I asked ChatGPT, and it said it now uses len(str) and some other Python scripts to do the counting, but who knows what's actually happening behind the scenes.
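To make the token-versus-letter point concrete, here is a rough sketch. The vocabulary `TOY_VOCAB` and the helpers `toy_tokenize` and `count_letter` are made up for illustration; this is not how any real model's tokenizer works, just a greedy longest-match splitter that shows why a model seeing subword pieces never directly "sees" individual letters, while ordinary string code counts them trivially.

```python
# Hypothetical toy vocabulary, invented for this example only.
# Real tokenizers use learned vocabularies with tens of thousands of entries.
TOY_VOCAB = ["straw", "berry", "st", "raw", "b", "e", "r", "y", "s", "t", "a", "w"]


def toy_tokenize(text, vocab=TOY_VOCAB):
    """Split text into pieces by greedy longest-match against vocab."""
    pieces = sorted(vocab, key=len, reverse=True)  # try longest pieces first
    tokens = []
    i = 0
    while i < len(text):
        for piece in pieces:
            if text.startswith(piece, i):
                tokens.append(piece)
                i += len(piece)
                break
        else:
            # No vocabulary piece matched: emit the single character as-is.
            tokens.append(text[i])
            i += 1
    return tokens


def count_letter(text, letter):
    """Count occurrences of a letter at the character level, ignoring case."""
    return text.lower().count(letter.lower())


word = "strawberry"
print(toy_tokenize(word))      # ['straw', 'berry']
print(count_letter(word, "r"))  # 3
```

With this toy split the word arrives as two opaque pieces, so answering "how many r's" means recalling the spelling of each piece rather than reading it off the input. Whether any deployed chatbot actually calls out to code like `count_letter` is an assumption I can't verify.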