▲ | ACCount37 5 days ago | ||||||||||||||||||||||
That's a really complex, very out-of-distibution, hard-to-know question for the early LLMs. Not that it's too hard to fix that, mind. Those LLMs weren't very aware of tokenizer limitations - let alone aware enough to recognize them or work around them in the wild. | |||||||||||||||||||||||
▲ | lapcat 5 days ago | parent [-] | ||||||||||||||||||||||
> That's a really complex, very out-of-distibution, hard-to-know question No, it's not. It's a trivial question in any context. > for the early LLMs. Early? Claude 3.7 was introduced just 6 months ago, and Deepseek-V3 9 months ago. How is that "early"? | |||||||||||||||||||||||
|