addandsubtract 4 hours ago
I know you're just sharing a single sample, but is this even the same test? In the article, the model is inspected while it generates the next token(s), and the probabilities are listed. Here, you're asking the model to fill in a missing word after the fact, and it's simply answering your prompt. We have no idea what the actual token probabilities inside Claude are, and asking the model gives us no way of probing them.
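To make the distinction concrete, here's a toy sketch of what "listing the probabilities" means when you have access to the model's raw scores. Everything in it is an assumption for illustration: the logits are made up, and `next_token_probs` is a hypothetical helper, not anything from the article or from any vendor's API.

```python
import math

def next_token_probs(logits):
    """Softmax over a {token: logit} dict, giving a next-token distribution."""
    m = max(logits.values())  # subtract the max for numerical stability
    exps = {tok: math.exp(v - m) for tok, v in logits.items()}
    z = sum(exps.values())
    return {tok: e / z for tok, e in exps.items()}

# Made-up logits for the position right after some prefix (not real model output).
toy_logits = {"cat": 2.0, "dog": 1.0, "car": 0.0}

probs = next_token_probs(toy_logits)
top = max(probs, key=probs.get)  # the candidate the model would favor
```

A chat reply only shows you tokens sampled from distributions like `probs`, conditioned on your whole prompt. The distribution itself never appears in the text, so you can't recover it by asking.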