SoftTalker 3 days ago

Yep, this is the sort of typo I probably make 10 times a day.

javchz 3 days ago | parent [-]

What's funny is that, because of tokenization, there is a non-zero chance an LLM audit would not see anything wrong here, similar to the strawberry problem.

TobTobXX 3 days ago | parent [-]

Nah, cr and rc are different tokens and LLMs would have no issues telling them apart. An older model might have trouble explaining that cr and rc are similar and can thus get easily mixed up, but the characters are probably more different to the LLM than they are to us.

TehCorwiz 3 days ago | parent [-]

What about all that GitHub training data using the wrong domain? Even if it’s a different token, it’s still being trained on as a correct value.