Remix.run Logo
stevepike 5 hours ago

I'm a bit surprised it gets this question wrong (ChatGPT gets it right, even on instant). All the pre-reasoning models failed this question, but it's seemed solved since o1, and Sonnet 4.5 got it right.

https://claude.ai/share/876e160a-7483-4788-8112-0bb4490192af

This was sonnet 4.6 with extended thinking.

bobbylarrybobby 4 hours ago | parent | next [-]

Interesting, my sonnet 4.6 starts with the following:

The classic puzzle actually uses *eight 8s*, not nine. The unique solution is: 888+88+8+8+8=1000. Count: 3+2+1+1+1=8 eights.

It then proves that there is no solution for nine 8s.

https://claude.ai/share/9a6ee7cb-bcd6-4a09-9dc6-efcf0df6096b (for whatever reason the LaTeX rendering is messed up in the shared chat, but it looks fine for me).

2 hours ago | parent | prev | next [-]
[deleted]
malfist 4 hours ago | parent | prev | next [-]

Chatgpt doesn't get it right: https://chatgpt.com/share/6994c312-d7dc-800f-976a-5e4fbec0ae...

``` Use digit concatenation plus addition: 888 + 88 + 8 + 8 + 8 = 1000 Digit count:

888 → three 8s

88 → two 8s

8 + 8 + 8 → three 8s

Total: 3 + 2 + 3 = 9 eights Operation used: addition only ```

Love the 3 + 2 + 3 = 9

simianwords 3 hours ago | parent [-]

chatgpt gets it right. maybe you are using free or non thinking version?

https://chatgpt.com/share/6994d25e-c174-800b-987e-9d32c94d95...

leumon 4 hours ago | parent | prev | next [-]

My locally running nemotron-3-nano quantized to Q4_K_M gets this right. (although it used 20k thought tokens before answering the question)

layer8 5 hours ago | parent | prev [-]

Off-by-one errors are one of the hardest problems in computer science.

anonymous908213 4 hours ago | parent [-]

That is not an off-by-one error in a computer science sense, nor is it "one of the hardest problems in computer science".

layer8 4 hours ago | parent [-]

This was in reference to a well-known joke, see here: https://martinfowler.com/bliki/TwoHardThings.html