▲ | mmmore a day ago | |
Sonnet with extended thinking solved it after 30s for me: https://claude.ai/share/b974bd96-91f4-4d92-9aa8-7bad964e9c5a Normal Opus solved it: https://claude.ai/share/a1845cc3-bb5f-4875-b78b-ee7440dbf764 Opus with extended thinking solved it after 7s: https://claude.ai/share/0cf567ab-9648-4c3a-abd0-3257ed4fbf59 Though it's a weird puzzle to use a benchmark because the answer is so formulaic. | ||
▲ | j_maffe a day ago | parent [-] | |
It is formulaic which is why it surprised me that Sonnet failed it. I don't have access to the other models so I'll stick with Gemini for now. |