▲ | felipeerias 4 months ago | |
Both Claude 4 Sonnet and Opus fail this one, even with extended thinking enabled, and even with a follow-up request to double-check their answers: “What is heavier, 20 pounds of lead or 20 feathers?” | ||
▲ | ttoinou 4 months ago | parent | next [-] | |
Can humans answer this correctly ? It is ambiguous | ||
▲ | cdelsolar 4 months ago | parent | prev [-] | |
chatgpt (whatever fast model they use) passed that after i told it to "read my question again" |