revachol 3 days ago
I just tried it in ChatGPT "Auto" and it didn't work:

> Yes — ((((()))))) is balanced.
> It has 6 opening ( and 6 closing ), and they’re properly nested.

Though it did work when using "Extensive Thinking". The model wrote a Python program to solve this.

> Almost balanced — ((((()))))) has 5 opening parentheses and 6 closing parentheses, so it has one extra ).
> A balanced version would be: ((((()))))

Testing a couple of different models without a harness, so that no tool calls are possible, would be interesting.
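
For reference, a minimal sketch of the kind of check such a program might do. This is not the program the model actually wrote, which isn't shown here; the function name `check_parens` is made up for illustration.

```python
def check_parens(s: str) -> tuple[bool, int, int]:
    """Return (balanced, n_open, n_close) for a string of parentheses.

    Hypothetical helper for illustration only.
    """
    depth = 0
    ok = True
    for ch in s:
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth -= 1
            if depth < 0:
                # A ")" appeared with no matching "(" before it.
                ok = False
    # Balanced only if depth never went negative and ends at zero.
    return ok and depth == 0, s.count("("), s.count(")")


print(check_parens("((((())))))"))  # (False, 5, 6)
print(check_parens("((((()))))"))   # (True, 5, 5)
```

Counting alone isn't enough in general (")(" has one of each but isn't balanced), which is why the sketch tracks nesting depth as well.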
kenjackson 3 days ago | parent
Weird. I tried it in ChatGPT Auto and it worked perfectly. I tried about 10 variations. I also tried the letters-in-words questions, and it got all of them right. The one thing I did trip it up on was "Is there the sh sound in the word transportation". It said no, and then realized I had asked for the "sound", not the letters. It subsequently got the rest of the "sounds-like" tests I did. Clearly, my ChatGPT is just better than yours.