| ▲ | apelapan 3 hours ago | |
On the contrary, I threw a multi-threading optimization task on it, that 4.5 and 4.6 have been pretty useless at handling. 4.7 bested my hand-tuned solution by almost 2x on first attempt. This was what I thought was my best moat as a senior dev. No other model has been able to come close to the throughput I could achieve on my own before. Might be a fluke of course, and they've picked up a few patterns in training that applies to this particular problem and doesn't generalize. We'll see. | ||
| ▲ | epistasis 34 minutes ago | parent [-] | |
Good to hear! My experience with code and 4.7 is still "I won't touch your python scripts because of my malware system instruction." With other chats the tool usage is through the roof with Opus 4.7 with mediocre results after much longer latency. I'll try again in a few days... | ||