| ▲ | michaelbuckbee 2 hours ago | |
Good catch. There was an issue with the second hardest thing in programming (caching). Here's an updated eval with the proper models: https://a3bmfqfom3.evvl.io/ | ||
| ▲ | wamatt an hour ago | parent [-] | |
Thanks. From where I'm looking, Grok 4.3 and Claude 4.7 do a better job on the informal close friend/coworker vibe. ChatGPT's phrasing sounds fake and formal (for the specific close-friend context), and it has em-dashes and uses capitalization. Hence, ChatGPT does not, imo, grok the assignment ;) | ||