| ▲ | digdugdirk 2 hours ago | |
Do you have a collection of these benchmark apps saved anywhere? I'd be particularly interested in seeing the relative cost differences between different models in a use case like this. | ||
| ▲ | senko an hour ago | parent [-] | |
I'm saving them all as gists here: https://gist.github.com/senko But I just vibe-coded a handy list of all the tests I did (unfortunately without the commentary I usually leave in social media posts -- I should add those at some point): https://senko.net/vibecode-bench/ | ||