| ▲ | stephc_int13 2 hours ago | |
They should add this to the benchmark suite and create a custom eval for how good the resulting compiler is, as well as how maintainable the source code is. | ||
| ▲ | snek_case an hour ago | |
This would be an expensive benchmark to run on a regular basis, though I guess for the big AI labs it's nothing. Code quality is hard to objectively measure, however. | ||