| ▲ | mortenjorck 3 hours ago | |
I assume those don't just work automatically with an off-the-shelf gguf. What do you need in your local inference stack to take advantage of M5's neural accelerators? | ||
| ▲ | aurareturn 3 hours ago | parent [-] | |
They do work with llama.cpp and MLX automatically. | ||