▲ | jackhalford 7 days ago | |
Why does the unsloth guide for gemma 3n say: > llama.cpp an other inference engines auto add a <bos> - DO NOT add TWO <bos> tokens! You should ignore the <bos> when prompting the model! That makes the want to try exactly that? Weird |