▲ | cpursley 5 days ago | |
That’s hilarious. How’s this model in practice? | ||
▲ | allisdust 5 days ago | parent [-] | |
it has been quite impressive so far. It makes very less number of mistakes. Cons: Context size if less so compaction happens frequently. Interesting bit is that the compaction doesn't seem to affect it as much as the Claude models. So I don't have to continuously look at the context size. Also it doesn't seem to lose the coherence even when nearing like 1% of the context. |