giancarlostoro 8 hours ago

I hope their open-source variants are just as good. A 1 million token context window in a fully offline model would be VERY interesting.

sosodev 8 hours ago

I don't know how well it performs, but you can extend Qwen3.5 to a 1 million token context using YaRN. Also, Nemotron 3 Super was recently released and natively scales up to a 1 million token context.
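
As a rough illustration of the YaRN route, the sketch below computes the scaling factor as target context divided by native context and builds a `rope_scaling` entry in the shape that Hugging Face Transformers configs use for earlier Qwen models. The 262,144-token native window is an assumption made for the arithmetic, and `yarn_rope_scaling` is a hypothetical helper, so check the model card for the real values before relying on it.

```python
def yarn_rope_scaling(native_ctx: int, target_ctx: int) -> dict:
    """Build a YaRN rope_scaling entry (hypothetical helper, not a library API)."""
    if native_ctx <= 0 or target_ctx <= 0:
        raise ValueError("context lengths must be positive")
    # YaRN stretches the positional range by target / native; never go below 1.
    factor = max(1.0, target_ctx / native_ctx)
    return {
        "rope_type": "yarn",
        "factor": factor,
        "original_max_position_embeddings": native_ctx,
    }


# ASSUMPTION: native context of 262,144 tokens; not confirmed in this thread.
ASSUMED_NATIVE_CTX = 262_144
TARGET_CTX = 1_048_576  # roughly 1 million tokens

cfg = yarn_rope_scaling(ASSUMED_NATIVE_CTX, TARGET_CTX)
print(cfg)
```

Under those assumptions the factor works out to 4.0. This only covers the configuration side and says nothing about how well the model holds up at that length.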