Remix.run Logo
tedivm 9 hours ago

The thing that is interesting though is that China is releasing the models they train (for the most part). Unlike OpenAI, for instance, China seems to still be developing in an open way including sharing code, weights, and publishing techniques.

These models are pretty amazing too. Their performance remains high even with smaller models, allowing them to competitively run on consumer grade hardware. I can (and do) run the Qwen models at home and use them for real tasks. As the tool ecosystem matures we can offload what the model is bad at to other systems, which makes it even easier to use smaller models. The focus on efficiency and performance from the Chinese companies has had huge practical gains.

Any US startup, or even large company, in the AI space can and should take advantage of this and use these models. The cost benefit compared to something like Azure or OpenAI is massive.

oezi 7 hours ago | parent | next [-]

The key insight is that these models are becoming obsolete so fast that it doesn't make sense to not publish them. If others use your tech and stack and build on top or refine your ideas then you benefit greatly.

Nobody is releasing their training scripts. Unfortunately.

pjmlp 8 hours ago | parent | prev [-]

This is a replay from having cryptography algorithms on the 1990's classified as military weapons went down, other international algorithms were eventually created.