gpm 3 hours ago

> Do you think that Chinese labs will continue to release open models forever

Yes.

I think the Chinese government either already has, or will soon, grasp that if they train the models that people use they dictate what people believe (at least around the margins where that's malleable), and they will happily throw resources at that.

And simultaneously that the only way they can actually get everyone to use their models is if it's possible for us to run them on our own hardware.

(This isn't exactly a utopian view of the future)

jychang 2 hours ago | parent | next [-]

This is going to age very poorly, given that the best Chinese labs have ALREADY started not open sourcing their models.

Qwen 3.7 is not open source; previous Qwen versions had open source releases, but Qwen 3.7 plus does not. The second best Chinese model, MiniMax M3, is testing the waters by taking longer and longer between “model release” and open sourcing it. This time, they waited 2 weeks after release before open sourcing it. There are also a lot of rumors that GLM and DeepSeek won't open source future models.

It’s pretty obvious that you cannot take open source Chinese models for granted; they’ll be closed source soon.

ls612 2 hours ago | parent | next [-]

The main reason the Chinese labs are releasing models as open weights is that they don't have the compute necessary to provide all of the inference. For the US frontier models, something like 80-90% of the lifetime compute required for the model is inference rather than training. China wants to shepherd as much of its limited compute as possible towards training to keep up in the race.

londons_explore an hour ago | parent [-]

With nearly everyone using inference accelerators, the pool of hardware is no longer shared between training and use.

nine_k 2 hours ago | parent | prev | next [-]

The US administration restricting the use of US-trained models is one of the best gifts it could give to the Chinese LLM producers, and to the PRC government.

dozerly 2 hours ago | parent [-]

This entire administration is a gift to everybody but the US. It’s either in service of Russia, China or whoever is willing to pay Trump the most.

rjzzleep an hour ago | parent [-]

The Chinese have a nickname for Trump: 川建国, "Trump the nation builder" (the nation being China). But Biden actually continued most of Trump's policies.

tw1984 3 hours ago | parent | prev [-]

> I think the Chinese government either already has, or will soon, grasp that if they train the models that people use they dictate what people believe (at least around the margins where that's malleable), and they will happily throw resources at that.

That doesn't require the model to be SOTA; it can be just a compact model capable of running on some inexpensive hardware. That is vastly different from SOTA models like Mythos, which can potentially disrupt lots of things.

strangegecko 2 hours ago | parent [-]

Of course it requires SOTA. People will always choose better models over some compact thing that is obviously more limited. You can't control the truth with models nobody wants to use.

columnarx3 an hour ago | parent | next [-]

People choose SOTA right now because of the heavily subsidised model subscriptions. People aren't going to pay 20x the price for a model that's maybe 10% better.

ezst 44 minutes ago | parent [-]

And the fact that "better" is highly subjective and domain/task/vibe-specific.

adrianN an hour ago | parent | prev | next [-]

Why do I want the model I use for coding to know Shakespeare or vice versa?

rjzzleep an hour ago | parent | prev [-]

Small models are the future.