wolttam 4 hours ago

If you have no need for Anthropic/OpenAI's frontier model capability, you may be better served with an open-weight model that can't be taken away.

Edit:

> GPT-5 does the job.

I bring up DeepSeek V4 Flash a lot on HN, but I want to mention that according to Artificial Analysis, it trades blows with GPT-5 (high) (from August, 2025) [0]

[0]: https://artificialanalysis.ai/models/comparisons/deepseek-v4...

lmf4lol an hour ago | parent | next [-]

We rolled out DeepSeek V4 Flash to our customers and it was an absolute disaster, unfortunately. It was not able to follow simple commands, always "forgot" to do things, lied consistently about its work, and so on. It was pretty good, though, on one-off work like summarizing something or executing simple commands, so we are now experimenting with using it for subagent work with clear instructions and handoff.

DeepSeek V4 Pro, on the other hand, is a really good main driver and we have had a lot of success using it. It's not Opus or GPT-5.5 level, but it's on its way. Kimi 2.6 as well, btw, so there is already quite some choice.

wolttam 34 minutes ago | parent [-]

I found Flash to be a bit shaky as well until I started using it in xhigh/max thinking effort, then it became my daily driver. It runs quite well on a couple of DGX Sparks.

I still wish it was a little better, but there's hope for another model checkpoint (maybe with some of GLM 5.2's goodness distilled into it, that would be nice).

RALaBarge 2 hours ago | parent | prev | next [-]

It’s my daily driver in opencode

paxys 4 hours ago | parent | prev | next [-]

Unless you are hosting it yourself on your own infrastructure it absolutely can be taken away.

atherton94027 4 hours ago | parent | next [-]

For all intents and purposes you'll be able to move an open weight model wherever you want.

I really dislike this rhetoric. You sound like the FSF guys who are like "you're not free until you're running coreboot with zero binary blobs". Sure, they have a point, but most people are fine running regular Linux.

salviati 2 hours ago | parent | next [-]

Reading your comment made me realize that I love that the position of the FSF is held by someone, in the interest of stretching the Overton Window to that side.

adrianN 4 hours ago | parent | prev | next [-]

Most FSF guys actually have very nuanced views on the topic and you’re doing everyone a disservice by reducing it to an extremist sound bite.

jjmarr an hour ago | parent | next [-]

That's literally the official FSF position.

https://www.fsf.org/resources/hw

> For example: the Free Software Foundation only purchases desktop machines which support Libreboot, and Thinkpad X200 and X60 laptops with Libreboot. All desktops and servers we buy are KGPE-D16 motherboards, which are supported by Libreboot. As a result, all of the workstations used by the FSF staff have a free BIOS.

https://www.gnu.org/distros/common-distros.html

> Except where noted, all of the distributions listed on this page fail to follow the guidelines in at least two important ways:

> ...The kernel that they distribute (in most cases, Linux) includes “blobs”: pieces of object code distributed without source, usually firmware to run some device.

They are extreme, uncompromising, and live by their principles.

They are also the reason you can actually buy a computer meeting those requirements, instead of it being a pipe dream.

ffsm8 3 hours ago | parent | prev | next [-]

Thankfully he didn't say that they're all like that. Instead, he pointed to the few that are as a well-known example of similar behavior.

If you reread the comment with a fresh mind, you'll notice that you misunderstood what he wrote.

citadel_melon 2 hours ago | parent [-]

When attacking archetypes of people, there is some responsibility to make clear who you’re attacking and why, even to someone who’s not being hyper-open-minded. At least if you want them to learn from you, which may or may not be your goal. When you attack, or signal that you’re on the offensive, it is foolish to believe that they won’t knee-jerk attack back and become at least a little closed-minded.

Regardless, the “misinterpretation” of the parent comment is actually a plausible interpretation. I suspend my judgement on what the actual “correct” interpretation of the original comment is: there are too many plausible interpretations to decide deductively. But I do know that since the first comment brought up a contentious issue, they should have put more work into crafting their message so there aren’t so many plausible, contradictory interpretations. Alternatively, they should have specified precisely who they were talking about, without a shadow of a doubt. That is, if the commenter cared to be properly interpreted, which may not be their goal. There are many reasonable reasons why it wouldn’t be.

morgoo 30 minutes ago | parent | next [-]

You used a lot of words to defend a strawman argument

verve_rat 31 minutes ago | parent | prev | next [-]

When you read someone's comment there is some responsibility to read the words they wrote and not attempt to attack them for an argument no reasonable person would extract from those words.

NamlchakKhandro 42 minutes ago | parent | prev [-]

Angry girlfriend SMS essay

charcircuit 3 hours ago | parent | prev [-]

It is the FSF itself who has these extremist views.

sauwan 4 hours ago | parent | prev [-]

Unless the US Gov bans inference companies from serving Chinese models to US customers...

tancop 4 hours ago | parent [-]

good luck doing it to inference companies in singapore or the netherlands. or one of the decentralized networks that dont look useful right now. the world is already sick of america acting like it can do whatever and force their rules on the rest of us.

GTP 4 hours ago | parent | prev | next [-]

Still, with the same model being served by multiple providers, it is much less likely to disappear entirely, even if you would like to keep using a cloud provider. Worst-case scenario, you change providers. Or you use OpenRouter as a proxy.
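A minimal sketch of the "change providers" idea above, assuming you keep an ordered list of hosts that serve the same open-weight model. The provider names and the `call` hook are hypothetical placeholders, not any real provider's API; in practice `call` would wrap whatever HTTP client you use.

```python
from typing import Callable, Sequence


class AllProvidersFailed(Exception):
    """Raised when every provider in the list has failed."""


def complete_with_fallback(
    providers: Sequence[str],
    prompt: str,
    call: Callable[[str, str], str],
) -> tuple[str, str]:
    """Try each provider in order; return (provider, reply) from the first that works.

    `call(provider, prompt)` is a hypothetical hook: it should send the prompt
    to that provider and raise an exception on any failure.
    """
    errors = {}
    for provider in providers:
        try:
            return provider, call(provider, prompt)
        except Exception as exc:  # broad on purpose: any failure means "try the next one"
            errors[provider] = exc
    raise AllProvidersFailed(f"no provider could serve the request: {errors}")


# Stand-in for a real HTTP client so the sketch runs offline.
# "provider-a" simulates a host that has dropped the model.
def fake_call(provider: str, prompt: str) -> str:
    if provider == "provider-a":
        raise ConnectionError("model no longer offered")
    return f"[{provider}] echo: {prompt}"


provider, reply = complete_with_fallback(["provider-a", "provider-b"], "hi", fake_call)
print(provider, reply)
```

A routing proxy does roughly this for you; the sketch only shows that the fallback logic is small when several hosts serve the same weights.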

dgellow 3 hours ago | parent | prev | next [-]

There is actual market competition to host open models. If one provider stops offering a model, you can likely find another provider that will.

theptip 3 hours ago | parent | prev | next [-]

No. As long as you downloaded the weights, you can run them somewhere.

amunozo 4 hours ago | parent | prev | next [-]

But you have multiple providers, not just one.

paxys 4 hours ago | parent | next [-]

And every single one of those providers would buckle under government pressure.

Fable itself is hosted on all major cloud providers. How many offer it today?

eli 3 hours ago | parent | next [-]

This seems a little fanciful.

There's really no comparison between a model that Anthropic allows Google and Amazon to host and one that has been downloaded hundreds of thousands of times and has dozens of public inference providers.

Art9681 an hour ago | parent [-]

I don't think they "allow" Google or Amazon to host them so much as Anthropic itself is deploying and managing their services on multiple cloud providers just like every other global scale business. Even the models served via OpenRouter are just being routed to compute under Anthropic control. Same with OpenAI. They aren't going to hand the world's most valuable intellectual property at the moment to some third party to run independently.

Now for the Chinese models on OpenRouter, yea. Those providers could be legit. Or it could be a failed crypto mining operation pivoting to providing AI compute. Who knows.

minimaxir 4 hours ago | parent | prev | next [-]

The providers on OpenRouter are not all in the US.

paxys 4 hours ago | parent [-]

That doesn’t mean they are immune to US laws. If they want to continue to operate in the largest market in the world they will fall in line.

And if you are a legit American business you aren’t going to illegally bypass import/export controls.

svachalek 4 hours ago | parent | prev [-]

More importantly, the download is out there. You can download it yourself today, and if it's that important to you, you can buy the hardware too.

cyanydeez 4 hours ago | parent | prev [-]

I'm sure he's referring to the tightening of internet controls around social media as an extrapolation to controlling websites, etc.

logicchains 4 hours ago | parent [-]

Even in that case it can't be taken away; GPT and Claude are banned in China yet there's still a huge black market for tokens.

supern0va 3 hours ago | parent | prev | next [-]

>Unless you're running Linux yourself, it can absolutely be taken away.

Zambyte 2 hours ago | parent [-]

Yes. The difference is obviously that full, fat Linux runs on a superset of anything a layperson would call a computer, and can be built from source on roughly the same set of hardware. The full, fat DeepSeek (as in the 1.6T model, unquantized) is too big to run on anything a layperson would call a computer, and actually building it is even harder.
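A rough back-of-the-envelope for the size claim above. The 1.6T parameter count is taken from the comment; the bytes-per-parameter values are assumptions, since the thread does not say what precision "unquantized" means here. This counts weights only and ignores KV cache and activations.

```python
def weights_size_tb(params: float, bytes_per_param: float) -> float:
    """Size of the raw weights in terabytes (1 TB = 1e12 bytes)."""
    return params * bytes_per_param / 1e12


PARAMS = 1.6e12  # "1.6T" from the comment above

# Assumed precisions, for illustration only.
for label, nbytes in [("16-bit", 2), ("8-bit", 1), ("4-bit", 0.5)]:
    print(f"{label}: {weights_size_tb(PARAMS, nbytes):.1f} TB")
```

Even at the assumed 4-bit precision, the weights alone come to roughly 0.8 TB, far more memory than a typical consumer machine has.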

supern0va 29 minutes ago | parent [-]

It's famously difficult to find people willing to rent you time on big computers over the internet.

GaggiX 4 hours ago | parent | prev [-]

Popular open models on OpenRouter have dozens of providers.

ai_fry_ur_brain 34 minutes ago | parent | prev [-]

DeepSeek V4 Flash is actually useless. Sorry, I tested it after seeing so many comments like these. On OpenRouter, when I tried to get it to output tool calls for creating tables, instead of providing the structured output correctly it sent me people's Dropbox links and other image-sharing site URLs that led to pictures of random tables...

LLMs seem to only impress a certain type of person. Hint: this type of person was also really excited about NFTs.