| ▲ | bertili 7 hours ago |
| The "Deepseek moment" is just one year ago today! Coincidence or not, let's just marvel for a second over this amount of magic/technology that's being given away for free... and how liberating and different this is than OpenAI and others that were closed to "protect us all". |
|
| ▲ | motoboi 4 hours ago | parent | next [-] |
| What amazes me is why would someone spend millions to train this model and give it away for free. What is the business here? |
| |
| ▲ | whizzter 3 hours ago | parent | next [-] | | Chinese state that maybe sees open collaboration as the way to nullify any US lead in the field, concurrently if the next "search-winner" is built upon their model the Chinese worldview that Taiwan belongs to China and Tiamen Square massacre never happened. Also their license says that if you have a big product you need to promote them, remember how Google "gave away" site searche widgets and that was perhaps one of the major ways they gained recognition for being the search leader. OpenAI/NVidia is the Pets.com/Sun of our generation, insane valuations, stupid spend, expensive options, expensive hardware and so on. Sun hardware bought for 50k USD to run websites in 2000 are less capable than perhaps 5 dollar/month VPS's today? "Scaling to AGI/ASI" was always a fools errand, best case OpenAI should've squirreled away money to have a solid engineering department that could focus on algorithmic innovations but considering that Antrophic, Google and Chinese firms have caught up or surpassed them it seems they didn't. Once things blows up, those closed options that had somewhat sane/solid model research that handles things better will be left and a ton of new competitors running modern/cheaper hardware and just using models are building blocks. | | |
| ▲ | dev_l1x_be 2 hours ago | parent | next [-] | | > Taiwan belongs to China So they are on the same page as the UN and US? The One China policy refers to a United States policy of strategic ambiguity regarding Taiwan.[1] In a 1972 joint communiqué with the PRC, the United States "acknowledges that all Chinese on either side of the Taiwan Strait maintain there is but one China and that Taiwan is a part of China" and "does not challenge that position." https://en.wikipedia.org/wiki/One_China
https://en.wikipedia.org/wiki/Taiwan_and_the_United_Nations | | |
| ▲ | 9cb14c1ec0 31 minutes ago | parent [-] | | The One China policy is a fiction of foreign policy statecraft, designed to sideline the issue without having to actually deal with it. It is quite clear that apart from the official fiction there is a real policy that is not One China. This is made clear by the weapons sales to Taiwan that specifically calibrated to make a Chinese military action harder. |
| |
| ▲ | zozbot234 3 hours ago | parent | prev [-] | | > "Scaling to AGI/ASI" was always a fools errand Scaling depends on hardware, so cheaper hardware on a compute-per-watt basis only makes scaling easier. There is no clear definition of AGI/ASI but AI has already scaled to be quite useful. |
| |
| ▲ | tokioyoyo 2 hours ago | parent | prev | next [-] | | Moonshot’s (Kimi’s owner) investors are Alibaba/Tencent et al. Chinese market is stupidly competitive, and there’s a general attitude of “household name will take it all”. However getting there requires having a WeChat-esque user base, through one way or another. If it’s paid, there’ll be friction and it won’t work. Plus, it undermines a lot of other companies, which is a win for a lot of people. | |
| ▲ | Balinares 3 hours ago | parent | prev | next [-] | | Speculating: there are two connected businesses here, creating the models, and serving the models. Outside of a few moneyed outliers, no one is going to run this at home. So at worst opening this model allows mid-sized competitors to serve it to customers from their own infra -- which helps Kimi gain mindshare, particularly against the large incumbents who are definitely not going to be serving Kimi and so don't benefit from its openness. Given the shallowness of moats in the LLM market, optimizing for mindshare would not be the worst move. | |
| ▲ | WarmWash an hour ago | parent | prev | next [-] | | It's another state project funded at the discretion of the party. If you look at past state projects, profitability wasn't really considered much. They are notorious for a "Money hose until a diamond is found in the mountains of waste" | |
| ▲ | ggdG 3 hours ago | parent | prev | next [-] | | I think this fits into some "Commoditize The Complement" strategy. https://gwern.net/complement | |
| ▲ | testfrequency 4 hours ago | parent | prev | next [-] | | Curious to hear what “OpenAI” thinks the answer to this is | |
| ▲ | YetAnotherNick 4 hours ago | parent | prev [-] | | Hosting the model is cheaper per token, the more batched token you get. So they have big advantage here. |
|
|
| ▲ | jimmydoe 3 hours ago | parent | prev | next [-] |
| It’s not coincidence. Chinese companies tend to do big releases before Chinese new year. So expect more to come before Feb 17. |
|
| ▲ | PlatoIsADisease an hour ago | parent | prev | next [-] |
| I am convinced that was mostly just marketing. No one uses deepseek as far as I can tell. People are not running it locally. People choose GPT/Gemini/Claude/Grok if you are giving your data away anyway. My biggest source of my conspiracy is that I made a reddit thread asking a question: "Why all the deepseek hype" or something like that. And to this day, I get odd, 'pro deepseek' comments from accounts only used every few months. Its not like this was some highly upvoted topic that is in the 'Top'. I'd put that deepseek marketing on-par with an Apple marketing campaign. |
| |
| ▲ | logicprog 41 minutes ago | parent | next [-] | | I don't use DeepSeek, but I prefer Kimi and GLM to closed models for most of my work. | |
| ▲ | mekpro an hour ago | parent | prev [-] | | Except that, In OpenRouter, Deepseek always maintain in Top 10 Ranking. Although I did not use it personally, i believe that their main advantage over other model is price/performance. |
|
|
| ▲ | catigula an hour ago | parent | prev [-] |
| I mean, there are credible safety issues here. A Kimi fine-tune will absolutely be able to help people do cybersecurity related attacks - very good ones. In a few years, or less, biological attacks and other sorts of attacks will be plausible with the help of these agents. Chinese companies aren't humanitarian endeavors. |
| |