This weekly roundup thread is intended for all culture war posts. 'Culture war' is vaguely defined, but it basically means controversial issues that fall along set tribal lines. Arguments over culture war issues generate a lot of heat and little light, and few deeply entrenched people ever change their minds. This thread is for voicing opinions and analyzing the state of the discussion while trying to optimize for light over heat.
Optimistically, we think that engaging with people you disagree with is worth your time, and so is being nice! Pessimistically, there are many dynamics that can lead discussions on Culture War topics to become unproductive. There's a human tendency to divide along tribal lines, praising your ingroup and vilifying your outgroup - and if you think you find it easy to criticize your ingroup, then it may be that your outgroup is not who you think it is. Extremists with opposing positions can feed off each other, highlighting each other's worst points to justify their own angry rhetoric, which becomes in turn a new example of bad behavior for the other side to highlight.
We would like to avoid these negative dynamics. Accordingly, we ask that you do not use this thread for waging the Culture War. Examples of waging the Culture War:
-
Shaming.
-
Attempting to 'build consensus' or enforce ideological conformity.
-
Making sweeping generalizations to vilify a group you dislike.
-
Recruiting for a cause.
-
Posting links that could be summarized as 'Boo outgroup!' Basically, if your content is 'Can you believe what Those People did this week?' then you should either refrain from posting, or do some very patient work to contextualize and/or steel-man the relevant viewpoint.
In general, you should argue to understand, not to win. This thread is not territory to be claimed by one group or another; indeed, the aim is to have many different viewpoints represented here. Thus, we also ask that you follow some guidelines:
-
Speak plainly. Avoid sarcasm and mockery. When disagreeing with someone, state your objections explicitly.
-
Be as precise and charitable as you can. Don't paraphrase unflatteringly.
-
Don't imply that someone said something they did not say, even if you think it follows from what they said.
-
Write like everyone is reading and you want them to be included in the discussion.
On an ad hoc basis, the mods will try to compile a list of the best posts/comments from the previous week, posted in Quality Contribution threads and archived at /r/TheThread. You may nominate a comment for this list by clicking on 'report' at the bottom of the post and typing 'Actually a quality contribution' as the report reason.
Jump in the discussion.
No email address required.
Notes -
By who? What is this supposed to mean?
With a few weeks of space between the initial marketing hype and observation, and Deepseek seems to be most notable for (a) claiming to have taken less money to develop (which is unclear given the nature of China subsidies), (b) being built off of other tech (which helps explain (a), and (c) being relatively cheap (which is partially explained by (a).
If someone feels it's inspired, okay- the vibe war for propaganda is what it is and anyone in a different set of contraints is liable to feel it's novel rather than just different- and it's not like it's impossible for good cinema to come out of a state censorship apparatus. But is 'Deepseek for cinema' supposed to imply 'Chinese government constraints, but cheaper'?
Man, you're really committed to the bit of an old spook who disdains inspecting the object level because you can't trust nuthin' out of Red Choyna. I could just go watch this guy or his pal instead.
It wouln't be an exaggeration to say that the technical ML community has acknowledged DeepSeek as the most impressive lab currently active, pound for pound. What subsidies, there is nothing to subsidize, except perhaps sharashkas with hidden geniuses.
More options
Context Copy link
Deepseek is by far most notable for being state of the art reasoner model, out of China, released openly, with credible promises of more such releases to follow.
Secondary would be culture/vibe contrast to US labs, amount of cope and panic generated, widespread positive reaction and adoption in China, insight into actual costs and margins on US side, proof smaller players absolutely should not take American AI lab people at their word when they say not to bother trying to compete.
More options
Context Copy link
It implies - a discipline that the west was thought to dominate, but PRC China created a premier alternative out of nowhere.
Black Myth Wukong & Xiaomi SU7 are other examples.
The 'deepseek' moment is twitter sheeps trying to find a contrived analogy. But it gets the point across.
Deep Seek had its mainstream moment last month. But, it has been on the radar since early 2024 and the release of Deep Seek coder. While China has been competitive in the LLM space, their major players are massive organizations like Alibaba, Huawei or BAAI. Deep Seek was special because it reflects a cultural appetite for ambitious risk taking paired with technical excellence, all within a small upstart. It's a combination that has been missing from non-US startups, and for a long time was credited for China's inability to execute moonshots.
That aside, you're right. It is more that people are suddenly noticing what were well-funded efforts that were a long time coming. The outcomes themselves weren't sudden at all.
I generally agree with you. I deleted the above post because I realized that you were characterizing a view rather than advancing a claim, which upon realizing my mistake made my skepticism a non-helpful response that might have come off more aggressive than intended.
I have my doubts on the characterization of Deep Seek, since to me its corporate history reads less like a moonshot and more like a planned / choreographed emergence, but we agree on the general point that the change is people suddenly noticing pre-existing efforts.
Ayy no worries. I'm so used to people on TheMotte being skeptical, that the bar for aggressiveness is usually high.
The rate at which they're producing papers has me convinced that DeepSeek is operating at a much much larger scale than than the west realizes. (not taking anything away from them ofc. Deepmind is much larger, and seems to barely be keeping up with Deepseek)
Just today they released a sparse attention paper that makes some pretty bonkers claims - https://x.com/deepseek_ai/status/1891745487071609327
What gives them away is the especially mature way in which they write papers. There is maturity that only comes from writing a lot of them. These are the best ML PhDs China has. Not some trading geniuses who picked up LLMs on the side.
Conspiracy theories around DeepSeek are pretty funny, people twist themselves into pretzels to not acknowledge the most parsimonious hypothesis. Because it feels too wild, I guess. Maybe too scary as well, because it suggests that China can birth like a hundred more such companies if it finds a hundred such CEOs. I collect these stories. They've used all of Singapore's compute! They pay $1.3M to top researchers! It's a «choreographed emergence» to deceive the oh-so-important Dean (but he knows better than to trust ChiComs)! The scale is much bigger, there are hidden disciples in cloistered cultivation! It's all so very creative.
In any case I have the direct opposite impression about papers. They are kids, overwhelmingly under 30 and often 20, and they write very naturally and not academically. It's just raw intelligence and curiosity, not experience. It is known that roughly everyone at DeepSeek speaks fluent English – not normal for Chinese labs; they pay extreme attention to culture fit and aptitude in recruiting, and are severely ageist. Many core innovations come from undergrad interns; the first author on the NSA paper is an intern with anime pfp too. We have reports from competitors' employees who had rejected their offer because they perceived the company as too small and weak for the declared ambition. I don't know how to break it to you, but there's no Iron Curtain, things are fairly transparent.
Maybe. I'm not sure if DeepSeek has even 50 Ph.Ds though, and ByteDance has thousands.
If your hypothesis is correct, they will not significantly accelerate, now that they're acknowledged as national champions and are offered OOMs more resources than they had before. I think they will.
I see them as the Chinese equivalent to Mistral. They came out of no where, but they definitely weren't upstarts. Post that, they had sufficient resources that it couldn't be called a 'rag tag group of nobodies'. For a while, the media portrayed mistral as a bunch of nobodies fighting against the giants. That characterization was just as incorrect.
Same for Deepseek. None of this takes away from their achievement. They are all cracked, no doubt.
More options
Context Copy link
More options
Context Copy link
More options
Context Copy link
More options
Context Copy link
More options
Context Copy link
More options
Context Copy link
More options
Context Copy link