Be advised: this thread is not for serious in-depth discussion of weighty topics (we have a link for that), this thread is not for anything Culture War related. This thread is for Fun. You got jokes? Share 'em. You got silly questions? Ask 'em.
- 114
- 1
What is this place?
This website is a place for people who want to move past shady thinking and test their ideas in a
court of people who don't all share the same biases. Our goal is to
optimize for light, not heat; this is a group effort, and all commentators are asked to do their part.
The weekly Culture War threads host the most
controversial topics and are the most visible aspect of The Motte. However, many other topics are
appropriate here. We encourage people to post anything related to science, politics, or philosophy;
if in doubt, post!
Check out The Vault for an archive of old quality posts.
You are encouraged to crosspost these elsewhere.
Why are you called The Motte?
A motte is a stone keep on a raised earthwork common in early medieval fortifications. More pertinently,
it's an element in a rhetorical move called a "Motte-and-Bailey",
originally identified by
philosopher Nicholas Shackel. It describes the tendency in discourse for people to move from a controversial
but high value claim to a defensible but less exciting one upon any resistance to the former. He likens
this to the medieval fortification, where a desirable land (the bailey) is abandoned when in danger for
the more easily defended motte. In Shackel's words, "The Motte represents the defensible but undesired
propositions to which one retreats when hard pressed."
On The Motte, always attempt to remain inside your defensible territory, even if you are not being pressed.
New post guidelines
If you're posting something that isn't related to the culture war, we encourage you to post a thread for it.
A submission statement is highly appreciated, but isn't necessary for text posts or links to largely-text posts
such as blogs or news articles; if we're unsure of the value of your post, we might remove it until you add a
submission statement. A submission statement is required for non-text sources (videos, podcasts, images).
Culture war posts go in the culture war thread; all links must either include a submission statement or
significant commentary. Bare links without those will be removed.
If in doubt, please post it!
Rules
- Courtesy
- Content
- Engagement
- When disagreeing with someone, state your objections explicitly.
- Proactively provide evidence in proportion to how partisan and inflammatory your claim might be.
- Accept temporary bans as a time-out, and don't attempt to rejoin the conversation until it's lifted.
- Don't attempt to build consensus or enforce ideological conformity.
- Write like everyone is reading and you want them to be included in the discussion.
- The Wildcard Rule
- The Metarule
Jump in the discussion.
No email address required.
Notes -
Question for the software engineers:
Is there anything uniquely innovative or difficult to reproduce about the software/codebase for any of the big social media platforms (particularly Twitter/Facebook/Instagram/Reddit/TikTok/Youtube) or is their hold on the market mostly a result of network effects and their established large user bases?
Edit: Having clarified my thoughts after early responses, I think the core of what I want to understand is this: I know that there a many very intelligent people being paid handsomely as software engineers for these sites. Given the apparent simplicity and minimal improvement in the basic functions (from a user perspective) of many of these sites, what is it that these engineers are actually being paid to work on? Aside from server reliability, what other things do they need all these bigbrains for?
I think asking the question with the word "need" is likely to lead to confusion. Instead, note that as long as the marginal benefit of adding one more developer is larger than the amount it costs to do so, they will keep on hiring, and so the key is to look at what those marginal developers are doing.
Large organizations have non-obvious advantages of scale. This can combine with the advantages of scale that companies have to produce surprising results.
Let's say you have a company with a billion users and a revenue model with net revenue of $0.25 / user / year, and only 50 employees (like a spherical-cow version of WhatsApp in 2015). Let's further say that it costs $250,000 / year to hire someone.
The questions that you will be asking include
At a billion $0.25 / year users, and let's say $250k / year to hire a person, that person would only have to do one of
Realistically you will instead try to do 100+x each of these with teams of 100+ people, and keep hiring as long as those teams keep wanting more people. But those are the sorts of places to look.
More options
Context Copy link
Making a UI clone for Twitter should be not hard. Same for reddit, though moderation and customization functions may require some more work. Making full clone - with whatever ads, analytics, system functions, metrics, etc. exist and not visible to the public may be more complicated. Making it work reliably at scale Twitter works at may be a serious project for a serious qualified team, though it's definitely nothing impossible, just needs investment. Reddit I'd say the same with more investment since there are more options, but bare bones clones of both, especially if they don't need ads/analytics and billion scale, would probably not be too hard.
Facebook is a bit tougher due to a myriad of privacy settings and modes which may require some non-trivial approaches to data retrieval, and then there's whatever filthy black magic that underlies their feed algorithm... Plus it has streaming video, which is its own big can of worms, with which I personally never worked but heard it has a lot of dark magic into it too. Instagram/Youtube are also based on that, so the same applies there.
On the market - definitely. Even if they had some super awesome technologies, it would likely be possible to reproduce the same results maybe with slightly higher costs and slightly less awesome performance, and for social media network effects beat technology any time of the day. Don't get me wrong - you need a lot of technology to run code at the scale of Facebook, and a lot of it goes not only to the site itself but to support the organization that supports it and makes money from it, but code superiority has nothing to do with their success. In fact, I have seen very successful projects (not ones you named, but also famous names) where the code and the technology behind it are very subpar, but as long as it works and brings in the sweet dollars...
You can make a toy twitter in a weekend. Taking it from a toy to billion-users business with billions of revenue is the hard part.
Probably a couple of dozens of other things I forgot to mention.
More options
Context Copy link
To put it in perspective, I was able to put together a decent Yelp clone within 8 hours. I had a webserver framework and Adderall but that'll give you an idea of what a ~3x engineer can do. With AI that's probably gotten better.
More options
Context Copy link
"Describe how you would implement a Twitter clone" is a fairly standard and easy interview question a senior software engineer should be able to answer to a reasaonable amount of detail. (The same question about Ticketmaster is significantly harder which would surprise most people outside the industry.)
Ticketmaster is way harder of course due to heterogeneity of the underlying data.
More options
Context Copy link
More options
Context Copy link
Network effects, mostly.
The last novel model was TikTok's: short vertical videos fed to you by an algorithm forever. Everyone replicated it as quickly as possible, but couldn't defeat TikTok.
Twitter is another good example: no amount of money spent by FAANGs helped them build a viable clone. Bluesky is thriving purely on network effects.
The engineers are paid to lower operating costs and improve engagement with the ads. Social networks are some of the biggest datasets in the world, and people expect them to work for free and 24x7: every dollar you spend on running them is coming out of your ad revenue. At this scale it makes sense to do things like develop your own compression algorithm for data and get the major browsers to support it to lower your traffic costs by 1%. Or to hire the author of the programming language your software is written in and to give him a team to improve its performance.
More options
Context Copy link
Hardest part to replicate is probably the server reliability because that takes lots of intricate work and the AI-driven systems (mostly recommendation / advertising) because you need data.
This matters different amounts for different companies. But I would say that network effects are a far bigger hurdle; the above is just sauce.
So in comparing, say, this site to Reddit, there's probably some complex code for managing the orders of magnitude greater traffic that themotte just doesn't worry about? Or are you mainly referring to baseline server reliability?
@lagrangian covered some of it: the fault tolerance you need as your system scales up. At that scale, freak incidents happen everyday. I still remember the chaos in my office when Google services dropped for a few hours.
Consider also the kind of bugs you start to get when you have users worldwide, all expecting to use their own language and writing system, and expecting UI and help to be available in that language.
Then moderation. If you’re building an up and coming social media, sooner or later someone is going to livestream a beheading or use it to send plausible death threats, and you’re going to be forced to deal with that.
Of course, most startups fail, so these are problems you want to have. But still problems.
"A webform I can paste content into for others to see? Guess I'll programmatically post enormous amounts of child pornography into it."
Scott had a point about witches overrunning communities. He was right. The devil and his followers notice your website and have endless amounts of suffering children to show everyone.
More options
Context Copy link
More options
Context Copy link
Right. Zorba pays for the site out of pocket, but that is not scalable. The site occasionally goes down - we even lost most of a day of posts not too long ago. That's no big deal at our scale - just ssh in, figure out the bug, deploy something manually, etc.
But at e.g. Google scale, it's $500k/minute of gross revenue on the line in an outage, to say nothing of customers leaving you permanently over the headache. Fractions of a percent of optimization are worth big bucks. Compliance headaches are real. Hardware failures are a continual certainty.
Read about the brilliance behind Spanner, the database a huge amount of Google is built on: their own internet cables, because why choose between C[onsistency] and A[vailability] when you can just not have network P[artitions]?
You need an incredible degree of fault tolerance in large systems. If n pieces in a pipeline each have a probability p of working, the system has p^n - exponential decay.
Plenty of it is feature bloat, that said. You really can serve an astonishing amount of traffic from a single cheap server.
I don't have a good sense of scale—how much would you expect running this site costs per month?
Off the top of my head, $50.
Fermi estimate:
Multiply the following:
Comparing to dev tools, which shows 5.3MB, a factor of 10 I can't account for.
Let's assume we want to serve a peak traffic of 10x average and don't care enough to set something that automatically scales up and down:
This is... jack shit.
Google cloud charges for egress:
(127 * 4 - 200) * 0.11 = $34
.Worst case:
@ZorbaTHut, how'd I do? And would you be willing to share what % of costs you've had donated, vs paid out of pocket? You really shouldn't have to be paying yourself.
That was all without chatgpt, but here's a transcript from my talking to it afterwards. I think it looks reasonable until maybe the end when I ask about vpn costs. Still comes out to ~$50. It did a decent job analyzing the amount of cpu used (which I skipped in the "jack shit" section).
People upload images to the Motte, could that account for the difference between the 400kb and 5mb?
I think some variety of "I'm an idiot" is more likely. I don't see any images. If they're included as hyperlinks, they're not loaded until you click (I think, and ~confirmed by dev tools)
Attached a screenshot of the resource usage breakdown. Largest element is 402kb for the banner, compared to 215kb (fifth place) for what I think is the actual comments (compare to my 414kb estimate - not bad).
Some of the overestimate is from my extensions. Filtering those out, I see 2.0MB (2.3 uncompressed). 1.15 of that is fonts (unclear to me why that needs reloading each time - presumably this could be optimized out.)
/images/17366984362281015.webp
More options
Context Copy link
More options
Context Copy link
More options
Context Copy link
Probably something on the order of $20-40 a month. Depends how fancy it is set up, and how much traffic we get.
More options
Context Copy link
More options
Context Copy link
More options
Context Copy link
More options
Context Copy link
More options
Context Copy link
More options
Context Copy link