site banner

Culture War Roundup for the week of October 14, 2024

This weekly roundup thread is intended for all culture war posts. 'Culture war' is vaguely defined, but it basically means controversial issues that fall along set tribal lines. Arguments over culture war issues generate a lot of heat and little light, and few deeply entrenched people ever change their minds. This thread is for voicing opinions and analyzing the state of the discussion while trying to optimize for light over heat.

Optimistically, we think that engaging with people you disagree with is worth your time, and so is being nice! Pessimistically, there are many dynamics that can lead discussions on Culture War topics to become unproductive. There's a human tendency to divide along tribal lines, praising your ingroup and vilifying your outgroup - and if you think you find it easy to criticize your ingroup, then it may be that your outgroup is not who you think it is. Extremists with opposing positions can feed off each other, highlighting each other's worst points to justify their own angry rhetoric, which becomes in turn a new example of bad behavior for the other side to highlight.

We would like to avoid these negative dynamics. Accordingly, we ask that you do not use this thread for waging the Culture War. Examples of waging the Culture War:

  • Shaming.

  • Attempting to 'build consensus' or enforce ideological conformity.

  • Making sweeping generalizations to vilify a group you dislike.

  • Recruiting for a cause.

  • Posting links that could be summarized as 'Boo outgroup!' Basically, if your content is 'Can you believe what Those People did this week?' then you should either refrain from posting, or do some very patient work to contextualize and/or steel-man the relevant viewpoint.

In general, you should argue to understand, not to win. This thread is not territory to be claimed by one group or another; indeed, the aim is to have many different viewpoints represented here. Thus, we also ask that you follow some guidelines:

  • Speak plainly. Avoid sarcasm and mockery. When disagreeing with someone, state your objections explicitly.

  • Be as precise and charitable as you can. Don't paraphrase unflatteringly.

  • Don't imply that someone said something they did not say, even if you think it follows from what they said.

  • Write like everyone is reading and you want them to be included in the discussion.

On an ad hoc basis, the mods will try to compile a list of the best posts/comments from the previous week, posted in Quality Contribution threads and archived at /r/TheThread. You may nominate a comment for this list by clicking on 'report' at the bottom of the post and typing 'Actually a quality contribution' as the report reason.

3
Jump in the discussion.

No email address required.

It shouldn't be "X% Trump, Y% Kamala," it should be "X% Trump, Y% Kamala, Z% irreducible uncertainty."

What would this irreducible uncertainty mean for an event with a binary outcome? I think Silver already accounts for increasing uncertainty as he propagates his current prediction into the future (what he calls forecast vs. nowcast).

Error bars would make sense around the expected vote percentage. Of course the probability distribution over vote percentages becomes broader as you look into the future, and perhaps he does show that to paying customers. But in the end you still have to integrate over that when the layman asks for the probabilities of who wins the election. And that still amounts to two numbers that sum to 100%.

Evaluating a predictor's performance seems straightforward to me via the usual log-likelihood score. Record the final outcome and take the log of the predictor's probability for that outcome. That score can then be summed over multiple different elections, if you like. (Not sure though if I'd call that scoring rule particularly frequentist.)

But the outcome ISN'T really binary, is it?

Biden dropped out, Trump could have been killed by that bullet, and then we'd have a whole new ball game. The "Trump vs. Biden" model almost certainly didn't include a variable for "the Candidate abruptly drops out" and I doubt assassination risk was plugged in either.

And the fact that it tries to 'call' an election months out but has to adjust radically to new info is why I call it 'gimmicky.'

Taleb had his own discussion of this a while back, and this is the best summary of it I've found.

https://towardsdatascience.com/why-you-should-care-about-the-nate-silver-vs-nassim-taleb-twitter-war-a581dce1f5fc

But the outcome ISN'T really binary, is it?

Sure, in this trivial sense it is not.

I've heard this same argument from other Taleb-readers and got the impression that "irreducible uncertainty" means something entirely different than just a "none of the above" outcome.

If it is just that, then the same arguments apply: When you make the final probabilistic prediction, you integrate over the uncertainty, resulting in three numbers that add up to 100%. After recording the actual outcome, take the log of your predicted probability for this outcome, and that's your performance score.

Of course, if Silver rounds down the probability of "none of the above" to 0% for convenience and it still occurs, he'd technically incur a score of -Inf, which should tank his credibility forever. But I find that a boring technicality.

From your linked article:

Because FiveThirtyEight only predicts probabilities, they do not ever take an absolute stand on an outcome: No ‘skin in the game’ as Taleb would say.

Taleb is wrong here. Under this standard, no reasonable predictor could survive any real world application, as it would have to be trashed after the first mistake. And those that do survive would be hopelessly overfitted to past data.

For standard models, like logistic regression, the default decision boundary is assumed to be 50% (or 0.5 on a 0 to 1 scale) or the alternative with the highest value. [...] If FiveThirtyEight has no stated decision boundary, it can be difficult to know how good their model actually is.

The article got it precisely the wrong way round. Classical models such as logistic regression are trained and evaluated using their full probabilistic predictions.

It is only thresholded to a deterministic choice when used as input to a human decision, where the prediction is appropriately weighted for costs of false positives vs false negatives etc. (which you cannot do if it were a deterministic prediction in the first place).

And the fact that it tries to 'call' an election months out but has to adjust radically to new info is why I call it 'gimmicky.'

How's that bad? I'd call that perfectly rational behaviour.

How's that bad? I'd call that perfectly rational behaviour.

Perfectly rational behavior would probably be saying "I don't think I can accurately predict outcomes this far in advance."

So they add in the caveat "if the election were held today here is what the model says about the odds."

But the election isn't being held today. They know that, the audience also knows that but will still read the model.

Without a gimmick they have nothing to sell.