Remix.run Logo
shakow 15 hours ago

> every single other result is even less likely.

But the summed probability of the “not too far away results” is much higher, i.e. P([93, 107]\{100}) > P([100]).

So if you only shoot 100/100 with your coin, that's definitely weird.

tshaddox 14 hours ago | parent | next [-]

Okay, but it doesn't make sense to arbitrarily group together some results and compare the probability of getting any 1 result in that group to getting 1 particular result outside of that group.

You could just as easily say "you should be suspicious if you flip a coin 200 times and get exactly 93 heads, because it's far more likely to get between 99 and 187 heads."

wat10000 14 hours ago | parent [-]

It's suspicious when it lands on something that people might be biased towards.

For example, you take the top five cards, and you get a royal flush of diamonds in ascending order. In theory, this sequence is no more or less probable than any other sequence being taken from a randomly shuffled deck. But given that this sequence has special significance to people, there's a very good reason to think that this indicates that the deck is not randomly shuffled.

In theory terms, you can't just look at the probability of getting this result from a fair coin (or deck or whatever). You have to look at that probability, and the probability that the coin (deck etc.) is biased, and that a biased coin would produce the outcome you got.

If you flip a coin that feels and appears perfectly ordinary and you get exactly 100 heads and 100 tails, you should still be pretty confident that it's unbiased. If you ask somebody else to flip a coin 200 times, and you can't actually see them, and you know they're lazy, and they come back and report exactly 100/100, that's a good indicator they didn't do the flips.

tshaddox 10 hours ago | parent [-]

> It's suspicious when it lands on something that people might be biased towards.

Eh, this only makes sense if you're incorporating information about who set up the experiment in your statistical model. If you somehow knew that there's a 50% probability that you were given a fair coin and a 50% probability that you were given an unfair coin that lands on the opposite side of its previous flip 90% of the time, then yes, you could incorporate this sort of knowledge into your analysis of your single trial of 200 flips.

wat10000 8 hours ago | parent [-]

If you don’t have any notion of how likely the coin is to be biased or how it might be biased then you just can’t do the analysis at all.

tshaddox 2 hours ago | parent [-]

You can certainly do the frequentist analysis without any regard to the distribution of coins from which your coin was sampled. I’m not well studied on this stuff, but I believe the typical frequentist calculation would give the same results as the typical Bayesian analysis with a uniform prior distribution on “probability of each flip being heads.”

fainpul 15 hours ago | parent | prev | next [-]

> So if you only shoot 100/100 with your coin, that's definitely weird.

Not if you only try once.

shakow 15 hours ago | parent | next [-]

Even if you shoot only once, you still have a higher chance of hitting something slightly off the middle than the perfect 100/100. And this because that's one point-precise result (100/100) vs. a cumulated range of individually less-probable results, but more probable when taken as a whole.

For a fair coin, hitting 100/100 is ~5%, vs. ~30% falling in [97; 103] \ {100}. You can simulate here: https://www.omnicalculator.com/statistics/coin-flip-probabil...

tshaddox 14 hours ago | parent | next [-]

> you still have a higher chance of hitting something slightly off the middle than the perfect 100/100

That's because "something slightly off the middle" is a large group of possible results. Of course you can assemble a group of possible results that has a higher likelihood than a single result (even the most likely single result!). But you could make the same argument for any single result, including one of the results in your "slightly off the middle" group. Did you get 97 heads? Well you'd have a higher likelihood of getting between 98 and 103 heads. In fact, for any result you get, it would have been more likely to get some other result! :D

zdragnar 14 hours ago | parent [-]

> But you could make the same argument for any single result

Isn't that the point? The odds of getting the "most likely result" are lower than the odds of getting not the most likely result. Therefore, getting exactly 100/100 heads and tails would be unlikely!

tshaddox 13 hours ago | parent | next [-]

But as I said, getting any one specific result is less likely than getting another other possible result. And the disparity in likelihoods is greater for any one specific result other than the 50% split.

alwa 12 hours ago | parent | prev [-]

I think the disagreement is about what that unlikeliness implies. "Aha! You got any result? Clearly you're lying!"... I'm not sure how far that gets you.

There's probably a dorm-quality insight there about the supreme unlikeliness of being, though: out of all the possible universes, this one, etc...

zdragnar 10 hours ago | parent [-]

Let's look at the original quote:

> "Remember, if you flip a coin 200 times and it comes heads up exactly 100 times, the chances are the coin is actually unfair. You should expect to see something like 93 or 107 instead".

Inverting the statement makes it read something like this:

You are more likely to not get 100/100 than you are to get exactly 100/100

...which is exactly what I was saying. Nobody is arguing that there is a single value that might be more likely than 100/100. Rather, the argument is that a 100/100 result is suspiciously fair.

e12e 13 hours ago | parent | prev [-]

Should that be 25% for 97..193 excluding 100?

shakow 12 hours ago | parent [-]

“[97; 103] \ {100}” means the interval [97; 103] without the set {100}; so no, still ~30%.

kalaksi 15 hours ago | parent | prev [-]

I'm sorry, but try what once? 200 flips once?

dredmorbius 6 hours ago | parent [-]

In statistics, various examples (e.g., coin flips) often stand in for other activities which might prove expensive or infeasible to make repeated tries of.

For "coin flips", read: human lives, financial investments, scientific observations, historical observations (how many distinct historical analogues are available to you), dating (see, e.g., "the secretary problem" or similar optimal stopping / search bounding problems).

With sufficiently low-numbered trial phenomena, statistics gets weird. A classic example would be the anthropic principle: how is it that the Universe is so perfectly suited for human beings, a life-form which can contemplate why the Universe it so perfectly suited for it? Well, if the Universe were not so suited ... we wouldn't be here to ponder that question. The US judge Richard Posner made a similar observation in his book "Catastrophe: Risk and Response" tackles the common objection to doomsday predictions that all have so far proved false. But then, of all the worlds in which a mass extinction event has wiped out all life prior to the emergence of a technologically-advanced species, there would be no (indegenous) witnesses to the fact. We are only here to ponder that question because utter annihilation did not occur. As Posner writes:

By definition, all but the last doomsday prediction is false. Yet it does not follow, as many seem to think, that all doomsday predictions must be false; what follow is only that all such predictions but one are false.

-Richard A. Posner, Catastrophe: Risk and Response, p. 13.

<https://archive.org/details/catastropheriskr00posn/page/13/m...>

grraaaaahhh 10 hours ago | parent | prev [-]

>But the summed probability of the “not too far away results” is much higher, i.e. P([93, 107]\{100}) > P([100]).

That's true of every result. If you're using this to conclude you have a weird coin then every coin is weird.