The p-value is the probability of getting a result “at least this extreme” given the null hypothesis, where “extreme” means “deviating from the null hypothesis”, however that’s defined. So, the test cut the outcome space into pieces, the most extreme of which had at least a 5% chance of happening.
That part isn’t right, but the rest is.
So I should have said “for the nine outcomes they considered, each had at least a 5% chance of happening”?
The p-value is the probability of getting a result “at least this extreme” given the null hypothesis, where “extreme” means “deviating from the null hypothesis”, however that’s defined. So, the test cut the outcome space into pieces, the most extreme of which had at least a 5% chance of happening.
I think.
… under the null hypothesis. I actually forgot this detail when replying to komponisto.
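To make the “at least this extreme, under the null hypothesis” definition concrete, here is a minimal sketch. It assumes a setup that is not from this thread: a coin flipped `n` times, a null hypothesis that the coin is fair, and “extreme” defined as “at least as many heads as observed”. The function names are made up for illustration.

```python
from math import comb


def binom_pmf(k, n, p):
    """Probability of exactly k heads in n flips when P(heads) = p."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def p_value_upper(observed, n, p_null=0.5):
    """One-sided p-value: P(heads >= observed), computed under the null.

    "Extreme" here means "at least as many heads as observed". That is one
    possible definition, chosen for this sketch.
    """
    return sum(binom_pmf(k, n, p_null) for k in range(observed, n + 1))


if __name__ == "__main__":
    # Hypothetical data: 10 flips, 8 or 9 heads observed.
    print(p_value_upper(8, 10))  # 56/1024, a bit above 0.05
    print(p_value_upper(9, 10))  # 11/1024, about 0.011
```

Under these assumptions, 8 heads out of 10 has a p-value just above 5%, while 9 heads out of 10 falls well below it. In both cases the probability is computed assuming the null hypothesis is true, which is the detail noted above.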