(no subject)
May. 22nd, 2020 10:07 amДобавление ко вчерашнему:

the data in Figure 2 shows a decrease in infection rates after countries eased national lockdowns with >99% statistical significance""

the data in Figure 2 shows a decrease in infection rates after countries eased national lockdowns with >99% statistical significance""
no subject
Date: 2020-05-26 04:30 pm (UTC)1. Compute from the observations the observed value t_obs of the test statistic T.
2. Calculate the p-value. This is the probability, under the null hypothesis, of sampling a test statistic at least as extreme as that which was observed.
3. Reject the null hypothesis, in favor of the alternative hypothesis, if and only if the p-value is less than (or equal to) the significance level (the selected probability) threshold
What exactly is null hypothesis? That we got our "good fit" by accident, right? No distributional assumptions are required.
Случайно наткнулся на экономиста, который решил написать серию заметок типа введения в статистику.
https://hroniki-paisano.livejournal.com/118867.html
https://hroniki-paisano.livejournal.com/119264.html
Интересно, что он тоже не заморачивается спецификацией нулевой гипотезы.
no subject
Date: 2020-05-26 08:42 pm (UTC)For example what is 99% in your original post refers to? I have no idea, but here is one possibility: assume that the point has equal probability to be above and below the line and assume that the points are sampled iid. Under this model the picture shown in the graph is very unlikely and we can reject this hypothesis. However, iid is an key assumption here. If these data are not iid, this conclusion in general cannot be made.
For example, imagine that the rates are perfectly correlated between the states. Then either all of them will be below the line or all of them will be above the line.
no subject
Date: 2020-05-26 09:57 pm (UTC)I thought your original complaint was that the model that we hope to reject based on low p-value is not stated.
no subject
Date: 2020-05-26 11:50 pm (UTC)no subject
Date: 2020-05-26 11:55 pm (UTC)99% is not the worst actually -- sometimes people have 99.9999% or something like that and you immediately suspect something is fishy.
no subject
Date: 2020-06-27 06:10 pm (UTC)no subject
Date: 2020-07-08 05:00 pm (UTC)You can take a look at a simple synthetic model with linear regression, where the features of this curve can be seen: https://arxiv.org/abs/1903.07571