The approach I suggest is that you can model standard biases like p-hacking via shrinkage, and you can treat extremely discrete systematic biases like fraud or methodological errors (such as confounding, which is universal among all studies) as a mixture model, where the mixture components correspond to the different discrete states. This lets you model the ‘flip-flop’ behavior of a single key node without going full Pearl DAG.
So for example, if I have a survey I think is fraudulent—possibly just plain made up in a spreadsheet—and a much smaller survey which I trust but which has large sampling error, I can express this as a mixture model, and I will get a bimodal distribution over the estimate, with a small diffuse peak and a big sharp peak, which corresponds to roughly “here’s what you get if the big one is fake, and here’s what you get if it’s real and pooled with the other one”. If you can get more gold data, that further updates the switching parameter; and if the small surveys keep disagreeing with the big one, at some point the probability of it being fake will approach 1 and it’ll stop visibly affecting the posterior distribution, because it’ll just always be assigned to the ‘fake’ component and not affect the posteriors of interest (for the real components).
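A minimal sketch of that mixture, with entirely made-up numbers (all the data and the 50% fake-prior here are hypothetical, and a flat prior over the true value is assumed for simplicity): if the big survey is fake, its likelihood is flat in the true value (it carries no information), so the posterior is just the small survey's diffuse peak; if it's real, the two pool into a sharp peak. Mixing the two gives the bimodal posterior, and Bayes' rule gives the posterior probability of fakery:

```python
import numpy as np

# Hypothetical data: a big, suspiciously precise survey vs. a small trusted one.
big_mean, big_se = 0.50, 0.01      # possibly fabricated
small_mean, small_se = 0.20, 0.10  # trusted, but large sampling error
p_fake = 0.5                       # prior probability the big survey is fake

def normal_pdf(x, mu, sd):
    return np.exp(-0.5 * ((x - mu) / sd) ** 2) / (sd * np.sqrt(2 * np.pi))

# Grid over the true value mu, flat prior.
mu = np.linspace(-0.5, 1.0, 3001)
dmu = mu[1] - mu[0]

# Fake component: big survey contributes nothing, only the small survey counts.
lik_fake = normal_pdf(small_mean, mu, small_se)
# Real component: pool both surveys.
lik_real = normal_pdf(small_mean, mu, small_se) * normal_pdf(big_mean, mu, big_se)

# Mixture posterior over mu: diffuse peak near 0.2, sharp peak near 0.5.
post = p_fake * lik_fake + (1 - p_fake) * lik_real
post /= post.sum() * dmu

# Marginal likelihood of each component updates the switching parameter.
m_fake = lik_fake.sum() * dmu
m_real = lik_real.sum() * dmu
post_fake = p_fake * m_fake / (p_fake * m_fake + (1 - p_fake) * m_real)
print(f"P(big survey fake | data) = {post_fake:.3f}")
```

Because the two surveys disagree by about 3 of the small survey's standard errors, the switching parameter already updates sharply toward ‘fake’, and the posterior mass concentrates on the small survey's diffuse peak.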
You can take this approach with confounding too. A confounded study will not simply exaggerate the effect size by some X%: it can deliver arbitrarily different, even opposite-signed, estimates, and no matter how many confounded studies you combine, they will never be the causal estimate—and they may all agree with each other very precisely if they are collecting data confounded the same way. So if you have an RCT which contradicts all your correlational cohort results, you’re in the same situation as with the two surveys.
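The point that precision never buys validity can be demonstrated in a few lines (the effect sizes and bias here are invented for illustration): pooling ever more cohort studies that share the same confounder shrinks the standard error toward zero, but around the wrong value.

```python
import numpy as np

rng = np.random.default_rng(0)

true_causal_effect = 0.0  # hypothetical ground truth (what an RCT estimates)
confounding_bias = 0.3    # shared bias from a common confounder

# Pool increasingly many confounded cohort studies: the sampling error
# shrinks, but the pooled estimate converges on the *biased* value.
for n in (10, 100, 10000):
    estimates = true_causal_effect + confounding_bias + rng.normal(0, 0.05, n)
    pooled = estimates.mean()
    print(f"n={n:6d}  pooled estimate = {pooled:.3f}")
# The pooled estimate approaches 0.3, never 0.0: the confounded studies
# agree with each other very precisely, and are all precisely wrong.
```

This is why one RCT contradicting a tight correlational consensus plays the same role as the small trusted survey contradicting the big suspect one: it updates the switching parameter, not the pooled precision.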
Just to restate your approach for a non-mathematician: you’re proposing not to do any information-flow analysis, but instead to detect automatically the cases where an information input, like the opinion poll, is not adding any useful information. And you name one way to do this.
Fair enough, but the problem is that if you do an information-flow analysis—“does any causal mechanism exist by which this source could provide information?”—you can skip the faulty information with 100% probability, whereas under your proposed approach sheer chance can still show a correlation.